- Probabilistic Modeling & Latent Space Geometry: Architected a custom von Mises-Fisher Mixture Model (vMFMM) via Expectation-Maximization to cluster 128-dimensional hyperspherical embeddings from ArcFace/CCE losses — 98% Top-4 Macro recall, 85% Top-1 Macro, a 1–5% improvement over Euclidean GMM baselines across 6 architectures on 500K+ images.
- Data Pipeline & Incremental Learning: Engineered a 200-day continuous learning simulation benchmarking 6 ML architectures (XGBoost, SVM, Random Forest, etc.); demonstrated peak accuracy within a 50-day window, directly informing deployment timelines and data collection strategy.
- Hyperparameter Ablation: Executed ablation studies across 380K+ training samples; statistically validated a 5-component vMFMM as optimal for long-tail imbalanced class distributions.
- Robustness & OOD Testing: Stress-tested the vision pipeline with targeted label corruption and Gaussian feature noise; validated zero-shot OOD generalization under sensor degradation conditions.
Hassen Said
AI/ML Research Engineer
Experience
- Datalogic USA, Inc. Feb2025 - Aug2025ML Research Intern
- Squared Tech Solutions Aug2023 - Apr2024Technical Project Coordinator
- ERP Implementation: Led end-to-end ERP system selection and deployment for an NGO, managing vendor negotiations and coordinating across technical and non-technical stakeholders to deliver on schedule.
- Digital Presence: Designed and built a fully responsive website and optimised backend administration, driving measurable improvements in digital engagement and user experience.
- Stakeholder Communication: Translated complex technical requirements for non-technical decision-makers and negotiated with external vendors to secure optimal software solutions.
- Training & Documentation: Designed and delivered technical training programmes for non-technical staff; authored documentation that accelerated ERP adoption and reduced ongoing support requests.
- Freelance Oct2021 - May2023Web Developer
- Full-Stack Development: Delivered end-to-end web applications for multiple clients using JavaScript, React, Node.js, and Java — managing the full project lifecycle from requirements gathering through deployment.
Projects
-
Full-stack GraphRAG system combining a Neo4j property graph and Qdrant vector store to answer structural and semantic queries over Python codebases.
-
Multi-label ICD coding for clinical radiology reports using BioClinical-ModernBERT and Matryoshka Representation Learning — adaptive embeddings across dimensions without retraining.
-
Custom von Mises-Fisher Mixture Model replacing Euclidean GMM in a production edge vision pipeline — 98% Top-4 Macro recall across 500K+ images from 6 encoder architectures.
Education
- University of Bologna 2023 — 2026MSc in Artificial Intelligence · Bologna, Italy
- Kalinga Institute of Industrial Technology 2021 — 2023MSc in Computer Science · India
- I.K. Gujral Punjab Technical University 2018 — 2021Bachelor's in Computer Applications · India
Skills
AI / ML
Representation Learning · Probabilistic Modelling · Contrastive Learning · Metric Learning · NLP · LLM Integration · GraphRAG · RAG · Computer Vision · Mixture Models · Continual Learning
Frameworks
PyTorch · HuggingFace Transformers · LangGraph · LangChain · TensorFlow · OpenCV · scikit-learn
Infrastructure
Docker · FastAPI · Neo4j · Qdrant · GNU/Linux · Git · Slurm
MLOps
Weights & Biases · MLflow · pytest · GitLab CI
Programming
Python · JavaScript · TypeScript · Java · C++ (basic)
Languages
English (Proficient) · Arabic (Native) · Amharic (Native) · Italian (Basic)
