Research Implementation
Annealed Sinkhorn for Optimal Transport
Reproduced the convergence, regularization path, and debiasing results from Lénaïc Chizat (2024) using OTT-JAX and packaged the workflow in Google Colab for peers.
Delivered annotated notebooks and benchmarks validating annealed Sinkhorn behavior across datasets.
Python · JAX · OTT-JAX · Colab
VINCI · Jan–Apr 2025
Agnostic LLM Retriever
Developing a core retriever for LLM applications that is robust enough to handle generic use cases efficiently, yet agnostic enough in design to be easily customized for specific ones. The goal is a Python information-retrieval package that any LLM use case can draw on.
Built a flexible retriever module that can be easily integrated into RAG pipelines, improving retrieval relevance and reducing hallucinations across diverse applications.
Python · LangChain · HuggingFace · LLM
Data Science Sprint
Education Investment Ranking
Constructed composite indicators to score countries on education investment attractiveness, blending macro trends with education KPIs.
Produced a ranked list of countries with actionable insights for policymakers and investors, highlighting key drivers of education investment potential.
Python · PCA · EDA · Visualization
Collège de France · 2022–2023
Math Performance Gap Study
Analyzed DEPP Premier Degré panel data to uncover determinants of mathematics performance gaps among French students.
Identified key socioeconomic and pedagogical factors to inform policy recommendations.
R · Regression · EDA · Policy Analysis