I build ML systems from scratch — transformers, C++ neural engines, computer vision pipelines, advanced analytics platforms, and production trading infrastructure. MIT MicroMasters candidate. Published mathematician. Passionate about deep learning, quantitative research, and high-impact engineering.
I'm a junior at Soka University of America studying Computer Science, Economics, and Mathematics, concurrently completing an MIT MicroMasters in Statistics & Data Science.
I build ML systems from the ground up — custom C++ neural network engines, transformers from scratch, computer vision pipelines with C++ inference, and NLP systems. My live trading bot runs across 22 instruments on AWS. My derivatives platform serves 250+ Ghanaian farmers. I was awarded a $25,000 research grant for a cybersecurity cluster at Bletchley Park, Royal Holloway, Bloomberg, and the Bank of England.
My current focus: deep learning research engineering — understanding every layer of the stack from CUDA kernels to production APIs. Open to high-impact research and engineering roles where rigorous systems meet real-world scale.
Full neural network library built from scratch in C++17 — zero ML framework dependencies. Custom matrix engine, backpropagation, Adam/SGD optimizers, BatchNorm, Dropout. Achieves 97.8% on MNIST. SIMD-optimized matrix multiply benchmarks included.
Production-grade Bird's-Eye-View 3D object detection pipeline. Implements Lift-Splat-Shoot (ECCV 2020) with ResNet-50 + FPN backbone and anchor-free CenterPoint detection head. Gaussian Focal Loss, mixed precision training, KITTI evaluation.
Research-faithful NeRF (Mildenhall et al., ECCV 2020) in pure PyTorch — no NeRF libraries. Positional encoding, hierarchical coarse-fine sampling, volume rendering integral. Achieves 31+ dB PSNR on NeRF-Blender matching the paper.
SVR with advanced feature engineering and Bayesian hyperparameter optimisation achieving R² 0.788, outperforming Linear Regression (0.61), Decision Tree (0.68), and Random Forest (0.74). Pipeline includes log-transform, interaction terms, and geographic clustering.
Machine learning model predicting soccer match outcomes using team ELO ratings, recent form, home advantage, head-to-head records, and player availability features. Ensemble of XGBoost and Logistic Regression with calibrated probabilities.
PySpark ETL on 1M+ taxi records — schema-enforced reads, window function analytics, caching benchmarks. Delta Lake module: ACID upserts via merge, time travel queries. Partitioned Parquet output with ~60% faster downstream reads.
Complete options pricing and delta-hedging engine — Black-Scholes analytical solution, full Greeks (Delta, Gamma, Theta, Vega, Rho), implied volatility surface construction via Newton-Raphson IV solver, daily P&L attribution, and replication portfolio tracking.
Functional limit order book engine in pure OCaml — price-time priority matching with BidMap/AskMap balanced BSTs using custom comparators (highest bid first, lowest ask first). Supports limit orders, market orders, partial fills, multi-level sweeps, and cancellations. Interactive REPL with live depth display.
OCaml library of probability puzzles and quantitative finance models — every puzzle returns (analytical, MC) pairs that converge to <0.5%. Prices options via Black-Scholes, CRR binomial trees (500 steps), and Monte Carlo (100k paths). Full Greeks via finite differences, implied vol via Newton-Raphson, and Avellaneda-Stoikov optimal market-making vs naive fixed-spread.
End-to-end quant risk system on S&P 500 sector ETFs — Historical & Cornish-Fisher VaR, CVaR/Expected Shortfall, 10K GBM Monte Carlo simulation, rolling Sharpe/Sortino/Calmar, macro regime analysis (VIX, yield curve). Advanced PostgreSQL window functions + Python modelling.
Full analytics pipeline on Olist Brazilian E-Commerce (100K+ orders, 8 tables) — RFM segmentation, cohort retention, CLV estimation, churn prediction (Logistic Regression, AUC 0.84), K-Means clustering. Raw SQL to ML to interactive dashboard. Champions drive 34% of GMV.
SVR with feature engineering and Bayesian optimization achieving R² 0.788, outperforming LR (0.61), DT (0.68), RF (0.74).
Proved AC² = AD² + AB² in convex quadrilateral. Officially recognized and published.
View Publication →Convergence properties of generalized Fibonacci sequences. Novel proof via contraction mapping.
View Publication →Looking for opportunities in ML Engineering, Deep Learning Research, Computer Vision, Data Analytics, and Quantitative Finance at top-tier companies. Let's build something great.