⬢ Machine Learning Scientist  ·  Ph.D.

Mithun Ghosh

Meta  ·  Walmart Labs  ·  Deep Learning · LLMs · Scalable ML Systems

Building production ML at scale — from Transformer-based forecasting and Foundation Model fine-tuning to recommendation systems driving revenue across billions of ad auctions.

0
Years Industry Exp.
0%
Inference Time ↓
0%
MAPE Reduced
0%
Revenue Lifted
0%
Model Accuracy
0
Publications
Scroll

ML Scientist.
Industry & Research.

I'm a Machine Learning Scientist with a Ph.D. from the University of Arizona, with 6+ years of industry experience at Meta and Walmart Labs building and shipping large-scale ML systems.

My work spans deep learning model development, Foundation Model fine-tuning, Transformer-based time series forecasting, auction-based recommendation systems, and operational ML — handling billions of data points daily in production.

I bridge the gap between cutting-edge research and real business impact: cutting inference costs, lifting revenue, and accelerating model iteration across some of the world's largest technology companies.

Deep Learning Foundation Models Time Series RecSys Bayesian ML Optimization Computer Vision NLP / BERT Anomaly Detection
XGBoost — Proposal Stage Prediction85%
Bayesian GP — Anomaly Detection92%
Transformer — MAPE Reduction30%
Few-Shot LLM — Inference Speedup60%
DeepFM — Revenue Uplift15%
AI Tool Productivity Lift25%

Tools & Technologies

A full-spectrum ML toolkit — from model development to cloud deployment and big-data pipelines.

🧠

Machine Learning

Supervised LearningClassification RegressionEnsemble Methods Bayesian ModelingFeature Engineering A/B Experimentation

Deep Learning

PyTorchTensorFlow Neural NetworksTransformers BERTDeepFM Fine-Tuning
🎯

Reinforcement Learning

Policy OptimizationReward Modeling Bandit AlgorithmsSequential Decision-Making Exploration-ExploitationOptimization
👁️

Computer Vision

CNN ArchitecturesImage Classification Feature ExtractionObject Detection Medical ImagingAnomaly Detection
📈

Time Series Forecasting

Demand ForecastingFoundation Models Transformer-based TSAnomaly Detection Few-Shot ForecastingStatistical Methods
💬

NLP & Language

BERT Fine-TuningText Classification Sentiment AnalysisDocument Understanding Few-Shot LearningEmbedding Models
☁️

MLOps & Infrastructure

PySparkGCP AWSVertex AI Feature StoresModel Monitoring Distributed Training
🐍

Languages & Tools

PythonR JavaSQL HiveBigQuery

Professional Experience

Production ML systems serving millions of users at FAANG and top-tier tech.

Meta
Senior Data Scientist
Feb 2025 – Present
📍 Menlo Park, CA
  • Forecasted adoption metrics for AI tools using ML and statistical models, driving productivity up by +25%.
  • Built an XGBoost model to predict proposal stage progression to production with 85% accuracy.
  • Designed methodology for ML model ROI estimation and operational cost savings to set and track ROI progress across AI initiatives.
Walmart Labs
Senior Data Scientist
May 2022 – Feb 2025
📍 Sunnyvale, CA
  • Reduced MAPE by 30% using Transformer-based models for time series demand forecasting.
  • Applied few-shot learning on Foundation Models (TinyTimeMixture, Lag-Llama), cutting inference time by 60%.
  • Improved revenue by +15% by deploying DeepFM for auction-based recommendation systems.
  • Built and deployed a dynamic floor pricing optimization framework, boosting ad display by +5%.
Lumen Technologies
Data Science Intern
Jun – Aug 2021
📍 Broomfield, CO
  • Developed scalable forecasting tools to optimize headcount allocation across business units.
  • Automated SQL + Power BI pipelines for workforce analytics and product line reporting.
University of Arizona
Graduate Research Assistant
Aug 2017 – May 2022
📍 Tucson, AZ
  • Built Bayesian Gaussian Process model for anomaly detection achieving 92% accuracy.
  • Applied CNNs for X-ray image classification, accelerating medical image triage.
  • Developed NLP-based sentiment analysis model with +9% improvement over SOTA benchmarks.

Featured Projects

Industry ML projects from Meta and Walmart Labs spanning forecasting, recommendation systems, inference optimization, and MLOps.

Time SeriesForecastingWalmart Labs

Transformer-based Demand Forecasting System

Built a Transformer-based time series forecasting model for multi-horizon demand forecasting across product categories. Leveraged foundation model architectures with few-shot adaptation, achieving a 30% reduction in MAPE versus previous LSTM baselines and cutting inventory costs across Walmart's supply chain.

Internal Project
Deep LearningWalmart LabsRecommendation

DeepFM Auction-based Recommendation Engine

Designed and deployed a DeepFM-based recommendation model for auction-style ad placement, combining factorization machines with deep neural networks to capture high-order feature interactions. End-to-end production deployment drove a 15% revenue uplift in sponsored product placements.

Internal Project
MLOpsWalmart LabsInference

Foundation Model Inference Optimization

Led end-to-end optimization of large foundation model inference pipelines using quantization, model distillation, and batching strategies on GCP. Reduced inference latency by 60% while maintaining model accuracy, enabling real-time serving at scale for recommendation workloads.

Internal Project
OptimizationWalmart LabsAd Tech

Dynamic Floor Pricing Optimization

Developed a dynamic floor pricing system for programmatic advertising using reinforcement learning and bandit algorithms to adaptively set minimum bid prices per auction. Increased ad display win rate by +5% while maximizing revenue per impression across demand partners.

Internal Project
MLOpsMetaMonitoring

ML Model ROI & Health Monitoring Framework

Built a comprehensive ML observability platform at Meta to track model health, feature drift, and business ROI across production models. Automated alerting pipelines integrated with Airflow and internal dashboards, enabling proactive model retraining and reducing incident response time for ML degradations.

Internal Project
ForecastingMetaGenAI

AI Tool Adoption Forecasting

Designed and deployed a forecasting system to predict AI tool adoption and productivity impact across engineering teams at Meta. Combined time series models with causal inference to isolate tool-driven productivity gains, influencing 25% improvement in developer productivity and informing AI investment strategy.

Internal Project

Publications

Peer-reviewed research in applied ML, Bayesian methods, and industrial engineering.

Academic Background

🎓
Doctor of Philosophy
Systems & Industrial Engineering
The University of Arizona · Tucson, AZ
May 2022
🏛️
Bachelor of Engineering
Industrial & Production Engineering
Bangladesh University of Engineering & Technology
September 2015

Get In Touch

Open to senior MLE roles, applied ML research collaborations, and speaking on ML at scale.