Kshitiz Regmi
Machine Learning Engineer · Generative AI · Large Language Models · Production ML Systems
I am a Machine Learning Engineer with 6+ years of experience building and deploying production AI systems, from large language models and RAG to recommender engines and ML infrastructure at scale. Most recently, I architected the TIME AI platform for TIME Magazine, a customer-facing generative AI product serving over 15 million requests per month at under 100ms latency. I am currently an MS Computer Science candidate at Georgia State University.

About
Designing and deploying production AI systems.
I have spent the last 6 years building machine learning systems that run in production. Most recently, I architected the TIME AI platform for TIME Magazine, a customer-facing generative AI product serving over 15 million requests per month at under 100ms latency.
I developed vector search pipelines that reached 89% Recall@5 on proprietary benchmarks, and recommender systems that outperformed Google Vertex AI in production A/B tests, improving click-through rate by 1.23%. My work also spans retrieval-augmented generation with inline citations, LLM fine-tuning, semantic retrieval, multi-agent systems on AWS Bedrock, and production ML infrastructure.
I also built and owned the MLOps infrastructure behind these systems, including model serving, monitoring, data validation, scheduled retraining, and latency optimization across GCP and AWS.
I am currently pursuing my MS in Computer Science at Georgia State University, where I have a GPA of 4.2 on a 4.3 scale. I work as a Graduate Research Assistant in the ALSER Lab, where I apply machine learning to large-scale metagenomic data systems.
I am looking to join a team building reliable, customer-facing AI products that solve real problems and operate at production scale.
Atlanta, Georgia. Open to relocation and remote.
Currently
Open to workMS Computer Science
Georgia State University · GPA 4.2/4.3
Graduate Research Assistant
ALSER Lab · Synthetic metagenomics & genomics
Seeking Data Science, AI/ML Engineer roles
Full-time · Available 2026
Education
Academic background
M.S. in Computer Science
Expected 04/2027Georgia State University
Atlanta, GA · GPA: 4.2/4.3. Coursework: Distributed AI, Advanced Deep Learning, Digital Image Processing, Advanced Machine Learning, Introduction to Deep Learning.
Experience
Research & professional experience
Graduate Research Assistant — ALSER Lab
05/2026 — PresentGeorgia State University · Atlanta, GA
- Developing a scalable framework for generating synthetic metagenomic FASTQ replicas for benchmarking analysis pipelines.
- Optimizing large-scale genomic data processing on Arctic Server HPC for reproducibility, scalability, and performance.
Graduate Teaching Assistant
08/2025 — 04/2026Georgia State University · Atlanta, GA
- Taught and supported Python programming, SQL, Pandas, NumPy, SciPy, Matplotlib, Seaborn, data analysis, visualization, and debugging.
Machine Learning Engineer
03/2020 — 08/2025Fusemachines Inc. · Kathmandu, Nepal (Client: TIME Magazine)
- Designed the TIME AI platform ML architecture, owned design decisions and product roadmap, and communicated findings to stakeholders.
- Led development and deployment of the TIME AI beta — increased user engagement by 9%.
- Developed a production-grade conversational RAG system using embeddings retrieval, chunking, reranking, and vector DB; generated answers with citations.
- Implemented multi-turn conversation handling with context tracking and follow-up support using Datastore.
- Performed statistical data analysis on 200K articles to define RAG chunking strategy.
- Benchmarked retrieval engines: Vertex AI Feature Store vs OpenSearch, Pinecone, and FAISS — 89% Recall@5 on a 12K+ proprietary benchmark.
- Engineered a shared embeddings and indexing service so RAG, semantic search, recommender, and Document ChatAI reused the same pipeline; eliminated duplicate engineering effort.
- Developed Document ChatAI to help editors ask questions and find information across printed articles.
- Designed, developed, and deployed DeepDive topic extraction and topic-based recommendations for real-time content discovery.
- Developed an unsupervised user segmentation model with clustering on behavioral data; visualized embeddings via TensorFlow Embedding Projector.
- Developed the email marketing engine achieving a 43% unique open rate — won the INMA Global Media Award 2023.
- Developed and deployed a recommendation engine outperforming Google Vertex AI Recommendations — 1.23% CTR lift in production A/B testing.
- Optimized the recommendation engine API to <100ms at 15M+ requests/month with autoscaling and load balancing.
- Deployed ML services on Cloud Run with latency/error monitoring, data validation, and scheduled retraining for drift.
- Led cross-team collaboration with Google Cloud engineers, data engineers, product leads, and stakeholders; contributed to roadmaps, OKRs, and architecture docs.
- Implemented a multi-step multi-agent system on AWS Bedrock for fraud analytics, converting natural language to SQL.
- Developed an LLM-powered agent mapping client data into a common data model via dbt SQL on Trino over Iceberg tables.
Data Scientist
08/2022 — 09/2024Broadway Infosys · Kathmandu, Nepal
- Led Data Science and Machine Learning training for 500+ students — SQL, EDA, data visualization, feature engineering, and model evaluation.
- Mentored students on statistical hypothesis testing, hyperparameter tuning, and error analysis.
- Implemented MLOps pipelines with Git, Docker, and MLflow for experiment tracking and reproducibility.
- Deployed ML models as REST APIs with FastAPI and interactive frontends with Streamlit.
Artificial Intelligence Engineer Intern
04/2019 — 09/2019OYA INC · Kathmandu, Nepal
- Engineered an ETL pipeline processing 80K+ user-item records to generate model-ready signals for a collaborative filtering recommendation engine.
- Implemented data preprocessing, data cleaning, and model training and deployment pipelines.
- Deployed real-time inference APIs using Flask, moving the system from prototype to production.
Publications
Papers & research
K. Regmi, et al.
Working paper · 2026
Selected work
Projects
Novel federated conformal prediction method that mitigates hallucinations in federated fine-tuned vision-language models via abstention.
Reduced hallucination rate 13.05% → 3.79% (71% relative drop); useful-answer rate 76.63%; Precision@Commit 95.29%.
Attention-based forecasting of magnetic field trajectories combined with supervised contrastive learning for solar flare classification.
TSS = 0.75 on flare classification; outperformed persistence baseline on 60-step delta forecasts.
Imbalanced medical image classification using MobileViT v2 with dual-weighted learning — inverse square-root sampling + ENS loss — and Grad-CAM heatmaps for clinical interpretability.
0.92 weighted F1 on an imbalanced skin lesion dataset.
Custom Temporal Attention mechanism in PyTorch for market regime prediction, with a differentiable Sharpe Ratio Loss that optimizes directly for risk-adjusted returns.
15% improvement over MSE baselines in backtesting; Sharpe Ratio Loss outperforms standard regression objectives.
A framework for generating synthetic Q&A datasets using Gemini AI to benchmark semantic search models across diverse domains — eliminating the need for manual evaluation set curation.
Automated, domain-agnostic benchmarking framework for semantic search evaluation.
Open-source semantic image search supporting text→image, image→image, and text+image→image queries on the Myntra Fashion Product Dataset.
Real-time multi-modal retrieval across text and image modalities on a large fashion catalog.
Currency detection and classification for Nepali banknotes using InceptionV3 transfer learning, achieving 94% accuracy across 7+ denominations.
94% accuracy on 7+ Nepali banknote classes. Updated in 2023 with additional denominations.
Skills & tools
Technical toolkit
GenAI, LLMs & Agentic AI
ML & Deep Learning
MLOps, Cloud & Serving
Vector Search & Retrieval
Data Science & Engineering
Certifications
Credentials & training
- Introduction to AI and Machine Learning on Google CloudGoogle Cloud ·
- Machine Learning Engineering for Production (MLOps)Coursera ·
- TensorFlow Developer CertificationGoogle ·
- Sequences, Time Series and Predictiondeeplearning.ai ·
- NLP in TensorFlowdeeplearning.ai ·
- AI for Medical Prognosisdeeplearning.ai ·
- Data Science in Stratified Healthcare and Precision MedicineUniversity of Edinburgh ·
Service & awards
Mentorship, judging & awards
TreeHacks 2026 (Stanford University) — Mentored teams on agent development, MCP server setup, MVP scoping, and deployment best practices.
Georgia State Undergraduate Research Conference — Judge: evaluated research presentations and provided constructive feedback.
NFTE StartUp Tech Showcase (Tucker Middle School, DeKalb County) — Judge: evaluated student startup pitches on creativity, business viability, and presentation.
INMA Global Media Award 2023 — TIME email marketing engine (43% unique open rate).
Speaking & Teaching
Talks, mentorship & courses
Speaking engagements
Career in Data Science
Speaker · Nepal
Spoke on career pathways in data science — covering skills, industry trends, and practical advice for breaking into the field.
Application and Impact of AI in Nepal
Speaker · Nepal
Discussed the current landscape of AI adoption in Nepal, real-world applications, and the opportunities and challenges ahead.
Building Blocks of GenAI — RAG and VectorDB using AWS Bedrock
Speaker · Nepal
Technical talk on generative AI fundamentals — retrieval-augmented generation, vector databases, and practical implementation using AWS Bedrock.
Building an AI Ecosystem in Nepal
Himalayan Computational Linguistic AI Olympiad — Kathmandu University · Everest Engineering College, Kathmandu
Discussed AI education as a foundation for Nepal's technological growth, and how collaboration across government, universities, and industry can build a sustainable AI ecosystem.
Data Science Approach to Customer Segmentation
Guest Lecture — Presidential Graduate School · Kathmandu
Guest lecture on applying data science and clustering techniques to customer segmentation for marketing and product strategy.
AI Conference for Prosperous Nepal
Ministry of Education, Science and Technology · Kathmandu
Participated as a guest in the national AI conference organized by Nepal's Ministry of Education, Science and Technology.
Fighting Misinformation and Disinformation using AI
KTM Journathon — NIMJN · Park Valley Resort, Kathmandu
Mentored multidisciplinary teams of journalists and technologists over 2 days to build AI-powered solutions against misinformation.
Friend or Foe: Decoding AI's Dual Nature
ICT Meetup V7.0 — Prime College · Kathmandu
Panel discussion examining AI's multifaceted character — exploring beneficial applications alongside potential risks, biases, and ethical concerns.
AI Adoption in Nepalese Industries
Panel Discussion — Kathmandu University · Kathmandu University
Panel discussion on the state of AI adoption across industries in Nepal, barriers to adoption, and strategies for accelerating progress.
CRM Using Statistical Modeling & Machine Learning
Guest Lecture — Ace Institute of Management · Kathmandu
Presented how CRM systems apply statistical modeling and ML to strengthen customer relationships, reduce churn, and optimize marketing effectiveness.
Revealing the Boundless Horizons of AI
Panel Discussion — Kathmandu University · Kathmandu University
Panelist exploring the broad horizons of AI — from current capabilities to frontier research and long-term societal implications.
Teaching & training
Resource Person & Trainer
Government2023 – 2024Government of Nepal — Ministry of Forest and Environment (FRTC) · Kathmandu, Nepal
Trained Nepalese government officers and secretaries across four courses: Data Analysis using Python (Aug 2023), Statistical Analysis (Sept–Oct 2023), Advanced Data Analysis (Aug 2024), and Advanced Machine Learning (Oct 2024).
AI/ML Bootcamp Instructor
VolunteerJan 2024 (1 week)Institute of Engineering, Purwanchal Campus — Tribhuvan University · Dharan, Nepal
Conducted an intensive 1-week AI/ML bootcamp for engineering students covering data science, statistics, and ML with emphasis on hands-on learning and practical applications.
Senior Python, Data Science & AI/ML Trainer
Professional2022 – 2024Broadway Infosys · Tinkune, Kathmandu
Led professional training for 500+ students across Python, data science, and AI/ML — covering SQL, EDA, feature engineering, model evaluation, and MLOps basics.
Volunteer Instructor — Python for Research
VolunteerSept 2022 – Sept 2023Nepal Research and Collaboration Center (NRCC) · Kathmandu, Nepal
Volunteered and taught a one-month free "Python for Research" course and a week-long "Research Training Program" — helping students and researchers of different backgrounds use ML effectively in their projects.
Writing
From the blog
Jul 6, 2024 · 3 min read
Mean Can Lie — Discover the Real Insights with Mean and Standard Deviation
The mean alone can be misleading. Learn how standard deviation, normal distributions, the empirical rule, and z-scores reveal the true story hidden in your data.
Read →Mar 11, 2023 · 3 min read
NER-Powered Semantic Search Engine
Building a semantic search system enhanced with Named Entity Recognition using Pinecone, BERT-NER, and sentence transformers on 50,000 Medium articles.
Read →Mar 10, 2022 · 3 min read
XGBoost Hyperparameter Tuning — XGBRegressor with Scikit-Learn Pipelines
End-to-end XGBoost regression on the NASA airfoil noise dataset using scikit-learn Pipelines, ColumnTransformer, and RandomizedSearchCV for hyperparameter tuning.
Read →Contact
Get in touch
The fastest way to reach me is email. I'm open to ML / AI Engineer roles, PhD opportunities, research collaborations, and technical conversations.
Elsewhere