Open to full-time Data Science, AI/ML Engineer roles · 2026
Mentor · TreeHacks 2026 · Stanford University

Kshitiz Regmi

Machine Learning Engineer · Generative AI · Large Language Models · Production ML Systems

I am a Machine Learning Engineer with 6+ years of experience building and deploying production AI systems, from large language models and RAG to recommender engines and ML infrastructure at scale. Most recently, I architected the TIME AI platform for TIME Magazine, a customer-facing generative AI product serving over 15 million requests per month at under 100ms latency. I am currently an MS Computer Science candidate at Georgia State University.

Portrait of Kshitiz Regmi (placeholder — replace src/assets/portrait.jpg with your photo)

About

Designing and deploying production AI systems.

I have spent the last 6 years building machine learning systems that run in production. Most recently, I architected the TIME AI platform for TIME Magazine, a customer-facing generative AI product serving over 15 million requests per month at under 100ms latency.

I developed vector search pipelines that reached 89% Recall@5 on proprietary benchmarks, and recommender systems that outperformed Google Vertex AI in production A/B tests, improving click-through rate by 1.23%. My work also spans retrieval-augmented generation with inline citations, LLM fine-tuning, semantic retrieval, multi-agent systems on AWS Bedrock, and production ML infrastructure.

I also built and owned the MLOps infrastructure behind these systems, including model serving, monitoring, data validation, scheduled retraining, and latency optimization across GCP and AWS.

I am currently pursuing my MS in Computer Science at Georgia State University, where I have a GPA of 4.2 on a 4.3 scale. I work as a Graduate Research Assistant in the ALSER Lab, where I apply machine learning to large-scale metagenomic data systems.

I am looking to join a team building reliable, customer-facing AI products that solve real problems and operate at production scale.

Atlanta, Georgia. Open to relocation and remote.

Currently

Open to work

MS Computer Science

Georgia State University · GPA 4.2/4.3

Graduate Research Assistant

ALSER Lab · Synthetic metagenomics & genomics

Seeking Data Science, AI/ML Engineer roles

Full-time · Available 2026

Education

Academic background

  1. M.S. in Computer Science

    Expected 04/2027

    Georgia State University

    Atlanta, GA · GPA: 4.2/4.3. Coursework: Distributed AI, Advanced Deep Learning, Digital Image Processing, Advanced Machine Learning, Introduction to Deep Learning.

Experience

Research & professional experience

  1. Graduate Research Assistant — ALSER Lab

    05/2026 — Present

    Georgia State University · Atlanta, GA

    • Developing a scalable framework for generating synthetic metagenomic FASTQ replicas for benchmarking analysis pipelines.
    • Optimizing large-scale genomic data processing on Arctic Server HPC for reproducibility, scalability, and performance.
  2. Graduate Teaching Assistant

    08/2025 — 04/2026

    Georgia State University · Atlanta, GA

    • Taught and supported Python programming, SQL, Pandas, NumPy, SciPy, Matplotlib, Seaborn, data analysis, visualization, and debugging.
  3. Machine Learning Engineer

    03/2020 — 08/2025

    Fusemachines Inc. · Kathmandu, Nepal (Client: TIME Magazine)

    • Designed the TIME AI platform ML architecture, owned design decisions and product roadmap, and communicated findings to stakeholders.
    • Led development and deployment of the TIME AI beta — increased user engagement by 9%.
    • Developed a production-grade conversational RAG system using embeddings retrieval, chunking, reranking, and vector DB; generated answers with citations.
    • Implemented multi-turn conversation handling with context tracking and follow-up support using Datastore.
    • Performed statistical data analysis on 200K articles to define RAG chunking strategy.
    • Benchmarked retrieval engines: Vertex AI Feature Store vs OpenSearch, Pinecone, and FAISS — 89% Recall@5 on a 12K+ proprietary benchmark.
    • Engineered a shared embeddings and indexing service so RAG, semantic search, recommender, and Document ChatAI reused the same pipeline; eliminated duplicate engineering effort.
    • Developed Document ChatAI to help editors ask questions and find information across printed articles.
    • Designed, developed, and deployed DeepDive topic extraction and topic-based recommendations for real-time content discovery.
    • Developed an unsupervised user segmentation model with clustering on behavioral data; visualized embeddings via TensorFlow Embedding Projector.
    • Developed the email marketing engine achieving a 43% unique open rate — won the INMA Global Media Award 2023.
    • Developed and deployed a recommendation engine outperforming Google Vertex AI Recommendations — 1.23% CTR lift in production A/B testing.
    • Optimized the recommendation engine API to <100ms at 15M+ requests/month with autoscaling and load balancing.
    • Deployed ML services on Cloud Run with latency/error monitoring, data validation, and scheduled retraining for drift.
    • Led cross-team collaboration with Google Cloud engineers, data engineers, product leads, and stakeholders; contributed to roadmaps, OKRs, and architecture docs.
    • Implemented a multi-step multi-agent system on AWS Bedrock for fraud analytics, converting natural language to SQL.
    • Developed an LLM-powered agent mapping client data into a common data model via dbt SQL on Trino over Iceberg tables.
  4. Data Scientist

    08/2022 — 09/2024

    Broadway Infosys · Kathmandu, Nepal

    • Led Data Science and Machine Learning training for 500+ students — SQL, EDA, data visualization, feature engineering, and model evaluation.
    • Mentored students on statistical hypothesis testing, hyperparameter tuning, and error analysis.
    • Implemented MLOps pipelines with Git, Docker, and MLflow for experiment tracking and reproducibility.
    • Deployed ML models as REST APIs with FastAPI and interactive frontends with Streamlit.
  5. Artificial Intelligence Engineer Intern

    04/2019 — 09/2019

    OYA INC · Kathmandu, Nepal

    • Engineered an ETL pipeline processing 80K+ user-item records to generate model-ready signals for a collaborative filtering recommendation engine.
    • Implemented data preprocessing, data cleaning, and model training and deployment pipelines.
    • Deployed real-time inference APIs using Flask, moving the system from prototype to production.

Selected work

Projects

Fed-BLEND — Federated Conformal Prediction for VLMs

Novel federated conformal prediction method that mitigates hallucinations in federated fine-tuned vision-language models via abstention.

Reduced hallucination rate 13.05% → 3.79% (71% relative drop); useful-answer rate 76.63%; Precision@Commit 95.29%.

Python
PyTorch
LoRA
Qwen2.5-VL
Federated Learning
Conformal Prediction
Case study
GCTAF — Time-Series Forecasting & Flare Risk Classification

Attention-based forecasting of magnetic field trajectories combined with supervised contrastive learning for solar flare classification.

TSS = 0.75 on flare classification; outperformed persistence baseline on 60-step delta forecasts.

Python
PyTorch
Attention
Contrastive Learning
Time-Series
Case study
Skin Lesion Classification via Dual-Weighted Learning

Imbalanced medical image classification using MobileViT v2 with dual-weighted learning — inverse square-root sampling + ENS loss — and Grad-CAM heatmaps for clinical interpretability.

0.92 weighted F1 on an imbalanced skin lesion dataset.

Python
PyTorch
MobileViT v2
Computer Vision
Grad-CAM
Case study
Hull Tactical Market Prediction with Temporal Attention

Custom Temporal Attention mechanism in PyTorch for market regime prediction, with a differentiable Sharpe Ratio Loss that optimizes directly for risk-adjusted returns.

15% improvement over MSE baselines in backtesting; Sharpe Ratio Loss outperforms standard regression objectives.

Python
PyTorch
Temporal Attention
Time-Series
Quantitative Finance
Case study
Semantic Search Benchmarking via Synthetic Data Generation

A framework for generating synthetic Q&A datasets using Gemini AI to benchmark semantic search models across diverse domains — eliminating the need for manual evaluation set curation.

Automated, domain-agnostic benchmarking framework for semantic search evaluation.

Python
Gemini AI
Semantic Search
Synthetic Data
Benchmarking
Case study
Multi-Modal Semantic Image Search Engine

Open-source semantic image search supporting text→image, image→image, and text+image→image queries on the Myntra Fashion Product Dataset.

Real-time multi-modal retrieval across text and image modalities on a large fashion catalog.

Python
CLIP
FAISS
Multimodal Embeddings
Streamlit
Case study
Nepali Cash Detection and Recognition

Currency detection and classification for Nepali banknotes using InceptionV3 transfer learning, achieving 94% accuracy across 7+ denominations.

94% accuracy on 7+ Nepali banknote classes. Updated in 2023 with additional denominations.

Python
TensorFlow
InceptionV3
Transfer Learning
CNN
Case study

Skills & tools

Technical toolkit

GenAI, LLMs & Agentic AI

MCP
Agentic Systems
Multi-Agent
Tool Calling
LLM Fine-Tuning
LoRA
PEFT
RAG
LLM Evaluation
LangChain
Hugging Face
Gemini
AWS Bedrock

ML & Deep Learning

PyTorch
TensorFlow
Scikit-Learn
Transformers
CNN
LSTM
Attention
Recommender Systems
Time-Series
Model Optimization

MLOps, Cloud & Serving

GCP
Vertex AI
Cloud Run
BigQuery
AWS
SageMaker
EC2
MLflow
Docker
FastAPI
CI/CD
GitHub Actions
Drift Monitoring

Vector Search & Retrieval

Vertex AI Feature Store
AWS OpenSearch
Qdrant
Pinecone
FAISS
Semantic Search

Data Science & Engineering

Python
SQL
PostgreSQL
Pandas
NumPy
dbt
Apache Iceberg
Trino
EDA
A/B Testing
Statistical Analysis

Certifications

Credentials & training

  • Introduction to AI and Machine Learning on Google CloudGoogle Cloud ·
  • Machine Learning Engineering for Production (MLOps)Coursera ·
  • TensorFlow Developer CertificationGoogle ·
  • Sequences, Time Series and Predictiondeeplearning.ai ·
  • NLP in TensorFlowdeeplearning.ai ·
  • AI for Medical Prognosisdeeplearning.ai ·
  • Data Science in Stratified Healthcare and Precision MedicineUniversity of Edinburgh ·

Service & awards

Mentorship, judging & awards

FeaturedMentorship & Service

TreeHacks 2026 (Stanford University) — Mentored teams on agent development, MCP server setup, MVP scoping, and deployment best practices.

Mentorship & Service

Georgia State Undergraduate Research Conference — Judge: evaluated research presentations and provided constructive feedback.

Mentorship & Service

NFTE StartUp Tech Showcase (Tucker Middle School, DeKalb County) — Judge: evaluated student startup pitches on creativity, business viability, and presentation.

Awards

INMA Global Media Award 2023 — TIME email marketing engine (43% unique open rate).

Speaking & Teaching

Talks, mentorship & courses

Speaking engagements

SpeakerSept 2024

Career in Data Science

Speaker · Nepal

Spoke on career pathways in data science — covering skills, industry trends, and practical advice for breaking into the field.

SpeakerSept 2024

Application and Impact of AI in Nepal

Speaker · Nepal

Discussed the current landscape of AI adoption in Nepal, real-world applications, and the opportunities and challenges ahead.

SpeakerAug 2024

Building Blocks of GenAI — RAG and VectorDB using AWS Bedrock

Speaker · Nepal

Technical talk on generative AI fundamentals — retrieval-augmented generation, vector databases, and practical implementation using AWS Bedrock.

SpeakerJun 2024

Building an AI Ecosystem in Nepal

Himalayan Computational Linguistic AI Olympiad — Kathmandu University · Everest Engineering College, Kathmandu

Discussed AI education as a foundation for Nepal's technological growth, and how collaboration across government, universities, and industry can build a sustainable AI ecosystem.

Guest LecturerJun 2024

Data Science Approach to Customer Segmentation

Guest Lecture — Presidential Graduate School · Kathmandu

Guest lecture on applying data science and clustering techniques to customer segmentation for marketing and product strategy.

ParticipantMar 2024

AI Conference for Prosperous Nepal

Ministry of Education, Science and Technology · Kathmandu

Participated as a guest in the national AI conference organized by Nepal's Ministry of Education, Science and Technology.

MentorFeb 2024

Fighting Misinformation and Disinformation using AI

KTM Journathon — NIMJN · Park Valley Resort, Kathmandu

Mentored multidisciplinary teams of journalists and technologists over 2 days to build AI-powered solutions against misinformation.

Panel SpeakerJan 2024

Friend or Foe: Decoding AI's Dual Nature

ICT Meetup V7.0 — Prime College · Kathmandu

Panel discussion examining AI's multifaceted character — exploring beneficial applications alongside potential risks, biases, and ethical concerns.

Panel SpeakerDec 2023

AI Adoption in Nepalese Industries

Panel Discussion — Kathmandu University · Kathmandu University

Panel discussion on the state of AI adoption across industries in Nepal, barriers to adoption, and strategies for accelerating progress.

Guest SpeakerDec 2023

CRM Using Statistical Modeling & Machine Learning

Guest Lecture — Ace Institute of Management · Kathmandu

Presented how CRM systems apply statistical modeling and ML to strengthen customer relationships, reduce churn, and optimize marketing effectiveness.

Panel SpeakerSept 2023

Revealing the Boundless Horizons of AI

Panel Discussion — Kathmandu University · Kathmandu University

Panelist exploring the broad horizons of AI — from current capabilities to frontier research and long-term societal implications.

Teaching & training

  1. Resource Person & Trainer

    Government2023 – 2024

    Government of Nepal — Ministry of Forest and Environment (FRTC) · Kathmandu, Nepal

    Trained Nepalese government officers and secretaries across four courses: Data Analysis using Python (Aug 2023), Statistical Analysis (Sept–Oct 2023), Advanced Data Analysis (Aug 2024), and Advanced Machine Learning (Oct 2024).

  2. AI/ML Bootcamp Instructor

    VolunteerJan 2024 (1 week)

    Institute of Engineering, Purwanchal Campus — Tribhuvan University · Dharan, Nepal

    Conducted an intensive 1-week AI/ML bootcamp for engineering students covering data science, statistics, and ML with emphasis on hands-on learning and practical applications.

  3. Senior Python, Data Science & AI/ML Trainer

    Professional2022 – 2024

    Broadway Infosys · Tinkune, Kathmandu

    Led professional training for 500+ students across Python, data science, and AI/ML — covering SQL, EDA, feature engineering, model evaluation, and MLOps basics.

  4. Volunteer Instructor — Python for Research

    VolunteerSept 2022 – Sept 2023

    Nepal Research and Collaboration Center (NRCC) · Kathmandu, Nepal

    Volunteered and taught a one-month free "Python for Research" course and a week-long "Research Training Program" — helping students and researchers of different backgrounds use ML effectively in their projects.

Writing

From the blog

Jul 6, 2024 · 3 min read

Mean Can Lie — Discover the Real Insights with Mean and Standard Deviation

The mean alone can be misleading. Learn how standard deviation, normal distributions, the empirical rule, and z-scores reveal the true story hidden in your data.

Read →

Mar 11, 2023 · 3 min read

NER-Powered Semantic Search Engine

Building a semantic search system enhanced with Named Entity Recognition using Pinecone, BERT-NER, and sentence transformers on 50,000 Medium articles.

Read →

Mar 10, 2022 · 3 min read

XGBoost Hyperparameter Tuning — XGBRegressor with Scikit-Learn Pipelines

End-to-end XGBoost regression on the NASA airfoil noise dataset using scikit-learn Pipelines, ColumnTransformer, and RandomizedSearchCV for hyperparameter tuning.

Read →

Resume

Curriculum vitae

Full PDF embedded below. Use the button to download a copy.

Download PDF
Your browser can't display the PDF inline. Download it here.

Contact

Get in touch

The fastest way to reach me is email. I'm open to ML / AI Engineer roles, PhD opportunities, research collaborations, and technical conversations.