Skip to content
View jatinS-dev's full-sized avatar
💭
To the Moon 🚀💫
💭
To the Moon 🚀💫

Highlights

  • Pro

Block or report jatinS-dev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
jatinS-dev/README.md

Building production-grade ML systems that are fast, reliable, and safe to deploy.

M.S. Computer Science, Machine Learning · University of Florida
Research · LLMs · Retrieval · Recommendation Systems · Agent Governance


What I work on

I build ML systems where model quality, latency, reliability, and safety all matter at the same time.

  • LLM systems: RAG, agent workflows, hallucination evaluation, structured outputs, tool-call governance
  • ML infrastructure: model serving, batching, retrieval pipelines, GPU-aware inference, production monitoring
  • Applied ML: recommendation systems, personalization, dense retrieval, A/B testing, privacy-preserving training

Highlights

ACL 2024

Published long paper on multimodal clinical document summarization.

1M+ requests/day

Built production ML systems for recommendation and personalization at ICICI Bank.

Sentient

Building an agent security layer for policy enforcement, approval flows, and auditability.


Featured projects

🛡️ Sentient

AI Agent Security Platform

A governance layer for AI agents that evaluates every tool call before execution. Safe actions pass, risky actions require approval, and dangerous actions are blocked.

Core pieces

  • Policy engine for allow, block, and human approval decisions
  • Tamper-evident JSONL audit logs
  • Agent registry with roles, owners, and tool permissions
  • Framework adapters for LangGraph, OpenAI Agents SDK, CrewAI, and AutoGen

View project →

⚡ GPU-Optimized Retrieval System

Distributed ML Inference

A personalized retrieval and re-ranking system over a 1M+ document index with dense retrieval, batching, memory pooling, FAISS search, and online evaluation.

Impact

  • 15–20% reduction in p95 latency
  • 8–12% improvement in Recall@100
  • Built for high-throughput recommendation workloads

🔬 Hallucination Self-Refinement in VLMs

Vision-Language Model Evaluation

Benchmarked whether iterative self-critique and revision can reduce hallucinations in vision-language models.

Focus

  • Multi-round refinement loops
  • Faithfulness and factuality evaluation
  • CHAIR and POPE-style hallucination metrics

View repo →

🔒 Anisotropic DP-SGD

Private ML Training Framework

A differentially private training method that shapes noise using gradient covariance structure instead of adding isotropic Gaussian noise everywhere.

Built with PyTorch · NumPy · CUDA · Per-sample gradients · Backward hooks

View repo →


Publications

Work Venue Area
From Sights to Insights: Summarization of Multimodal Clinical Documents ACL 2024 Main Conference Multimodal clinical AI
Structure-Grounded Medical QA: RDF Retrieval and Claim-Level Verification ACL 2026 Workshop, submitted Faithful medical QA
Anisotropic Noise Injection for Improving Utility in DP-SGD Under review Differential privacy

Experience

Role Scope
Research Assistant · University of Florida LLM agents, retrieval, personalization, hallucination evaluation
ML Engineer · ICICI Bank Production ML serving, recommendation systems, A/B testing, 1M+ daily requests
Research Assistant · IIT Patna AI-ML-NLP Lab Dense retrieval, GPU inference, 500K+ document pipelines
MITACS Globalink Fellow · Polytechnique Montréal Sequence modeling, parallel computing, high-performance C++
Google Summer of Code · Joomla CMS Backend engineering, production open source, contributor mentorship

Tech I use

ML / LLMs

PyTorch
JAX
Hugging Face
LoRA / PEFT
SFT / DPO
Knowledge Distillation

Retrieval / Recsys

FAISS
Dense Retrieval
Re-ranking
Two-Tower Models
A/B Testing
CTR Optimization

Agents / Safety

LangGraph
OpenAI Agents SDK
CrewAI
AutoGen
Guardrails
Policy Engines

Infrastructure

FastAPI
Ray Serve
Docker
Kubernetes
AWS ECS / S3 / EC2
CI/CD


GitHub Stats Top Languages

Pinned Loading

  1. joomla-projects/soc21_guided-tour joomla-projects/soc21_guided-tour Public archive

    PHP 6 11

  2. DSA-Resources DSA-Resources Public

    C++ 9

  3. CAI6307-BetterAI/betterai-project CAI6307-BetterAI/betterai-project Public

    Python 1

  4. CIS6930-Distributed-Machine-Learning/anisotropic-dp-sgd CIS6930-Distributed-Machine-Learning/anisotropic-dp-sgd Public

    Anisotropic Noise Injection for Improving Utility in Differentially Private SGD

    Python 1

  5. Sentinent-AI/backend-go Sentinent-AI/backend-go Public

    Go