Jatin Avinash Salve jatinS-dev

Building production-grade ML systems that are fast, reliable, and safe to deploy.

M.S. Computer Science, Machine Learning · University of Florida
Research · LLMs · Retrieval · Recommendation Systems · Agent Governance

What I work on

I build ML systems where model quality, latency, reliability, and safety all matter at the same time.

LLM systems: RAG, agent workflows, hallucination evaluation, structured outputs, tool-call governance
ML infrastructure: model serving, batching, retrieval pipelines, GPU-aware inference, production monitoring
Applied ML: recommendation systems, personalization, dense retrieval, A/B testing, privacy-preserving training

Highlights

ACL 2024

Published long paper on multimodal clinical document summarization.

1M+ requests/day

Built production ML systems for recommendation and personalization at ICICI Bank.

Sentient

Building an agent security layer for policy enforcement, approval flows, and auditability.

Featured projects

🛡️ Sentient

AI Agent Security Platform

A governance layer for AI agents that evaluates every tool call before execution. Safe actions pass, risky actions require approval, and dangerous actions are blocked.

Core pieces

Policy engine for allow, block, and human approval decisions
Tamper-evident JSONL audit logs
Agent registry with roles, owners, and tool permissions
Framework adapters for LangGraph, OpenAI Agents SDK, CrewAI, and AutoGen

View project →

⚡ GPU-Optimized Retrieval System

Distributed ML Inference

A personalized retrieval and re-ranking system over a 1M+ document index with dense retrieval, batching, memory pooling, FAISS search, and online evaluation.

Impact

15–20% reduction in p95 latency
8–12% improvement in Recall@100
Built for high-throughput recommendation workloads

🔬 Hallucination Self-Refinement in VLMs

Vision-Language Model Evaluation

Benchmarked whether iterative self-critique and revision can reduce hallucinations in vision-language models.

Focus

Multi-round refinement loops
Faithfulness and factuality evaluation
CHAIR and POPE-style hallucination metrics

View repo →

🔒 Anisotropic DP-SGD

Private ML Training Framework

A differentially private training method that shapes noise using gradient covariance structure instead of adding isotropic Gaussian noise everywhere.

Built with PyTorch · NumPy · CUDA · Per-sample gradients · Backward hooks

View repo →

Publications

Work	Venue	Area
From Sights to Insights: Summarization of Multimodal Clinical Documents	ACL 2024 Main Conference	Multimodal clinical AI
Structure-Grounded Medical QA: RDF Retrieval and Claim-Level Verification	ACL 2026 Workshop, submitted	Faithful medical QA
Anisotropic Noise Injection for Improving Utility in DP-SGD	Under review	Differential privacy

Experience

Role	Scope
Research Assistant · University of Florida	LLM agents, retrieval, personalization, hallucination evaluation
ML Engineer · ICICI Bank	Production ML serving, recommendation systems, A/B testing, 1M+ daily requests
Research Assistant · IIT Patna AI-ML-NLP Lab	Dense retrieval, GPU inference, 500K+ document pipelines
MITACS Globalink Fellow · Polytechnique Montréal	Sequence modeling, parallel computing, high-performance C++
Google Summer of Code · Joomla CMS	Backend engineering, production open source, contributor mentorship

Tech I use

ML / LLMs

PyTorch
JAX
Hugging Face
LoRA / PEFT
SFT / DPO
Knowledge Distillation

Retrieval / Recsys

FAISS
Dense Retrieval
Re-ranking
Two-Tower Models
A/B Testing
CTR Optimization

Agents / Safety

LangGraph
OpenAI Agents SDK
CrewAI
AutoGen
Guardrails
Policy Engines

Infrastructure

FastAPI
Ray Serve
Docker
Kubernetes
AWS ECS / S3 / EC2
CI/CD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jatin Avinash Salve jatinS-dev

Achievements

Achievements

Highlights

Block or report jatinS-dev

Building production-grade ML systems that are fast, reliable, and safe to deploy.

What I work on

Highlights

ACL 2024

1M+ requests/day

Sentient

Featured projects

🛡️ Sentient

⚡ GPU-Optimized Retrieval System

🔬 Hallucination Self-Refinement in VLMs

🔒 Anisotropic DP-SGD

Publications

Experience

Tech I use

Pinned Loading

Uh oh!