Hello, I'm Arpit Singh Gautam

I am a Data Scientist in the CSG CTO Lab at Dell Technologies, working on efficient LLM inference and the reliability of large models. My research identity is efficiency × reliability for foundation models: making large models cheaper to run and more trustworthy.

My deepest current line is the reliability of quantized models - what post-training quantization does to calibration, factual recall, and security, and how to preserve each cheaply. My work appears at EACL 2026 (FEVER) and AAAI 2026 (ToM), with preprints on RL-based quantization (RAMP) and disaggregated LLM serving (StreamServe).

Research interests: LLM systems & efficient inference - quantization, KV-cache optimization, serving · Trustworthy / honest LLMs - hallucination, calibration, safety · Reinforcement learning & reasoning for foundation models · Interpretability / mechanistic ML

Arpit Singh Gautam
0 Publications
0 Yr Full-Time Exp
0 Talks, Panels & Mentoring
0 Hackathon Wins
0 1-on-1 Students Helped 5/5 on TopMate ★

Recent Updates

Papers · Projects · Talks · Blog posts - all in one place.

LLM Quantization Gallery
Project Launch
Apr 7, 2026
An interactive gallery comparing INT4, INT8, GPTQ, AWQ, QLoRA and GGUF methods - with perplexity scores, memory footprints, and throughput benchmarks side by side. Read the blog post →
RAMP Paper
Paper · arXiv
Mar 18, 2026
Off-policy SAC framework that learns per-layer bit-width assignments to minimize perplexity under a global bit budget. Achieves 5.54 perplexity at 3.68 GB on Llama 2 7B - outperforming uniform 4-bit AWQ. Zero-shot transfer to Llama 2 13B and Mistral 7B.
EACL 2026
Paper Accepted · EACL 2026
Apr 2026
Accepted at the FEVER Workshop @ EACL 2026 - now on ACL Anthology. Proposes a diffusion-based generative stability method for automated fact verification with Kailash Talreja and Saurabh Jha.
ARC-AGI-3
Competition · Ongoing
2026 · Ongoing
Participating in François Chollet's ARC-AGI-3 challenge - one of AI's hardest benchmarks for fluid intelligence. Submitted a random agent baseline and exploring learning-based approaches. Read the blog post →