Shubham Agarwal
Hi! I am a PhD student in Computer Science at the Sky Lab, UC Berkeley. I am advised by Prof. Ion Stoica and Prof. Aditya Parameswaran. My research focuses on ML Systems, improving the reliability and efficiency of LLMs and agents.
Previously, I was a Research Associate at Adobe Research, where I worked with Dr. Subrata Mitra and Dr. Shiv Kumar Saini. In this role, I built large-scale systems for generative models, optimizing efficiency and resource use with techniques like approximate caching. I also worked on system reliability, developing ML tools for outage prediction and failure diagnosis.
I graduated with a Bachelor's in Computer Science from BITS Pilani in 2022. During my undergraduate study, I also interned at Adobe Research and American Express AI lab.
To get in touch with me, please email me at shubham3@berkeley.edu
Publications
Patents
-
USPTOMicromanaging Prompts for High-Throughput Text-to-Image Inference.US Patent App. 18/808,654 Filed
-
USPTOIntelligent Use of Caching and Retrieval of Intermediate Noise for Resource Efficient Diffusion Models.US Patent App. 18/637,024 Filed
-
USPTOReinforcement Learning Based Framework for Scaling Visualization Recommendation Models on Large Data.US Patent App. 18/668,888 Filed
-
USPTOData Exploration using Natural Language with Data Sampling.US Patent App. 18/675,930 Filed
-
USPTOA System and Method for Outage Forecasting.US Patent App. 17/656,263 Filed
News
| Aug 21, 2025 | Starting my Ph.D. at UC Berkeley! |
|---|---|
| Jul 25, 2025 | New Paper accepted at SIGMOD 2025 on approximate KV Cache sharing in RAG workflows! |
| Jul 25, 2024 | New Paper accepted at ECCV 2024 on Approximate Caching using Image Concepts! |
| Mar 25, 2024 | Presenting a Poster on "Quality-Aware Prompt Scheduling" at NSDI. |
| Mar 25, 2024 | Presenting our Paper "NIRVANA for Efficiently Serving Diffusion Models" at NSDI. |
| Feb 10, 2024 | Promoted to Research Associate 2 at Adobe Research! |
| Jan 25, 2024 | New Paper accepted at PAKDD 2024 on Scaling Visualization Recommendation Models! |
| Dec 11, 2023 | New Paper accepted at NSDI 2024 on Approximate Caching for Diffusion Models! |
| Sep 14, 2023 | Presented our Paper "ESRO: Experience Assisted Service Reliability against Outages" at ASE. |
| Jul 07, 2023 | New Paper accepted at ASE 2023 on Root Cause Detection using alerts and outage reports! |
| Feb 26, 2023 | New Paper accepted at SIGMOD 2023 Demo on Exporatory Data Analysis tool! |
| Aug 20, 2022 | Graduated with a Bachelor's in Computer Science from BITS Pilani. |
| Jul 10, 2022 | Joined Adobe Research as a Research Associate. |