Shubham Agarwal

Shubham Agarwal Profile Pic

Hi! I am a PhD student in Computer Science at the Sky Lab, UC Berkeley.
I am advised by Prof. Ion Stoica and Prof. Aditya Parameswaran. My research focuses on ML Systems, improving the reliability and efficiency of LLMs and agents.

Previously, I was a Research Associate at Adobe Research, where I worked with Dr. Subrata Mitra and Dr. Shiv Kumar Saini. In this role, I built large-scale systems for generative models, optimizing efficiency and resource use with techniques like approximate caching. I also worked on system reliability, developing ML tools for outage prediction and failure diagnosis.

I graduated with a Bachelor's in Computer Science from BITS Pilani in 2022. During my undergraduate study, I also interned at Adobe Research and American Express AI lab.

To get in touch with me, please email me at shubham3@berkeley.edu

Publications

  1. SIGMOD'25
    Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation
    Shubham Agarwal*,Sai Sundaresan* , Subrata Mitra , Debabrata Mahapatra , Archit Gupta , Rounak Sharma , Nirmal Joshua Kapu , Tong Yu , Shiv Saini
    In The Proceedings of the 2025 International Conference on Management of Data (SIGMOD) 2025
  2. ECCV’24
    ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories
    Chen-Yi Lu*Shubham Agarwal*,Mehrab Tanjim , Kanak Mahadik , Anup Rao , Subrata Mitra , Shiv K Saini , Saurabh Bagchi , Somali Chaterji
    In 18th European Conference on Computer Vision (ECCV) 2024
  3. PAKDD’24
    ScaleViz: Scaling Visualization Recommendation Models on Large Data
    Ghazi Shazan Ahmad, Shubham Agarwal, Subrata Mitra, Ryan Rossi,  Manav Doshi,  Syam Manoj Kumar Paila
    In 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2024
  4. NSDI'24
    Approximate Caching for Efficiently Serving Diffusion Models
    Shubham Agarwal, Subrata Mitra, Sarthak Chakraborty, Srikrishna Karanam, Koyel Mukherjee, Shiv Saini
    In 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI) 2024
  5. ESEC/FSE'23
    Outage-Watch: Early Prediction of Outages using Extreme Event Regularizer
    Shubham Agarwal, Sarthak Chakraborty, Shaddy Garg, Sumit Bisht, Chahat Jain, Ashritha Gonuguntla, Shiv Saini
    In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2023
  6. ASE'23
    ESRO: Experience Assisted Service Reliability against Outages
    Sarthak Chakraborty, Shubham Agarwal,  Shaddy Garg, Abhimanyu Sethia, Udit Narayan Pandey, Videh Aggarwal, Shiv Saini
    In Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering (ASE) 2023
  7. WWW'23
    CausIL: Causal Graph for Instance Level Microservice Data
    Sarthak Chakraborty, Shaddy Garg*Shubham Agarwal*,  Ayush Chauhan, Shiv Saini
    In Proceedings of the ACM Web Conference 2023 (WWW) 2023
  8. SIGMOD'23
    Fast Natural Language Based Data Exploration with Samples
    Shubham Agarwal, Gromit Yeuk-Yin Chan, Shaddy Garg, Tong Yu, Subrata Mitra
    In Companion of the 2023 International Conference on Management of Data (SIGMOD) 2023

Patents

  1. USPTO
    Micromanaging Prompts for High-Throughput Text-to-Image Inference.
    Shubham Agarwal, Subrata Mitra, Saud Iqbal
    US Patent App. 18/808,654 Filed
  2. USPTO
    Intelligent Use of Caching and Retrieval of Intermediate Noise for Resource Efficient Diffusion Models.
    Shubham Agarwal, Subrata Mitra, Sarthak Chakraborty, Srikrishna Karanam, Koyel Mukherjee, Shiv Kumar Saini
    US Patent App. 18/637,024 Filed
  3. USPTO
    Reinforcement Learning Based Framework for Scaling Visualization Recommendation Models on Large Data.
    Shubham Agarwal, Subrata Mitra, Ryan Rossi, Ghazi Shazan Ahmad, Manav Doshi, Syam Manoj Kumar Paila
    US Patent App. 18/668,888 Filed
  4. USPTO
    Data Exploration using Natural Language with Data Sampling.
    Subrata Mitra, Shubham Agarwal, Gromit Yeuk-Yin Chan, Shaddy Garg, Tong Yu
    US Patent App. 18/675,930 Filed
  5. USPTO
    A System and Method for Outage Forecasting.
    Shaddy Garg, Shubham Agarwal, Sumit Bisht, Nikhil Sheoran, Chahat Jain, Ashritha Gonuguntla, Shiv Saini
    US Patent App. 17/656,263 Filed

News

Aug 21, 2025 Starting my Ph.D. at UC Berkeley!
Jul 25, 2025 New Paper accepted at SIGMOD 2025 on approximate KV Cache sharing in RAG workflows!
Jul 25, 2024 New Paper accepted at ECCV 2024 on Approximate Caching using Image Concepts!
Mar 25, 2024 Presenting a Poster on "Quality-Aware Prompt Scheduling" at NSDI.
Mar 25, 2024 Presenting our Paper "NIRVANA for Efficiently Serving Diffusion Models" at NSDI.
Feb 10, 2024 Promoted to Research Associate 2 at Adobe Research!
Jan 25, 2024 New Paper accepted at PAKDD 2024 on Scaling Visualization Recommendation Models!
Dec 11, 2023 New Paper accepted at NSDI 2024 on Approximate Caching for Diffusion Models!
Sep 14, 2023 Presented our Paper "ESRO: Experience Assisted Service Reliability against Outages" at ASE.
Jul 07, 2023 New Paper accepted at ASE 2023 on Root Cause Detection using alerts and outage reports!
Feb 26, 2023 New Paper accepted at SIGMOD 2023 Demo on Exporatory Data Analysis tool!
Aug 20, 2022 Graduated with a Bachelor's in Computer Science from BITS Pilani.
Jul 10, 2022 Joined Adobe Research as a Research Associate.