Shubham Agarwal

Hello! My name is Shubham. I graduated with a Bachelor's in Computer Science Engineering from BITS Pilani in 2022, where I ranked at the top of my undergraduate cohort and received 100% scholarship. I'm presently working at Adobe Research as a Pre-doctoral Researcher, where I primairly work with Dr. Subrata Mitra and Dr. Shiv Kumar Saini.
I am interested in the area of Machine Learning + Systems where my current focus is on optimizing ML systems for large-scale training and deployment. With over two years of academic and industrial research experience, I have first author publications in SIGMOD, NSDI, ECCV and FSE, and co-authorship in WWW, ASE, and PAKDD.
If you have any ideas to discuss or questions about my work, please contact me at skejriwal44@gmail.com
I am a part of the Systems and Insights Group at Adobe Research, where I primarily focus on building large-scale systems to enhance resource utilization and efficiency for generative models. I focus on techniques such as Approximate Computing, Constraint-based scheduling, and runtime adaptation strategies for distributed systems. I am collaborating with Purdue and UMass professors on some of these projects. I have also worked on System Reliability Research where I developed ML-based tools for predicting cloud outages and utilized causal techniques for cloud failure diagnosis, aiming to improve the reliability and availability of distributed cloud systems.
- Currently, my ongoing projects include:
- Approximate caching: Resource-Efficient Text-to-image diffusion Models.
- Cluster scheduling: Optimizing Inferencing Platforms for text-to-image workloads.
- Model-less inferencing: Query-Aware Model Selection for Serving Generative Workloads.
- Predicting Cloud Outages: Develop model for anticipating cloud outages.
- Outage forecasting based on system metrics and alerts
- Latency prediction to detect anomalous data ingestion workflows
Publications
Patents
-
USPTOMicromanaging Prompts for High-Throughput Text-to-Image Inference.US Patent App. 18/808,654 Filed
-
USPTOIntelligent Use of Caching and Retrieval of Intermediate Noise for Resource Efficient Diffusion Models.US Patent App. 18/637,024 Filed
-
USPTOReinforcement Learning Based Framework for Scaling Visualization Recommendation Models on Large Data.US Patent App. 18/668,888 Filed
-
USPTOData Exploration using Natural Language with Data Sampling.US Patent App. 18/675,930 Filed
-
USPTOA System and Method for Outage Forecasting.US Patent App. 17/656,263 Filed
News
Jul 25, 2024 | New Paper accepted at SIGMOD 2025 on approximate KV Cache sharing in RAG workflows! |
---|---|
Jul 25, 2024 | New Paper accepted at ECCV 2024 on Approximate Caching using Image Concepts! |
Mar 25, 2024 | Presenting a Poster on "Quality-Aware Prompt Scheduling" at NSDI. |
Mar 25, 2024 | Presenting our Paper "NIRVANA for Efficiently Serving Diffusion Models" at NSDI. |
Feb 10, 2024 | Promoted to Research Associate 2 at Adobe Research! |
Jan 25, 2024 | New Paper accepted at PAKDD 2024 on Scaling Visualization Recommendation Models! |
Dec 11, 2023 | New Paper accepted at NSDI 2024 on Approximate Caching for Diffusion Models! |
Sep 14, 2023 | Presented our Paper "ESRO: Experience Assisted Service Reliability against Outages" at ASE. |
Jul 07, 2023 | New Paper accepted at ASE 2023 on Root Cause Detection using alerts and outage reports! |
Feb 26, 2023 | New Paper accepted at SIGMOD 2023 Demo on Exporatory Data Analysis tool! |
Aug 20, 2022 | Graduated with a Bachelor's in Computer Science from BITS Pilani. |
Jul 10, 2022 | Joined Adobe Research as a Research Associate. |