Research Engineer @ Adobe · LLM Agent Evals & Benchmarking · AI Security

SDR-Bench · CSAW ESC '22 winner · WACV & ECCV · Rust systems · IIT Roorkee

// What I work on:
const research = {
  primary:  ["LLM evals", "agent benchmarking", "process rewards"],
  security: ["adversarial ML", "CTF", "systems security"],
  systems:  ["Rust", "OS internals", "infra at scale"],
  thesis:   "measure what agents can't do yet"
};

~/publications

Research in AI/ML and computer vision with collaborators from top institutions

3 publications • Collaborations with Adobe Research, Stanford, Microsoft Research, CMU, and premier IITs

Under Review · 2025

SDR-Bench: Benchmarking the Personalization Capabilities of Large Language Models

Ashutosh Srivastava
Adobe
Under Review

WACV 2025 · 2025

ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models

Ashutosh Srivastava, Tarun Ram Menta, Abhinav Java, Avadhoot Jadhav, Silky Singh, Surgan Jandial, Balaji Krishnamurthy
IIT Roorkee · Adobe Research · Microsoft Research · IIT Bombay · Stanford University · Carnegie Mellon University
Winter Conference on Applications of Computer Vision (WACV) & ECCV 2024 Workshop (AI4VA)

ECCV 2024W · 2024

Towards Efficient Exemplar Based Image Editing with Multimodal VLMs

Avadhoot Jadhav, Ashutosh Srivastava, Abhinav Java, Silky Singh, Tarun Ram Menta, Surgan Jandial, Balaji Krishnamurthy
IIT Bombay · IIT Roorkee · Microsoft Research · Stanford University · Adobe Research · Carnegie Mellon University
European Conference on Computer Vision Workshop (ECCV 2024 - AI4VA)

~/experience

Applied Research Engineer

Adobe
Jul 2025 - Present

Working on the Sales Qualifier Agent (AJO B2B AO) — an AI-driven application that automates B2B prospect qualification and outreach. Leading personalization research including SDR-Bench, the first benchmark for measuring personalization capabilities of Deep Research agents for B2B sales.

Research Intern

Trinity College Dublin
Dec 2024 - Mar 2025

Conducted machine learning research focused on the interdisciplinary application of AI to brain tumor detection.

Infrastructure Engineer

Abacus.AI
Oct 2024 - Feb 2025

Developed and maintained scalable infrastructure for AI/ML workloads, optimizing cloud systems and deployment pipelines.

Research Intern

Adobe
May 2024 - Jul 2024

Conducted research on diffusion-based image editing. Developed ReEdit, a novel end-to-end framework for exemplar-based image editing.

Team Captain

InfoSecIITR
Jun 2022 - May 2025

Led cybersecurity initiatives and CTF competitions (Rank #40 globally, #4 in India on CTFtime). Won the Research Track of CSAW ESC 2022 for work on adversarial attacks against ML models. Mentored team members and developed security tools and frameworks. Visit: infoseciitr.in

Developer

SDSLabs
Apr 2022 - May 2025

Contributed to open-source projects at SDSLabs, IIT Roorkee. Developed VectorDB, Katana, RusticOS, and participated in multiple hackathons. Visit: sdslabs.co

~/expertise

Multi-domain technical expertise across cutting-edge technologies

AI Research & Evaluation

Building and evaluating agentic AI systems. Research on LLM benchmarking, process rewards, and long-horizon agent reasoning. Published at WACV and ECCV.

PyTorch · TensorFlow · Research

AI & Systems Security

Adversarial ML research, CTF competitions, and security engineering. CSAW ESC winner. Led InfoSecIITR to rank #40 globally on CTFtime.

AppSec · CTF

Infrastructure & DevOps

Architecting scalable ML infrastructure on Kubernetes. Cloud-native systems, CI/CD pipelines, and infrastructure-as-code for AI/ML workloads.

Kubernetes · AWS · Terraform

Blockchain (side projects)

Smart contract development, zkVM systems (RISC-0), and AVS (Actively Validated Services). Experience with EigenLayer, zero-knowledge proofs, and decentralized consensus protocols.

Solidity · zkVM · AVS

~/projects

Open-source contributions and personal projects across multiple domains

SDR-Bench

AI/ML · Featured

The first framework to systematically benchmark generative personalization capabilities of LLMs for B2B sales. Features a dual-layered dataset spanning 6,279 articles across 20+ industries.

Python · LLMs · NLP
View Project

RusticOS

OS Development

Modular operating system kernel written completely in Rust. Features custom memory management, process scheduling, and x86-64 architecture support.

Rust · OS · Systems
View on GitHub

VectorDB

AI/ML

VectorDB is a high-performance vector database for storing and querying embeddings. Built for ML applications with efficient similarity search and HNSW indexing.

Rust · ML Infra
View on GitHub

Katana CTF Platform

Security

Production-ready attack and defense CTF platform with automated infrastructure setup.

Go · Kubernetes · Docker
View on GitHub

ValidAI

Blockchain · Featured
EigenLayer Infinite Prize

Decentralized AI validation system leveraging Actively Validated Services (AVS). Implements custom consensus for on-chain ML model verification. Winner of the EigenLayer Infinite Prize.

Solidity · EigenLayer · AVS
View on GitHub

Proof of Optima

Blockchain

Zero-knowledge proof system for verifiable computation using zkVM and RISC-0. Demonstrates advanced cryptographic protocols for smart contracts.

RISC-0 · zkVM · Solidity
View on GitHub

~/skills

Technical expertise across multiple domains

AI & Machine Learning

PyTorch · Diffusion Models · Transformers · LLMs · Computer Vision · NLP · MLOps · Agentic AI

Security & CTF

Web AppSec · Penetration Testing · CTFs

Infrastructure & DevOps

Kubernetes · Docker · AWS · GCP · Terraform · CI/CD · Monitoring · Linux

Blockchain & Web3

Smart Contracts · Solidity · zkVM · AVS

Languages & Tools

Python · Rust · Go · C/C++ · SQL · Git · Linux