hashkat.Ashutosh.
Research Engineer @ Adobe · LLM Agent Evals & Benchmarking · AI Security
SDR-Bench · CSAW ESC '22 winner · WACV & ECCV · Rust systems · IIT Roorkee
// What I work on:
const research = {
  primary: ["LLM evals", "agent benchmarking", "process rewards"],
  security: ["adversarial ML", "CTF", "systems security"],
  systems: ["Rust", "OS internals", "infra at scale"],
  thesis: "measure what agents can't do yet"
};
~/publications
Research in AI/ML and computer vision with collaborators from top institutions
3 publications • Collaborations with Adobe Research, Stanford, Microsoft Research, CMU, and premier IITs
SDR-Bench: Benchmarking the Personalization Capabilities of Large Language Models
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
Towards Efficient Exemplar Based Image Editing with Multimodal VLMs
~/achievements
Recognition for contributions to AI research, blockchain innovation, and cybersecurity competitions
CSAW ESC 2022 — 1st Place, Research Track
World's oldest hardware security competition · Adversarial attacks on ML models
Won the Embedded Security Challenge research track for work on adversarial attacks against machine learning models — before AI security became a mainstream research area. CSAW ESC is run by NYU and is the world's oldest hardware security competition.
EigenLayer Infinite Prize
ValidAI - AVS Development
Won the EigenLayer Infinite Prize for ValidAI, an Actively Validated Service (AVS) leveraging EigenLayer's restaking infrastructure for decentralized AI validation.
CTF Global Rankings
InfoSecIITR Team
Led InfoSecIITR to rank #40 globally and #4 in India on CTFtime, competing in international cybersecurity capture-the-flag competitions.
~/experience
Applied Research Engineer
Working on the Sales Qualifier Agent (AJO B2B AO) — an AI-driven application that automates B2B prospect qualification and outreach. Leading personalization research including SDR-Bench, the first benchmark for measuring personalization capabilities of Deep Research agents for B2B sales.
Research Intern
Conducted machine learning research focused on the interdisciplinary application of AI to brain tumor detection.
Infrastructure Engineer
Developed and maintained scalable infrastructure for AI/ML workloads, optimizing cloud systems and deployment pipelines.
Research Intern
Conducted research on diffusion-based image editing. Developed ReEdit, a novel end-to-end framework for exemplar-based image editing.
Team Captain
Led cybersecurity initiatives and CTF competitions (rank #40 globally, #4 in India on CTFtime). Won the CSAW ESC 2022 Research Track for work on adversarial attacks against ML models. Mentored team members and developed security tools and frameworks. Visit: infoseciitr.in
Developer
Contributed to open-source projects at SDSLabs, IIT Roorkee. Developed VectorDB, Katana, and RusticOS, and participated in multiple hackathons. Visit: sdslabs.co
~/expertise
Multi-domain technical expertise across cutting-edge technologies
AI Research & Evaluation
Building and evaluating agentic AI systems. Research on LLM benchmarking, process rewards, and long-horizon agent reasoning. Published at WACV and ECCV.
AI & Systems Security
Adversarial ML research, CTF competitions, and security engineering. CSAW ESC winner. Led InfoSecIITR to rank #40 globally on CTFtime.
Infrastructure & DevOps
Architecting scalable ML infrastructure on Kubernetes. Cloud-native systems, CI/CD pipelines, and infrastructure-as-code for AI/ML workloads.
Blockchain (side projects)
Smart contract development, zkVM systems (RISC-0), and AVS (Actively Validated Services). Experience with EigenLayer, zero-knowledge proofs, and decentralized consensus protocols.
~/projects
Open-source contributions and personal projects across multiple domains

SDR-Bench
AI/ML · Featured
The first framework to systematically benchmark generative personalization capabilities of LLMs for B2B sales. Features a dual-layered dataset spanning 6,279 articles across 20+ industries.

RusticOS
OS Development
Modular operating system kernel written completely in Rust. Features custom memory management, process scheduling, and x86-64 architecture support.

VectorDB
AI/ML
VectorDB is a high-performance vector database for storing and querying embeddings. Built for ML applications with efficient similarity search and HNSW indexing.

Katana CTF Platform
Security
Production-ready attack and defense CTF platform with automated infrastructure setup.

ValidAI
Blockchain · Featured
Decentralized AI validation system leveraging Actively Validated Services (AVS). Implements custom consensus for ML model verification on-chain. Winner of EigenLayer Infinite Prize.

Proof of Optima
Blockchain
Zero-knowledge proof system for verifiable computation using zkVM and RISC-0. Demonstrates advanced cryptographic protocols for smart contracts.
~/skills
Technical expertise across multiple domains
AI & Machine Learning
Security & CTF
Infrastructure & DevOps
Blockchain & Web3
Languages & Tools
~/contact
Open to research collaborations, speaking opportunities, and exciting projects