Inkit Padhi

Bio

Please visit my personal page for the most recent updates: inkit.nyc

Publications

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
- - Yue Huang
  - Hang Hua
  - et al.
- 2026
- ICLR 2026
Conference paper
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
- - Dennis Wei
  - Inkit Padhi
  - et al.
- 2025
- NeurIPS 2025
Conference paper
When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails
- - Manish Nagireddy
  - Inkit Padhi
  - et al.
- 2025
- AIES 2025
Conference paper
Granite Guardian: Comprehensive LLM Safeguarding
- - Inkit Padhi
  - Manish Nagireddy
  - et al.
- 2025
- NAACL 2025
Conference paper
Programming Refusal with Conditional Activation Steering
- - Bruce Lee
  - Inkit Padhi
  - et al.
- 2025
- ICLR 2025
Conference paper
Contextual Value Alignment
- - Kush Varshney
  - Miao Liu
  - et al.
- 2025
- ICASSP 2025
Conference paper
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
- - Yufang Hou
  - Alessandra Pascale
  - et al.
- 2024
- NeurIPS 2024
Conference paper
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
- - Dennis Wei
  - Inkit Padhi
  - et al.
- 2024
- NeurIPS 2024
Workshop paper
Value Alignment from Unstructured Text
- - Inkit Padhi
  - Karthikeyan Natesan Ramamurthy
  - et al.
- 2024
- NeurIPS 2024
Workshop paper
Value Alignment from Unstructured Text
- - Inkit Padhi
  - Karthikeyan Natesan Ramamurthy
  - et al.
- 2024
- EMNLP 2024
Conference paper

Visit Google Scholar

Blog posts

Lightweight tools for ‘steering’ LLMs down the right path
Research
Kim Martineau
15 Oct 2025
IBM’s safety checkers top a new AI benchmark
News
Kim Martineau
09 Apr 2025
An AI foundation model that learns the grammar of molecules
News
Payel Das, Youssef Mroueh, Inkit Padhi, Vijil Chenthamarakshan, Jerret Ross, and Brian Belgodere
25 Jan 2023
IBM researchers check AI bias with counterfactual text
Research
Nishtha Madaan, Inkit Padhi, Naveen Panwar, and Diptikalyan Saha
05 Feb 2021
5 minute read
- AI Testing
- Fairness, Accountability, Transparency