Publications

954 results for Trustworthy AI

Neural Reasoning Networks: Efficient interpretable neural networks with automatic textual explanations
- - Steve Carrow
  - Kyle Harper Erwin
  - et al.
- 2025
- AAAI 2025
Poster
Agent Trajectory Explorer: Visualizing and Providing Feedback on Agent Trajectories
- - Michael Desmond
  - Ja Young Lee
  - et al.
- 2025
- AAAI 2025
Demo paper
Usage Governance Advisor: from Intent to AI Governance
- - Elizabeth Daly
  - Sean Rooney
  - et al.
- 2025
- AAAI 2025
Demo paper
Adaptive PII Mitigation Framework for Large Language Models
- - Shubhi Asthana
  - Ruchi Mahindru
  - et al.
- 2025
- AAAI 2025
Workshop paper
Leveraging Interpretability in the Transformer to Automate the Proactive Scaling of Cloud Resources
- - Amadou Ba
  - Pavithra Harsha
  - et al.
- 2025
- AAAI 2025
Workshop paper
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring
- - Buse Korkmaz
  - Rahul Nair
  - et al.
- 2025
- AAAI 2025
Workshop paper
Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications
- - Shubhi Asthana
  - Bing Zhang
  - et al.
- 2025
- AAAI 2025
Workshop paper
An Open Ecosystem to Support AI Value Alignment & Human Feedback
- - Peter Santhanam
- 2025
- AAAI 2025
Workshop paper
Usage Governance Advisor: From Intent to AI Governance
- - Elizabeth Daly
  - Sean Rooney
  - et al.
- 2025
- AAAI 2025
Workshop paper
Scopes of Alignment
- - Kush Varshney
  - Zahra Ashktorab
  - et al.
- 2025
- AAAI 2025
Workshop paper