Evaluating perturbation robustness of generative systems that use COBOL code inputsSamuel AckermanWesam Ibraheemet al.2026ICSE 2026Workshop paper
PACIFIC: a framework for generating benchmarks to check Precise Automatically Checked Instruction Following In CodeItay DreyfussAntonio Abu Nassaret al.2026ICSE 2026Workshop paper
Exploring Straightforward Methods for Automatic Conversational Red-TeamingGeorge KourNaama Zwerdlinget al.2025NAACL 2025Conference paper
Unveiling Safety Vulnerabilities of Large Language ModelsGeorge KourMarcel Zalmanoviciet al.2023EMNLP 2023Workshop paper
Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights GenerationGeorge KourMarcel Zalmanoviciet al.2022AAAI 2022Workshop paper
Density-based interpretable hypercube region partitioning for mixed numeric and categorical dataSamuel AckermanEitan Farchiet al.2021JSM 2021Conference paper
Machine Learning Model Drift Detection Via Weak Data SlicesSamuel AckermanParijat Dubeet al.2021ICSE 2021Workshop paper
FreaAI: Automated extraction of data slices to test machine learning modelsSamuel AckermanOrna Razet al.2020AAAI 2020Workshop paper
Detection of data 'drift' over time affecting ML model performanceSamuel AckermanOrna Razet al.2019ISA Annual Meeting 2019Invited talk
Managing the risk in AI: Spotting the “unknown unknowns”ResearchOrna Raz, Sam Ackerman, and Marcel Zalmanovici06 Jun 20215 minute readAIAI Testing