AI systems can inadvertently leak sensitive information or compromise user privacy.
This area investigates techniques to safeguard privacy and ensure responsible data handling in AI applications.
Publications
July 2025
Robust Utility-Preserving Text Anonymization Based on Large Language Models
Tianyu Yang, Xiaodan Zhu, Iryna Gurevych
ACL 2025
Paper: Link Repository: GitHub
Jan. 2025
Differentially Private Steering for Large Language Model Alignment
Anmol Goel, Yaxi Hu, Iryna Gurevych, Amartya Sanyal
ICLR 2025
Paper: Link Repository: GitHub
Feb. 2025
Towards Privacy-aware Mental Health AI Models: Advances, Challenges, and Opportunities
Aishik Mandal, Tanmoy Chakraborty, Iryna Gurevych
Preprint under review
Paper: Link