Hannah Cyberey

prof_pic.jpg

I’m a Computer Science PhD candidate at the University of Virginia, advised by Prof. David Evans and Prof. Yangfeng Ji. I’m broadly interested in topics on AI safety and ethics. My primary research focuses on trustworthy natural language processing (NLP), addressing issues related to robustness and fairness of language models. Currently, my work explores representation engineering methods for mitigating bias and countering censorship.

Email: hannahcyberey at virginia dot edu

news

Jul 08, 2025 Our paper “Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control” is accepted to COLM 2025!
Apr 24, 2025 Our paper “Do Prevalent Bias Metrics Capture Allocational Harms from LLMs?” is accepted to the Workshop on Insights from Negative Results in NLP
May 09, 2024 I passed my PhD dissertation proposal defense!
Apr 03, 2024 Our paper “Addressing Both Statistical and Causal Gender Fairness in NLP Models” is accepted to NAACL 2024 Findings.
Jun 07, 2023 Co-leading Causal Learning Reading Group this summer at UVA, along with Anshuman Suri

latest posts