publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2025

  1. Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs
    Hannah Cyberey, Yangfeng Ji, and David Evans
    ArXiv Preprint, Feb 2025

2024

  1. The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models
    Hannah Cyberey, Yangfeng Ji, and David Evans
    ArXiv Preprint, Aug 2024
  2. Addressing Both Statistical and Causal Gender Fairness in NLP Models
    Hannah Cyberey, Yangfeng Ji, and David Evans
    In Findings of NAACL 2024, Jun 2024

2022

  1. Balanced Adversarial Training: Balancing Tradeoffs between Fickleness and Obstinacy in NLP Models
    Hannah Cyberey, Yangfeng Ji, and David Evans
    In EMNLP 2022, Dec 2022

2020

  1. Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph Theory
    Hannah Cyberey, Yangfeng Ji, and David Evans
    In Findings of EMNLP 2020, Nov 2020
  2. Pointwise Paraphrase Appraisal is Potentially Problematic
    Hannah Cyberey, Yangfeng Ji, and David Evans
    In ACL 2020 Student Research Workshop, Jul 2020