Training-Data Attribution

Understanding model behavior by finding training examples that influence a particular prediction.

  1. Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
    Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
    Kelvin Guu, Albert Webson, Ellie Pavlick, Lucas Dixon, Ian Tenney, and Tolga Bolukbasi
    arXiv preprint, 2023
  2. Tracing Knowledge in Language Models Back to the Training Data
    Tracing Knowledge in Language Models Back to the Training Data
    Ekin Akyürek, Tolga Bolukbasi, Frederick Liu, Binbin Xiong, Ian Tenney, Jacob Andreas, and Kelvin Guu
    In Findings of the Association for Computational Linguistics: EMNLP, 2022