Training-Data Attribution

Understanding model behavior by finding training examples that influence a particular prediction.

  1. Scalable Influence and Fact Tracing for Large Language Model Pretraining
    Scalable Influence and Fact Tracing for Large Language Model Pretraining
    Tyler A. Chang, Dheeraj Rajagopal, Tolga Bolukbasi, Lucas Dixon, and Ian Tenney
    arXiv preprint, 2024
  2. Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
    Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
    Kelvin Guu, Albert Webson, Ellie Pavlick, Lucas Dixon, Ian Tenney, and Tolga Bolukbasi
    arXiv preprint, 2023
  3. Tracing Knowledge in Language Models Back to the Training Data
    Tracing Knowledge in Language Models Back to the Training Data
    Ekin Akyürek, Tolga Bolukbasi, Frederick Liu, Binbin Xiong, Ian Tenney, Jacob Andreas, and Kelvin Guu
    In Findings of the Association for Computational Linguistics: EMNLP, 2022