Training Data Attribution: Examining Its Adoption & Use Cases

post by Deric Cheng (deric-cheng), Justin Bullock (justin-bullock), David_Kristoffersson · 2025-01-22T15:41:19.744Z

This is a link post for https://www.convergenceanalysis.org/research/training-data-attribution-tda-examining-its-adoption-use-cases

Note: This report was written in June 2024 and is based on research originally commissioned by the Future of Life Foundation (FLF). The views and opinions expressed in this document are those of the authors and do not represent the positions of FLF.

This report investigates Training Data Attribution (TDA): how important it might be for reducing extreme risks from AI, and how tractable it is as a means of doing so. TDA techniques aim to identify the training data points that most influence specific model outputs. They are motivated by a counterfactual question: how would the model's behavior change if one or more data points were removed from or added to the training dataset?
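
To make the counterfactual question concrete, here is a minimal sketch of leave-one-out attribution on a toy one-parameter regression. It is an illustration only and is not taken from the report: the data, the function names (`fit_slope`, `leave_one_out_influence`), and the choice of model are all assumptions made for the example. Retraining from scratch for every data point is only feasible at this toy scale; practical TDA methods approximate this kind of quantity rather than computing it exactly.

```python
def fit_slope(points):
    """Least-squares slope w for the model y = w * x (no intercept)."""
    sxy = sum(x * y for x, y in points)
    sxx = sum(x * x for x, _ in points)
    return sxy / sxx


def leave_one_out_influence(points, x_query):
    """For each training point, return how much the prediction at x_query
    changes when that point is removed and the model is refit."""
    full_pred = fit_slope(points) * x_query
    influences = []
    for i in range(len(points)):
        reduced = points[:i] + points[i + 1:]  # training set without point i
        reduced_pred = fit_slope(reduced) * x_query
        influences.append(full_pred - reduced_pred)
    return influences


# Hypothetical toy data: three points on y = x, plus one outlier at (4, 8).
train = [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 8.0)]
influences = leave_one_out_influence(train, x_query=5.0)

# Index of the training point whose removal changes the prediction the most.
most_influential = max(range(len(train)), key=lambda i: abs(influences[i]))
print(most_influential, influences)
```

In this toy setup the outlier has the largest influence on the prediction at the query point, which is the kind of signal TDA techniques try to surface for much larger models.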

Report structure:

Key takeaways from our report:
