Previous Work on Recreating Neural Network Input from Intermediate Layer Activations

bglass

Previous Work on Recreating Neural Network Input from Intermediate Layer Activations

post by bglass · 2022-10-12T19:28:39.058Z · LW · GW · No comments

This is a question post.

  Answers
    5 the gears to ascenscion
    3 Garrett Baker
None
No comments

Recently I've been experimenting with recreating a neural network's input layer from intermediate layer activations.

The possibility has implications for interpretability. For example, if certain neurons are activated on certain input, you know those neurons are 'about' that type of input.

My question is: Does anyone know of prior work/research in this area?

I'd appreciate even distantly-related work. I may write a blog post about my experiments if there is an interest and if there isn't already adequate research in this area.

Answers

answer by the gears to ascension (the gears to ascenscion) · 2022-10-13T01:01:59.517Z · LW(p) · GW(p)

search quality: skimmed the abstracts search method: semantic scholar + browsing note that many of these results are kind of old

↑ comment by the gears to ascension (lahwran) · 2022-10-13T01:04:25.807Z · LW(p) · GW(p)

interesting to me but not what you asked for

https://www.semanticscholar.org/paper/Inducing-Causal-Structure-for-Interpretable-Neural-Geiger-Wu/ccd04c27bf1237368b35eb456b3dd1c18ef9a9b9

https://www.semanticscholar.org/paper/Interpreting-Deep-Learning%3A-The-Machine-Learning-Charles/b7488a0ac799a2c62882a5b40f4ea4b1c88f04c4 https://www.semanticscholar.org/paper/Minimizing-Control-for-Credit-Assignment-with-Meulemans-Farinha/0bb32a1b9a8702a38f54b64ca08df8abffc097a8 https://www.semanticscholar.org/paper/The-Union-of-Manifolds-Hypothesis-and-its-for-Deep-Brown-Caterini/3c0a4afc8f430f32442a8efa306f898d9198d7c5

answer by Garrett Baker · 2022-10-12T21:53:01.716Z · LW(p) · GW(p)

Myself and some others did some work looking at the mutual information between intermediate layers of a network, and it's input here [LW · GW].

No comments

Comments sorted by top scores.

Previous Work on Recreating Neural Network Input from Intermediate Layer Activations

Contents

Answers

No comments