

Comment by dang on Gender Vectors in ROME’s Latent Space · 2023-05-22T15:57:24.176Z

Why are the output probabilities in your results so small in general?

Also, are other output capabilities of the network affected? For example, does the network's performance on any other task decrease? Ideally, I think that should not happen with your method, but as far as I can tell it would be hard to enforce or verify.

I also find it odd that the outputs after the gender are completely different. Is there a reason for that?