Posts
Comments
Comment by
Sophie Y (sophie-y) on
How does GPT-3 spend its 175B parameters?
·
2023-05-30T03:08:40.297Z ·
LW ·
GW
The architecture shown for "Not in GPT" seems to be wrong? GPT is decoder only. The part labeled as "Not in GPT" is decoder part.