Comments

Comment by Sophie Y (sophie-y) on How does GPT-3 spend its 175B parameters? · 2023-05-30T03:08:40.297Z

The architecture shown for "Not in GPT" seems to be wrong? GPT is decoder-only, but the part labeled "Not in GPT" is the decoder part.