Posts

Comments

Comment by Charles Martin (charles-martin) on Basic Facts about Language Model Internals · 2023-01-05T03:20:40.375Z · LW · GW

The PowerLaw behavior has been noted for some time.  See https://weightwatcher.ai  and the publications in JMLR and Nature Communications.