veered's Shortform
post by veered (lucas-hansen) · 2023-04-07T17:39:41.941Z · LW · GW · 2 commentsContents
2 comments
2 comments
Comments sorted by top scores.
comment by veered (lucas-hansen) · 2023-04-07T17:39:42.467Z · LW(p) · GW(p)
For GPT-style LLMs, is it possible to prove statements like the following?
Choose some tokens , and a fixed :
There does not exist a prefix of tokens such that
More generally, is it possible to prove interesting universal statements? Sure, you can brute force it for LLMs with a finite context window but that's both infeasible and boring. And you can specifically construct contrived LLMs where this is possible but that's also boring.
I suspect that it's not possible/practical in general because the LLM can do arbitrary computation to predict the next token, but maybe I'm wrong.
Replies from: JBlack↑ comment by JBlack · 2023-04-08T11:37:03.987Z · LW(p) · GW(p)
Yes, in general statements like this are theoretically possible to prove, but not remotely practical. There might be some specific (A,B,LLM) triples for which you can prove such a statement but I expect that none of these are generalizable to actually useful statements.
No GPT-style architecture is (in itself) capable of truly universal computation, but in practice functions they can implement are far beyond our ability to adequately analyze.