veered's Shortform

lucas-hansen

veered's Shortform

post by veered (lucas-hansen) · 2023-04-07T17:39:41.941Z · LW · GW · 2 comments

2 comments

2 comments

Comments sorted by top scores.

comment by veered (lucas-hansen) · 2023-04-07T17:39:42.467Z · LW(p) · GW(p)

For GPT-style LLMs, is it possible to prove statements like the following?

Choose some tokens , $B$ and a fixed $L L M$ :

There does not exist a prefix of tokens $P$ such that $L L M (P + A) \to B$

More generally, is it possible to prove interesting universal statements? Sure, you can brute force it for LLMs with a finite context window but that's both infeasible and boring. And you can specifically construct contrived LLMs where this is possible but that's also boring.

I suspect that it's not possible/practical in general because the LLM can do arbitrary computation to predict the next token, but maybe I'm wrong.

Replies from: JBlack

↑ comment by JBlack · 2023-04-08T11:37:03.987Z · LW(p) · GW(p)

Yes, in general statements like this are theoretically possible to prove, but not remotely practical. There might be some specific (A,B,LLM) triples for which you can prove such a statement but I expect that none of these are generalizable to actually useful statements.

No GPT-style architecture is (in itself) capable of truly universal computation, but in practice functions they can implement are far beyond our ability to adequately analyze.

veered's Shortform

Contents

2 comments