Comments

Comment by David Bolin (david-bolin) on No, really, it predicts next tokens. · 2023-04-18T14:46:46.727Z · LW · GW

To be fair, it outputs "no" two thirds of the time not because the OP was wrong, but because it interprets that as "ignore previous instructions."