Posts

Comments

Comment by Lee.aao (leonid-artamonov) on On Dwarkesh’s Podcast with OpenAI’s John Schulman · 2024-05-28T08:27:51.899Z · LW · GW
  •  
    1. Note: It was a 100-point Elo improvement based on the ‘gpt2’ tests prior to release, but GPT-4o itself while still on top saw only a more modest increase.
  •  

Didn't he meant the early GPT-4 vs GPT-4 turbo?



As I get it, it's the same pre-trained model, but with more post-training work.
GPT-4o is probably a newly trained model, so you can't compare it like that.

Comment by Lee.aao (leonid-artamonov) on AI #58: Stargate AGI · 2024-04-09T15:17:13.041Z · LW · GW

and these aren’t normies, they work on tech, high paying 6 figure salaries, very up to date with current events.

If you are a true normie not working in tech, it makes sense to be unaware of such details. You are missing out, but I get why.

If you are in tech, and you don’t even know GPT-4 versus GPT-3.5? Oh no.


Is it just me, or do you also feel intellectually lonely lately? 

I think my relatives and most of my friends think I'm crazy for thinking and talking so much about AI. And they listen to me more out of respect and politeness than out of any real interest in the topic.

Comment by Lee.aao (leonid-artamonov) on AI Timelines · 2024-03-22T21:36:22.079Z · LW · GW

Ege, do you think you'd update if you saw a demonstration of sophisticated sample-efficient in-context learning and far-off-distribution transfer?
 

Yes.

Suppose it could get decent at the first-person-shooter after like a subjective hour of messing around with it. If you saw that demo in 2025, how would that update your timelines?

I would probably update substantially towards agreeing with you.


DeepMind released an early-stage research model SIMA: https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/

It was tested on 600 basic (10-sec max) videogame skills and had only video from the screen + text with the task. The main takeaway is that an agent trained on many games performs in a new unseen game almost as well as another agent, trained specifically on this game.



Seems like by 2025 its really possible to see more complex generalization (harder tasks and games, more sample efficiency) as in your crux for in-context learning.

Comment by Lee.aao (leonid-artamonov) on Report on Frontier Model Training · 2023-12-10T14:28:20.267Z · LW · GW

Since OpenAI are renting MSFT compute for both training and inference.. 
Seems reasonable to think that inference >> training.  Am I right? 

Comment by Lee.aao (leonid-artamonov) on Report on Frontier Model Training · 2023-12-09T15:01:49.883Z · LW · GW

Is there a cheap of free way to read Semianalysis posts? 
Cant afford the $500 subscription sadly