Posts

Comments

Comment by Rasool on Eulogy to the Obits · 2025-04-22T09:13:56.697Z · LW · GW

It looks like this is a linkpost to:

https://press.asimov.com/articles/obit

Comment by Rasool on jacquesthibs's Shortform · 2025-04-20T13:41:09.887Z · LW · GW

Might Leopold Aschenbrenner also be involved? He runs an investment fund with money from Nat Friedman, Daniel Gross, and Patrick Collison, so the investment in Mechanize might have come from that?

https://situationalawarenesslp.com/

https://www.forourposterity.com/

Comment by Rasool on GPT-4.1 Is a Mini Upgrade · 2025-04-17T08:33:24.997Z · LW · GW

Does this match your understanding?

 

AI CompanyPublic/Preview NameHypothesized Base ModelHypothesized EnhancementNotes
OpenAIGPT-4oGPT-4oNone (Baseline)The starting point, multimodal model.
OpenAIo1GPT-4oReasoningFirst reasoning model iteration, built on the GPT-4o base. Analogous to Anthropic's Sonnet 3.7 w/ Reasoning.
OpenAIGPT-4.1GPT-4.1NoneAn incremental upgrade to the base model beyond GPT-4o.
OpenAIo3GPT-4.1ReasoningPrice/cutoff suggest it uses the newer GPT-4.1 base, not GPT-4o + reasoning.
OpenAIGPT-4.5GPT-4.5NoneA major base model upgrade
OpenAIGPT-5GPT-4.5Reasoning"GPT-5" might be named this way, but technologically be GPT-4.5 + Reasoning.
AnthropicSonnet 3.5Sonnet 3.5NoneExisting model.
AnthropicSonnet 3.7 w/ ReasoningSonnet 3.5ReasoningBuilt on the older Sonnet 3.5 base, similar to how o1 was built on GPT-4o.
AnthropicN/A (Internal)Newer SonnetNoneInternal base model analogous to OpenAI's GPT-4.1.
AnthropicN/A (Internal)Newer SonnetReasoningInternal reasoning model analogous to OpenAI's "o3".
AnthropicN/A (Internal)Larger OpusNoneInternal base model analogous to OpenAI's GPT-4.5.
AnthropicN/A (Internal)Larger OpusReasoningInternal reasoning model analogous to hypothetical GPT-4.5 + Reasoning.
GoogleN/A (Internal)Gemini 2.0 ProNonePlausible base model for Gemini 2.5 Pro according to the author.
GoogleGemini 2.5 ProGemini 2.0 ProReasoningAuthor speculates it's likely Gemini 2.0 Pro + Reasoning, rather than being based on a GPT-4.5 scale model.
GoogleN/A (Internal)Gemini 2.0 UltraNoneHypothesized very large internal base model. Might exist primarily for knowledge distillation (Gemma 3 insight).
Comment by Rasool on AI #102: Made in America · 2025-04-12T10:46:30.112Z · LW · GW

I actually ended up listening to this episode and found it quite high-signal. Lex kept his peace-and-love-kumbaya stuff to a minimum and Dylan and Nathan actually went quite deep on specifics like innovations in Deepseek V3/R1/R1Zero, and hardware and export controls

Comment by Rasool on OpenAI #12: Battle of the Board Redux · 2025-04-06T09:20:40.022Z · LW · GW

Matt Levine, in response to:

If you lie to board members about other board members in an attempt to gain control over the board, I assert that the board should fire you, pretty much no matter what

 writes:

No! Wrong! Not no matter what! In a normal company with good governance, absolutely. Lying to the board is the main bad thing that the CEO can do, from a certain perspective. But there are definitely some companies — Elon Musk runs like eight of them, but also OpenAI — where, if you lie to board members about other board members in an attempt to gain control over the board, the board members you lie about should probably say “I’m sure that deep down this is our fault, we’re sorry we made you lie about us, we’ll see ourselves out.”

To be clear, I am very sympathetic to the OpenAI board’s confusion. This was not a simple dumb mistake. They did not think “we are the normal board of a normal public company, and we have to supervise our CEO to make sure that he pursues shareholder value effectively.” This was a much weirder and more reasonable mistake. They thought “we are the board of a nonprofit set up to pursue the difficult and risky mission of achieving artificial general intelligence for the benefit of humanity, and we have to supervise our CEO to make sure he does that.” Lying to the board seems quite bad as a matter of, you know, AI misalignment. 

Comment by Rasool on How I force LLMs to generate correct code · 2025-03-22T09:43:47.558Z · LW · GW

Am I correct in thinking that you posted this a couple of days ago (with a different title - now deleted), and this version has no substantial changes?

Comment by Rasool on Mo Putera's Shortform · 2025-03-21T14:17:48.703Z · LW · GW

Another good blog:
https://nintil.com/mistakes

Comment by Rasool on Joseph Miller's Shortform · 2025-02-18T16:14:44.388Z · LW · GW

The 200k GPU number has been mentioned since October (Elon tweet, Nvidia announcement), so are you saying that that they managed to get the model trained so fast is what beat the predictions you heard?

Comment by Rasool on Yonatan Cale's Shortform · 2025-02-18T09:52:42.856Z · LW · GW

I met someone in SF doing this but cannot remember the name of the company! If I remember I'll let you know

One idea I thought would be cool related to this is to have several LLMs with different 'personalities' each giving different kinds of feedback. Eg. a 'critic', an 'aesthete', a 'layperson', so just like in Google Docs where you get comments from different people, here you can get inline feedback from different kinds of readers 

Comment by Rasool on Florian_Dietz's Shortform · 2025-02-18T09:48:56.868Z · LW · GW

There is usually a Google Sheet export of the Swapcard data provided, which makes this easier - but at previous conferences other attendees were apprehensive when informed that people were doing this

Comment by Rasool on AI #102: Made in America · 2025-02-13T08:33:46.159Z · LW · GW

Haven't used it much but dexa.ai tries to let you interact with podcast episodes, here's this episode:

https://dexa.ai/d/e2fc9f6e-e1d5-11ef-8e88-ffec9447dc76

Comment by Rasool on Tail SP 500 Call Options · 2025-01-23T18:05:10.944Z · LW · GW

What do you make of Hynix?

Comment by Rasool on Tax Price Gouging? · 2025-01-18T12:28:31.692Z · LW · GW

There is a very good Rationally Speaking podcast episode about this - one solution that is proposed by economist Ami Glazer is to not restrict pricing, but then issue vouchers or cash to those who need it. Glazer brings up that this is how the food stamp system works at present

That episode goes into other topics around this issue, like hoarding, rationing, positive externalities (eg. face masks protect not just the wearer but those around them)

Comment by Rasool on Patent Trolling to Save the World · 2025-01-18T12:15:08.596Z · LW · GW

A bit of a tangent, but economist Alex Tabarrok has talked about buying coal mines in order to not mine coal

One of the challenges until recently (as outlined in that link) was:

There are also some crazy “use it or lose it” laws that say that you can’t buy the right to extract a natural resource and not use it. When the high-bidder for an oil and gas lease near Arches National Park turned out to be an environmentalist the BLM cancelled the contract!

Comment by Rasool on [Cross-post] Welcome to the Essay Meta · 2025-01-17T09:11:00.325Z · LW · GW

This is another one that was doing the rounds in the UK progress / YIMBY / growth space:

https://ukfoundations.co/

Comment by Rasool on Mo Putera's Shortform · 2025-01-16T17:02:18.625Z · LW · GW

How interesting, I was curious about copyright etc but this is annotated by the author himself!

Comment by Rasool on AGI Will Not Make Labor Worthless · 2025-01-13T23:29:53.655Z · LW · GW

Base rates, historical context, it is debated in this highly-upvoted post

Comment by Rasool on AGI Will Not Make Labor Worthless · 2025-01-13T10:39:27.502Z · LW · GW

I don't think this post deserves to be downvoted so much (currently sitting at -11)

Even if one disagrees with the main thesis, it's not a low-quality post, and does add to the debate