Where to find reliable reviews of AI products?

post by Elizabeth (pktechgirl) · 2024-09-17T23:48:25.899Z · LW · GW · 1 comment

This is a question post.

Contents

  Answers
    2 Elizabeth
None
1 comment

Being able to quickly incorporate AI tools seems important, including for working on AI risk (people who disagree: there's a thread for doing so in the comments). But there are a lot of AI products and most of them suck.  Does anyone know a good source of reviews, or even just listing product features and naming obvious slop?

Answers

answer by Elizabeth · 2024-09-17T23:52:20.571Z · LW(p) · GW(p)

If you'd like to recommend a particular AI product, please reply to this thread. 

comment by Nathan Helm-Burger (nathan-helm-burger) · 2024-09-18T21:42:24.253Z · LW(p) · GW(p)

Nice idea having a thread for specific recommendations. The current best options are changing fast so I expect this thread to go out of date within a few months. Maybe even a few weeks.

Here's my preferences currently (Sept 2024):
Sonnet 3.5 - best all-round general model

Opus 3 - best at philosophy and complex theoretical discussions. Usually I only resort to this after I try Sonnet and notice that it fell slightly short at some point in the conversation.

o1-preview (strawberry) - does better on structured multi-step reasoning problems than Sonnet 3.5  Overall, I don't prefer it, but for something like coding, I'd probably want to at least compare its outputs with Sonnet 3.5's.

You.com and Perplexity AI  - they basically act as wrappers around other models, including Sonnet 3.5 and o1-preview. They add in desirable features like web search, and structured prompt chains for reflective reasoning. They could be handy if you're into that sort of thing. I personally have tried both of them out, and prefer just doing the prompting and web searching manually over the course of an LLM conversation to solve a problem.

Google AI notebook thingy - haven't tried it yet, but some people say it's impressive.

comment by habryka (habryka4) · 2024-09-18T22:58:23.669Z · LW(p) · GW(p)

I quite like Exa.ai for search purposes. It's not there yet to replace my default search, but it's been useful a few times. 

I strongly recommend Cursor for programming. Especially the "apply-model" (where you can press a button to have an AI model figure out how the changes that the chat model suggested to your code should actually apply) makes the experience miles better than vanilla VSCode. 

Midjourney is still the best image generator. They now have a website so you no longer need to use their janky Discord UI.

Suno is the best AI music generator. 

1 comment

Comments sorted by top scores.

comment by Elizabeth (pktechgirl) · 2024-09-17T23:48:52.697Z · LW(p) · GW(p)

People who think my premise is faulty: please give your arguments under this thread.