Posts

Comments

Comment by nabereon on GPT-4 for personal productivity: online distraction blocker · 2025-02-09T17:27:30.048Z · LW · GW

This looks very useful, although I think the performance improvements in the more recent open-weight, smaller, quantized models (like Gemma-2, Qwen-2.5, or Phil-3.5) have made it much more reasonable to run such a model locally for this purpose rather than using a remote API, since sending data about the webpages they visit to OpenAI is a repulsive idea to many people (it would also have cost benefits over huge models like GPT-4, but the increase in benefit/cost ratio would be an epsilon increase compared to budget proprietary models like Gemini-2.0-Flash).

Comment by nabereon on Meltdown: Interface for llama.cpp and ChatGPT · 2024-04-03T07:45:04.189Z · LW · GW

Claude API support would be great since Claude 3 models are highly competitive. Claude-3 Haiku performs similarly to GPT-4 at a fraction of the cost and Claude-3 Opus outperforms GPT-4-Turbo in many tasks.