Claude just got upgraded, and why you should use it over ChatGPT
| Get to know Anthropic's new Claude 3.5 sonnet and 'computer use' in simple terms.
On Oct 22, 2024, Anthropic announced upgraded Claude 3.5 sonnet, new model Claude 3.5 Haiku, and computer use.
We are going to see how they compares to other publicly available models like OpenAI’s GPT-4o or Google’s Gemini.
Claude 3.5 sonnet - Anthropic’s top model
Claude 3.5 Haiku - Fast and efficient model (uses less tokens/resources).
Claude 3.5 sonnet is now the top model in SWE-bench Verified.
SWE-bench tests language models on real-world GitHub issues (coding)
- Resolved 49%
Claude 3.5 sonnet is also higher on TAU-bench than GPT-4o.
Computer use (beta) -
AI can control/use your computer and apps on it (by looking at a screen, moving a cursor, clicking buttons, and typing text).
Computer use is still experimental and currently available with Claude 3.5 sonnet.
OSWorld Benchmark for Claude 3.5 Sonnet in screenshot-only category.
(OSWorld evaluates AI models’ ability to use computers like people)
- 50 steps - 22%
- 15 steps - 14.9 %
OpenAI (ChatGPT) offers GPT-4o and GPT-4o mini as free models. After 4 or 5 prompts, you will hit the free plan limit and get switched to 4o mini.
Claude 3.5 sonnet is also available for free for limited use. You should consider using Claude, because Sonnet is a better model than 4o especially for coding.
(Claude can’t create images or have internet access, but you can upload images or docs.)
*Comparison table is taken from Anthropic’s post