Large Language Models
Analysis | Diagrams | Thinkers
Who is winning the race to the bottom?
| LLM Vendor | All Purpose Model | Multi | Open Source | Voice/TTS | Image Gen | TACO Agent | Code Agent | Deep Research | Strengths |
|---|---|---|---|---|---|---|---|---|---|
| Anthropic | Claude 3.7 | MCP | TRUE | Creative and Socially Engaging, AI Coding | |||||
| DeepSeek | R1 V3 | TRUE | Cheap, Scientific Research | ||||||
| ElevenLabs | Turbo v2.5 | Voice cloning, TTS leader | |||||||
| Gemini 2.0 | Chirp STT | Imagen-3 | Cheap, Rounded, In-depth option | ||||||
| Meta | TRUE | ||||||||
| Microsoft | |||||||||
| Nous Research | TRUE | Decentralized | |||||||
| OpenAI | GPT-4o | TTS-1-HD, Whisper | DALL-E3 | Operator | |||||
| OpenAI o3 | o3 | ||||||||
| Perplexity | |||||||||
| Qwen | Qwen 2.5 | TRUE | Qwen3-TTS | Open source, multilingual | |||||
| Venice | TRUE | ||||||||
| XAI | Aurora |
- Multi Modal: Can interpret voice and images.
- Voice/TTS: Text-to-Speech, Speech-to-Text capabilities
- Image Gen: Graphic Design
- Strengths: Strongest Use Cases
- TACO Agent:
- Coding Agent:
- Deep Research: available or not
See AI Modalities for detailed breakdown of capability types (voice, vision, video, audio, 3D).
Model Selection
Constantly review Minimum Viable Toolkit to gain maximum leverage by focusing on one critical job to be done at a time.
- Identify a recurring need
- Search for the best tool
- Cost
- Speed
- Accuracy
- Master functionality
- Glue to workflows
If the tool does not exist, investigate building it.
Subject Expertise
- AI Prompting: How to instruct models most effectively
- AI Agents: AI Agents and the jobs they perform
- AI Coding: Best AI coding tools and strategies
Context
- Agent Frameworks
- Business Playbook
- SaaS Toolkit