Large Language Models
Map model functionality against jobs to be done to gain maximum leverage for an evolving business playbook.
LLM Vendor | Best Model | Open Source | Image Gen | Agent | Deep Research | Strengths |
---|---|---|---|---|---|---|
Anthropic | Claude 3.5 | | | | | Creative and Socially Engaging, AI Coding |
DeepSeek | R1, V3 | TRUE | | | | Cheap, Scientific Research |
Google | Gemini 2.0 | | Imagen 3 | | | Cheap, Rounded, In-depth option |
Meta | | TRUE | | | | |
Microsoft | | | | | | |
OpenAI | GPT-4o | | DALL-E 3 | Operator | | |
OpenAI | o3 | | | | | |
xAI | | | Aurora | | | |
Column key:
- Image Gen: AI Art Direction
- Strengths: Strongest Use Cases
- Agent: Native agent operations
- Deep Research: available or not
Analysis Tools
- countless.dev: See and compare every AI model easily
Open Source LLMs:
Context
- AI Prompting: How to instruct models most effectively
- AI Agents: AI Agents and the jobs they perform
- AI Coding: Best AI coding tools and strategies
- AI Architecture: Platform requirements
- Business Playbook: How to use AI in business
- SaaS Marketplace: How AI is shaking up the SaaS Marketplace
Model Selection
Review your Minimum Viable Toolkit regularly, and gain maximum leverage by focusing on one critical job to be done at a time.
- Identify a recurring need
- Search for the best tool
  - Cost
  - Speed
  - Accuracy
- Master functionality
- Glue to workflows
If the tool does not exist, investigate building it.
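The search step above weighs cost, speed, and accuracy against each other. One way to make that trade-off explicit is a weighted score, sketched below. The `Tool` fields, the tool names, the numbers, and the weights are all hypothetical placeholders, not measurements of any real product.

```python
from dataclasses import dataclass


@dataclass
class Tool:
    name: str
    cost: float      # hypothetical: USD per month, lower is better
    speed: float     # hypothetical: seconds per task, lower is better
    accuracy: float  # hypothetical: fraction of tasks done correctly, higher is better


def normalize(values, lower_is_better):
    """Scale values to the range 0..1 so that 1.0 is always the best candidate."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0 for _ in values]
    scaled = [(v - lo) / (hi - lo) for v in values]
    return [1.0 - s for s in scaled] if lower_is_better else scaled


def rank_tools(tools, weights=(0.3, 0.2, 0.5)):
    """Return (score, name) pairs, best first. Weights are (cost, speed, accuracy)."""
    w_cost, w_speed, w_acc = weights
    cost = normalize([t.cost for t in tools], lower_is_better=True)
    speed = normalize([t.speed for t in tools], lower_is_better=True)
    acc = normalize([t.accuracy for t in tools], lower_is_better=False)
    scored = [
        (w_cost * c + w_speed * s + w_acc * a, t.name)
        for t, c, s, a in zip(tools, cost, speed, acc)
    ]
    return sorted(scored, reverse=True)


# Placeholder candidates for a single job to be done.
candidates = [
    Tool("tool_a", cost=20.0, speed=4.0, accuracy=0.90),
    Tool("tool_b", cost=0.0, speed=8.0, accuracy=0.80),
    Tool("tool_c", cost=50.0, speed=2.0, accuracy=0.95),
]
ranking = rank_tools(candidates)
for score, name in ranking:
    print(f"{name}: {score:.2f}")
```

Changing the weights is the point of the exercise: a job where errors are expensive should weight accuracy heavily, while a high-volume job may favor cost.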
Evaluation Tools
Compare LLM performance across different tasks.
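As a minimal sketch of what such a comparison involves, the snippet below takes per-task scores and reports the best model for each task plus an overall average. The model names and scores are placeholders chosen for illustration, not real benchmark results, and the function names are hypothetical.

```python
from statistics import mean

# Placeholder scores between 0 and 1 per task; NOT real benchmark results.
scores = {
    "model_x": {"coding": 0.75, "summarization": 0.5, "research": 0.25},
    "model_y": {"coding": 0.5, "summarization": 0.75, "research": 0.75},
}


def best_per_task(scores):
    """Map each task to the model with the highest score on that task."""
    tasks = {task for per_model in scores.values() for task in per_model}
    return {
        task: max(scores, key=lambda m: scores[m].get(task, 0.0))
        for task in sorted(tasks)
    }


def overall(scores):
    """Average each model's scores across all of its tasks."""
    return {model: mean(per_task.values()) for model, per_task in scores.items()}


print(best_per_task(scores))
print(overall(scores))
```

Looking at the per-task winners rather than only the average fits the one-job-at-a-time approach: the model with the best average is not necessarily the best at the job that matters.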
Inference Interfaces
Wrappers between user and multiple models.
Conversational User Interfaces (CUIs) let users interact with LLMs through natural language, which makes the experience more intuitive and engaging.
Service | Privacy | Open Source | Anthropic | Claude | DeepSeek |
---|---|---|---|---|---|
Groq | | | | | |
Ninja Chat | | | | | |
Perplexity | | | | | |
Morphic | | TRUE | | | |
Venice | TRUE | | | | |
You | | | | | |
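The idea of a wrapper between the user and multiple models can be sketched as a small router that sends one prompt to any registered backend. This is an illustration only: `ModelRouter` and the stub backends are hypothetical, and a real interface would replace the stubs with calls to each vendor's own API.

```python
from typing import Callable, Dict

# A backend is any function that takes a prompt and returns a reply.
Backend = Callable[[str], str]


class ModelRouter:
    """Hypothetical wrapper that exposes several models behind one interface."""

    def __init__(self) -> None:
        self._backends: Dict[str, Backend] = {}

    def register(self, name: str, backend: Backend) -> None:
        self._backends[name] = backend

    def ask(self, model: str, prompt: str) -> str:
        """Send the prompt to one named model."""
        if model not in self._backends:
            raise KeyError(f"unknown model: {model}")
        return self._backends[model](prompt)

    def ask_all(self, prompt: str) -> Dict[str, str]:
        """Send the same prompt to every registered model and collect the replies."""
        return {name: backend(prompt) for name, backend in self._backends.items()}


# Stub backends standing in for real model calls (assumption: no network access).
def echo_backend(prompt: str) -> str:
    return f"echo: {prompt}"


def upper_backend(prompt: str) -> str:
    return prompt.upper()


router = ModelRouter()
router.register("echo", echo_backend)
router.register("upper", upper_backend)
answers = router.ask_all("hello")
print(answers)
```

Keeping backends as plain callables means a new model can be added without touching the router, which is the main appeal of this kind of wrapper.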