docs: update cursor rules to prefer gemini/qwen over Claude

- Change default model ladder to gemini-3-flash → qwen → deepseek
- Add cost warning: never use Claude models (10-50x more expensive)
- Update model tier examples to exclude Claude
- Update DEFAULT_MODEL documentation
This commit is contained in:
Thomas Marchand
2025-12-19 11:59:11 +00:00
parent 7c9f280d65
commit 89e8dabe6f
2 changed files with 22 additions and 7 deletions

View File

@@ -93,15 +93,28 @@ src/
- **Expensive models**: high token cost, low failure rate
- **Optimal**: minimizes `E[total_cost] = Σ P(retry_n) × cost_n`
Default ladder: `claude-haiku-4.5` → `claude-sonnet-4` → `claude-sonnet-4.5`
Default ladder: `gemini-3-flash` → `qwen/qwen3-235b-a22b-instruct` → `deepseek/deepseek-v3.2`
### Model Preferences
**NEVER use Claude models** (anthropic/claude-*) - they are prohibitively expensive.
**Preferred models (in order):**
1. `google/gemini-3-flash-preview` - Fast, cheap, good tool use
2. `qwen/qwen3-235b-a22b-instruct` - Strong reasoning, affordable
3. `deepseek/deepseek-v3.2` - Good value, capable
4. `x-ai/grok-4.1-fast` - Fast alternative
5. `mistralai/mistral-large-2512` - European provider option
When implementing model selection or defaults, always prefer these models over Claude.
## Model Family System
The agent automatically upgrades outdated model names to the latest versions. This prevents issues where training data suggests old model names like `claude-3.5-sonnet` instead of the newer, cheaper, and smarter `claude-sonnet-4.5`.
The agent automatically upgrades outdated model names to the latest versions. This prevents issues where training data suggests old model names like `gemini-1.5-flash` instead of the newer `gemini-3-flash-preview`.
### How It Works
1. **Model Families**: Models are grouped into families (e.g., `claude-sonnet`, `gpt-4`)
1. **Model Families**: Models are grouped into families (e.g., `gemini-flash`, `gpt-4`, `deepseek`)
2. **Auto-Upgrade**: When an old model is requested, it's resolved to the latest in its family
3. **Aliases**: Common aliases like "sonnet" or "gpt4" resolve to the latest
@@ -116,9 +129,11 @@ The agent automatically upgrades outdated model names to the latest versions. Th
| Tier | Examples | Use Case |
|------|----------|----------|
| **flagship** | claude-opus-4.5, o1, deepseek-r1 | Complex reasoning, important tasks |
| **mid** | claude-sonnet-4.5, gpt-4.1, gemini-pro | Default for most tasks |
| **fast** | claude-haiku-4.5, gpt-4.1-mini | Quick, cheap tasks |
| **flagship** | deepseek-r1, o1, qwen3-235b | Complex reasoning, important tasks |
| **mid** | gemini-3-flash, deepseek-v3.2, grok-4.1 | Default for most tasks |
| **fast** | gpt-4.1-mini, gemini-flash | Quick, cheap tasks |
> ⚠️ **Cost Warning**: Never use Claude models (anthropic/*) - they are 10-50x more expensive than alternatives with similar capability.
### API Endpoints

View File

@@ -69,7 +69,7 @@ The `upload_image` tool requires a public storage bucket named `images`:
| Variable | Default | Description |
|----------|---------|-------------|
| `DEFAULT_MODEL` | `anthropic/claude-sonnet-4.5` | Default LLM |
| `DEFAULT_MODEL` | `google/gemini-3-flash-preview` | Default LLM (prefer gemini/qwen, never Claude) |
| `WORKING_DIR` | `/root` (prod), `.` (dev) | Working directory |
| `HOST` | `127.0.0.1` | Bind address |
| `PORT` | `3000` | Server port |