This document is a snapshot of the current built-in model presets in KForge. It reflects the state at the time of Phase 3.12 and is intended as a reference for documentation, maintenance, and future remote presets work.
Cost is represented by color labels, and usage is represented separately.
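Legend (cost labels as they appear in the tables): 🔴 Paid, 🟡 Paid, 🟢 Paid, 🔵 Free

Usage (labels as they appear in the tables): Heavy, Main, Sandbox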
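Because cost and usage are tracked as separate labels, a preset entry can be modeled with two independent fields. The sketch below is only an assumption about shape, not KForge's actual schema: the `Cost`, `Usage`, and `Preset` names, the color strings, and the `by_usage` helper are all hypothetical. The three rows are copied from the claude-* entries in this document.

```python
from dataclasses import dataclass
from enum import Enum


class Cost(Enum):
    # Hypothetical names for the four color labels used in the tables.
    RED_PAID = "red"
    YELLOW_PAID = "yellow"
    GREEN_PAID = "green"
    BLUE_FREE = "blue"


class Usage(Enum):
    HEAVY = "Heavy"
    MAIN = "Main"
    SANDBOX = "Sandbox"


@dataclass(frozen=True)
class Preset:
    model: str
    cost: Cost    # color label, independent of usage
    usage: Usage  # usage label, independent of cost
    notes: str = ""


PRESETS = [
    Preset("claude-opus-4-5", Cost.RED_PAID, Usage.HEAVY, "Highest capability; use sparingly"),
    Preset("claude-sonnet-4-5", Cost.YELLOW_PAID, Usage.MAIN, "Balanced default for dev + writing"),
    Preset("claude-haiku-4-5", Cost.GREEN_PAID, Usage.SANDBOX, "Cheap + fast; small tasks"),
]


def by_usage(presets, usage):
    """Return model names with the given usage label, ignoring cost."""
    return [p.model for p in presets if p.usage is usage]


print(by_usage(PRESETS, Usage.MAIN))  # ['claude-sonnet-4-5']
```

Keeping the two fields separate means a filter on usage never has to know which cost colors exist, and the reverse.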

Claude:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| claude-opus-4-5 | 🔴 Paid | Heavy | Highest capability; use sparingly |
| claude-sonnet-4-5 | 🟡 Paid | Main | Balanced default for dev + writing |
| claude-haiku-4-5 | 🟢 Paid | Sandbox | Cheap + fast; small tasks |

Note: there is no free Claude preset, consistent with provider pricing.

Custom:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| deepseek-chat | 🔵 Free | Sandbox | Provider-dependent |
| llama-3.1-8b-instant | 🔵 Free | Sandbox | Provider-dependent |
| mistral-medium-latest | 🟡 Paid | Main | Paid workhorse |
| mistral-small-latest | 🟢 Paid | Sandbox | Low-cost paid |
| openai/gpt-4o-mini | 🟢 Paid | Sandbox | Cheap paid default |

⚪ "Free" here is endpoint-dependent and not guaranteed.

DeepSeek:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| deepseek-reasoner | 🟡 Paid | Main | Stronger reasoning; slower/costlier |
| deepseek-chat | 🟢 Paid | Sandbox | Cheap general chat |

Gemini:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| gemini-3-pro-preview | 🔴 Paid | Heavy | Preview; high capability |
| gemini-2.5-pro | 🟡 Paid | Main | Strong reasoning |
| gemini-3-flash-preview | 🟡 Paid | Main | Preview; may change |
| gemini-2.5-flash | 🟢 Paid | Sandbox | Fast |
| gemini-2.5-flash-lite | 🟢 Paid | Sandbox | Fast + cheap |

Note: preview models are subject to change or removal.

Groq:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| llama-3.3-70b-versatile | 🟡 Paid | Main | Large + fast |
| llama-3.1-8b-instant | 🟢 Paid | Sandbox | Very fast |

Mistral:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| codestral-latest | 🟡 Paid | Main | Coding-focused |
| mistral-small-latest | 🟢 Paid | Sandbox | General starter |

Ollama:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| codellama:13b | 🔴 Paid* | Heavy | Local compute cost |
| deepseek-coder:6.7b | 🟡 Paid* | Main | Local |
| llama3.1:8b | 🟡 Paid* | Main | Local default |
| qwen2.5-coder:7b | 🟡 Paid* | Main | Local |
| mistral:7b | 🟢 Paid* | Sandbox | Fast |
| qwen2.5-coder:1.5b | 🟢 Paid* | Sandbox | Very fast |

\* Paid refers to local hardware/energy cost, not API billing.

OpenAI:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| gpt-4.1-mini | 🟡 Paid | Main | Day-to-day |
| gpt-5-mini | 🟢 Paid | Sandbox | Cheap paid testing |

OpenRouter:

| Model | Cost | Usage | Notes |
|---|---|---|---|
| meta-llama/llama-3.3-70b-instruct:free | 🔵 Free | Sandbox | Rotating / rate-limited |
| mistralai/devstral-2512:free | 🔵 Free | Sandbox | Rotating |
| qwen/qwen3-coder:free | 🔵 Free | Sandbox | Availability may change |
| xiaomi/mimo-v2-flash:free | 🔵 Free | Sandbox | Availability may change |

Note: OpenRouter free models are not guaranteed and may disappear without notice.

Models listed under more than one provider:

| Model | Providers |
|---|---|
| deepseek-chat | Custom, DeepSeek |
| llama-3.1-8b-instant | Custom, Groq |
| mistral-small-latest | Custom, Mistral |

Overlaps are intentional and reflect different tradeoffs (cost, routing, availability, privacy).
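
The overlap table can be re-derived from the per-provider model lists, which is a handy check when presets change. The following is a minimal sketch, not KForge code: `PROVIDER_MODELS` and `find_overlaps` are hypothetical names, and the assignment of model lists to the Custom, DeepSeek, Groq, and Mistral providers is inferred from the overlap table itself.

```python
from collections import defaultdict

# Assumed provider -> model mapping, limited to the providers that appear
# in the overlap table. Model names are copied from the preset tables.
PROVIDER_MODELS = {
    "Custom": [
        "deepseek-chat",
        "llama-3.1-8b-instant",
        "mistral-medium-latest",
        "mistral-small-latest",
        "openai/gpt-4o-mini",
    ],
    "DeepSeek": ["deepseek-reasoner", "deepseek-chat"],
    "Groq": ["llama-3.3-70b-versatile", "llama-3.1-8b-instant"],
    "Mistral": ["codestral-latest", "mistral-small-latest"],
}


def find_overlaps(provider_models):
    """Map each model listed under 2+ providers to its sorted provider names."""
    seen = defaultdict(set)
    for provider, models in provider_models.items():
        for model in models:
            seen[model].add(provider)
    return {m: sorted(p) for m, p in sorted(seen.items()) if len(p) > 1}


for model, providers in find_overlaps(PROVIDER_MODELS).items():
    print(f"| {model} | {', '.join(providers)} |")
```

With the lists above, the printed rows match the three rows of the overlap table.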
High risk:

- `:free` models
- `*-preview` models

Medium risk:

Low risk:
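
The two high-risk name patterns (a `:free` suffix and a `-preview` suffix) can be checked mechanically. This is a small sketch under that reading of the patterns; `is_high_risk` is a hypothetical helper, and it deliberately does not try to classify medium or low risk, since those criteria are not spelled out here.

```python
def is_high_risk(model: str) -> bool:
    """True if the model name matches a high-risk pattern.

    Assumed patterns: names ending in ':free' or ending in '-preview'.
    """
    return model.endswith(":free") or model.endswith("-preview")


candidates = [
    "qwen/qwen3-coder:free",
    "gemini-3-pro-preview",
    "gemini-2.5-pro",
    "claude-opus-4-5",
]
print([m for m in candidates if is_high_risk(m)])
```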