Last updated: October 4, 2025 • PRs/issues welcome
Languages: Español • Português • 中文 • Français • 日本語 • हिन्दी • Türkçe
Many AI coding tools claim to be "free," but access to pro-grade models usually runs out fast, then you're downgraded. Each tool uses different limits (credits, tokens, requests), making comparison difficult. This list puts them side by side to show what you actually get for free.
(tools with higher limits listed first)
| Tool | Pro‑grade models | Free tier limit | Credit card |
|---|---|---|---|
| Qwen Code | Qwen3-Coder-480B | 2,000 requests/day | No |
| Rovo Dev CLI | Claude Sonnet 4 | 5M tokens/day (beta) | No |
| Gemini CLI | Gemini 2.5 Pro | 100 requests/day | No |
| Kilo Code | Claude Opus/Sonnet, Gemini 2.5 Pro, GPT‑4.1 | $20 signup credits (one‑time) | Yes |
| Warp | GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro | 150 requests/month | No |
| Trae | Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro | 10 fast + 50 slow requests/month | No |
| Amazon Q Developer | Claude Sonnet 4 | 50 agentic requests/month | Yes |
| GitHub Copilot | GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1 | 50 chat requests + 2,000 completions/month | No |
| Windsurf | OpenAI, Anthropic, Google, xAI | 25 credits/month | Yes |
| Jules | Gemini 2.5 Pro | 15 tasks/day | No |
| AWS Kiro | Claude 4 Sonnet, Claude 3.7 Sonnet | 50 credits/month | No |
| Qoder | Qwen3-Coder-480B, Claude, GPT, Gemini | Limited credits (preview) | No (preview) |
Real usage varies widely by coding style, task complexity, and tool implementation. Help improve this resource by sharing your actual experience →
Only models achieving >60% on SWE-bench Verified qualify as pro-grade for real-world coding tasks. Below is the current list
| Model | SWE-bench Verified | Provider |
|---|---|---|
| GPT-5 | 74.9% | OpenAI |
| Claude Opus 4.1 | 74.5% | Anthropic |
| Claude Sonnet 4 | 72.7% (80.2% w/ parallel) | Anthropic |
| GPT-5 mini | 71.0% | OpenAI |
| Qwen3-Coder-480B | 69.6% (interactive) / 67.0% (single) | Alibaba |
| Gemini 2.5 Pro | 63.2% |
Help improve this resource by sharing your actual usage experience → Submit your feedback
If you spot an error, missing source link, or have updated quota/model information, please open an issue or pull request with a source. New tool contributions are welcomed! See CONTRIBUTING.md for detailed guidelines.
No affiliation with any vendor. All trademarks belong to their owners. Information is for research; accuracy not guaranteed; limits/pricing change frequently.
- 1. AI-coding Tools with Free Access to Pro-Grade Models
- 2. API Providers for AI Coding Tools
- 3. Tools with Paid Tiers with Pro-Grade Models
- 4. Tools with Free Access to Basic Models
- 5. Local Models
- Comparison Notes
- Related Resources
(ordered from most generous to least)
Qwen3-Coder-480B access
- 2,000 requests/day free tier via Qwen OAuth
- 60 requests/minute rate limit
- Command-line AI workflow tool (adapted from Gemini CLI)
- One-click browser authentication
- No credit card required
Links: GitHub | Documentation
Claude Sonnet 4 access during beta
- 5M tokens/day free tier (20M on first day only)
- Claude Sonnet 4 model (confirmed via testing)
- No credit card required during beta
- Token limits reset at midnight UTC
- Note: Upgrade to Jira Standard/Premium/Enterprise for 20M tokens/day
Links: Documentation | Token Limits
Gemini 2.5 Pro access
- 100 requests/day limit for Gemini 2.5 Pro
- 250 requests/day limit for Gemini 2.5 Flash
- No credit card required
- Google models only
- Switches to paid rates after free quota
Links: Rate Limits | Pricing
Claude Opus/Sonnet, Gemini 2.5 Pro, GPT-4.1 access
- $20 signup credits (one-time bonus)
- Open source VS Code extension
- Pay-as-you-go with no markup on model pricing
- Credit card required to claim bonus credits
- Supports bringing your own API keys
Links: GitHub | Documentation | Pricing
GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro access
- 150 requests/month limit
- Multiple providers (OpenAI GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro)
- No credit card required for basic signup
- Pay-as-you-go overages available
Links: Pricing
Claude Sonnet 4 access
- 50 agentic requests/month limit (multi-turn conversations)
- Latest Claude models (AWS-hosted)
- Credit card required
- Must upgrade to Pro for continued access
- Perpetual free tier
Links: Pricing
Agent Mode with GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1
- 50 chat requests + 2,000 completions/month limit
- Agent Mode with autonomous multi-step coding
- Multiple providers (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
- No credit card required
- Limited to basic features after quota
Links: Plans Details | Agent Mode
Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro access
- 10 fast requests + 50 slow requests/month for premium models
- 1,000 slow requests/month for advanced models
- 5,000 auto-completions/month
- VS Code-based IDE with AI integration
- Multiple premium models including Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1
- No credit card required for free tier
- Pro Plan: $10/mo (600 fast + unlimited slow requests)
Links: Pricing | Documentation
OpenAI, Anthropic, Google, xAI model access
- 25 prompt credits/month limit
- Multiple providers (OpenAI, Claude, Gemini, xAI)
- Credit card required
- Can purchase add-on credits to continue
Links: Pricing
Gemini 2.5 Pro access
- 15 tasks/day free tier
- 3 concurrent tasks
- Gemini 2.5 Pro model
- Gmail account required (18+ years)
- Task limits reset on rolling 24-hour window
- No credit card required
Links: Usage Limits | Documentation
Claude 4 Sonnet, Claude 3.7 Sonnet access
- 50 credits/month (Free tier)
- Claude 4 Sonnet and Claude 3.7 Sonnet models (AWS-hosted)
- No credit card required
- 14-day welcome bonus: 500 credits
- Paid tiers: Pro ($20/mo - 1,000 credits), Pro+ ($40/mo - 2,000 credits), Power ($200/mo - 10,000 credits)
Links: Pricing | Introduction Blog
Qwen3-Coder-480B, Claude, GPT, Gemini models (free preview)
- Limited credits for chat and agent requests during preview
- AI-powered IDE from Alibaba (public preview launched August 2025)
- Available for Windows and macOS
- Primarily uses Qwen3-Coder-480B (Alibaba's flagship coding model)
- Also supports Claude, GPT-4, Gemini models
- Agent Mode and Quest Mode for autonomous coding
- No credit card required (preview period)
- Credit-based pricing coming soon
Limits change fast. If you see a mistake, a newer quota/model, or want to add a new tool, open an issue or PR with a source. See CONTRIBUTING.md for guidelines.
(ordered from most generous to least)
These services provide API access to coding-optimized models that integrate with popular AI coding tools like Cursor, Continue.dev, Cline, and others. They don't provide standalone coding tools but offer the AI backend for existing tools.
Qwen3-Coder-480B via OpenRouter
- 2,000 requests/day free tier for Qwen3-Coder-480B
- Additional free models: Qwen3-30B-A3B, Qwen3-235B-A22B, Gemini Flash
- OpenAI-compatible API for all major IDEs
- No credit card required for free models
- Rate limiting during high traffic for free tier
- Works with Continue.dev, Cline, Cursor, etc.
Links: Free Models | Qwen3-Coder API
Qwen3-Coder-480B and Llama 3.1 access
- Free tier: 1M tokens/day (no credit card required)
- Models: Qwen3-Coder-480B (matches Claude Sonnet 4 performance), Llama 3.1 70B
- OpenAI-compatible API (works with Cursor, Continue.dev, Cline, RooCode, etc.)
- Ultra-fast inference: 2,000 tokens/second (40x faster than typical providers)
- Paid tiers: Developer ($10+ self-serve), Code ($50/mo - 24M daily tokens), Enterprise (custom)
Links: Pricing | API Docs | Integration Guides
Jira Standard ($7.53/user/mo): 20M tokens/day Jira Premium ($15.25/user/mo): 20M tokens/day Jira Enterprise (custom): 20M tokens/day
- 4x increase from free tier (5M → 20M tokens/day)
- Same Claude-based model as free tier
- Token limits reset at midnight UTC
Links: Documentation | Token Limits | Jira Pricing
Pro ($17/mo with annual): Sonnet 4 access with more usage than free tier Max (From $100/mo): Opus 4.1 + Sonnet 4 access (5x or 20x more usage than Pro)
- Usage limits reset weekly
- 5-hour rolling window limits apply
- Priority access during high traffic (Max tier)
Links: Pricing
Pro ($19/mo): Increased limits for agentic requests
- Usage may be adjusted based on regional factors and usage patterns
Links: Pricing
Pro ($15/mo annually, $18/mo monthly): 2,500 requests/month
Turbo ($40/mo annually, $50/mo monthly): 10,000 requests/month
Lightspeed ($200/mo annually, $225/mo monthly): 50,000 requests/month
- Pay-as-you-go available for overages
- Enterprise tier: Custom pricing
Links: Pricing
Pro ($10/mo): 300 premium requests + unlimited completions/month Pro+ ($39/mo): 1,500 premium requests + unlimited completions/month Business ($19/user/mo): 300 premium requests + unlimited completions/user/month Enterprise ($39/user/mo): 1,000 premium requests + unlimited completions/user/month
- Access to multiple models (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
- Overage billing available at $0.04/request
Links: Plans Details
Pro ($10/mo): 600 fast requests + unlimited slow requests for premium models
- Unlimited slow requests for advanced models
- Zero rate limits and faster access to premium models
- Extra packages available: $3-$12 for additional fast requests
- Multiple premium models: Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, Gemini 2.5 Pro, GPT‑4.1, GPT‑4o
- VS Code-based IDE with full AI integration
- First month available for $3
Links: Pricing | Documentation
Pro ($15/mo): 500 prompt credits/month Teams ($30/user/mo): 500 prompt credits/user/month Enterprise ($60+/user/mo): 1,000 prompt credits/user/month
Links: Pricing
Pro ($25/mo): 150 credits/month (5 daily credits) Teams ($30/mo): Higher limits (undisclosed)
Links: Messaging Limits
$20/mo: 10M tokens/month $200/mo: 120M tokens/month
Links: Token Documentation
Hobby (Free): Limited agent requests with basic models only + 2-week Pro trial Pro ($20/mo): Extended limits on Agent, access to Claude Sonnet 4, OpenAI o3-pro, GPT-4.1, Gemini 2.5 Pro, Claude Opus 4 Pro+ ($60/mo): 3x usage on all OpenAI, Claude, Gemini models Ultra ($200/mo): 20x usage on all OpenAI, Claude, Gemini models Teams ($40/user/mo): Pro features + team management
- Two-week Pro trial available
- Credit card required for free tier
- AI-powered code editor with autonomous coding capabilities
Links: Pricing
Free with ChatGPT Plus ($20/mo): GPT-5 access for coding tasks Pay-as-you-go: Use with OpenAI API key Free OSS mode: Access to open-source models only (via --oss flag)
- Lightweight coding agent running locally
- Interactive terminal UI with sandbox mode
- Cross-platform support: macOS 12+, Ubuntu 20.04+, Windows 11 via WSL2
- Experimental project under active development
Links: GitHub Repo
Pro ($10/mo): Unlimited usage with advanced context awareness
- Claude 3.5 Sonnet, GPT-4o access
- Enhanced context window and personalization Teams ($12/user/mo): Pro features + team management Enterprise (Custom): On-premise deployment, custom models
Links: Pricing
Pro ($12/mo): Enhanced AI completions and chat Enterprise ($39/user/mo): Multiple LLMs, private deployment
- Models: Claude 3.5 Sonnet, GPT-4o, Llama 3.3 70B, proprietary models
- 600+ programming languages supported
- On-premises and air-gapped deployment options
- Bring your own fine-tuned models
Links: Pricing
AI Pro ($15/mo): Increased cloud quota + unlimited local models AI Ultimate ($25/mo): Maximum cloud quota + advanced features
- Free tier: Unlimited code completion + local models + limited cloud quota
- 30-day Pro trial included
- All Products Pack includes AI Pro
- Offline mode with local models via Ollama/LM Studio
Links: AI Pricing
Pro ($19.99/mo via Google AI Pro): 100 tasks/day
- 5x higher limits than free tier (15 concurrent tasks)
- Higher access to latest models
- Gmail account required (18+ years)
Links: Usage Limits | Google AI Plans
Pro ($10/mo): 1M token context window + chat credits
- Alternative: $99/year
- Chat interface with GPT-4o, Claude 3.5 Sonnet, GPT-4 Team ($10/user/mo): Pro features + team management
- Note: Merged with Cursor IDE in November 2024
Links: Pricing
Know better pricing or limits? Share a link in an issue or PR to help keep this updated. See CONTRIBUTING.md for guidelines.
(unspecified/basic models)
Unspecified models
- 1M tokens/month limit
- Specific model not publicly specified
- Credit card required
Links: Token Documentation
Unspecified models
- 5 daily credits, max 30 per month (free)
- Models not publicly enumerated
- Credit card required
Links: Messaging Limits
Proprietary models (not frontier)
- GPT-5 access requires v0 Premium subscription
- $5 in credits/month limit
- Uses proprietary models with varied routing
- Credit card required
Links: Updated Pricing Blog
Unlimited free usage of basic AI coding assistance
- Individual plan: Free forever with unlimited code completions, AI chat, commands
- 70+ programming languages supported
- IDE integrations: VS Code, JetBrains, Vim/Neovim, Jupyter
- No credit card required
- Limited context awareness (expanded in paid tiers)
- Base model only (Llama 3.1 70B), pro-graded models require subscription
Links: Pricing | Documentation
Free tier with limited features
- Basic AI code completions and chat (limited)
- Local processing available
- Context heavily limited in free tier
- Performance dialed down to save resources
- 600+ programming languages supported
Links: Pricing
AI Free tier included with IDEs
- Unlimited code completion and local model support
- Limited quota for cloud-based features
- 30-day AI Pro trial
- Chat, code generation, commit messages with local models
Links: AI Features
Free tier with basic features
- Basic code suggestions
- 7-day data retention limit
- Credit card required for registration
- 1M token context window (impressive for free tier)
Links: Pricing
Free open-source extension with flexible model support
- Free VS Code and JetBrains extension
- Full support for local models via Ollama, LM Studio
- Solo tier: Private/team/public visibility options
- Supports 200+ models (requires your own API keys for cloud models)
- Community hub for custom AI assistants
- No vendor lock-in or usage limits for local models
Know the official limits or models? Share a link in an issue or PR to update the information. See CONTRIBUTING.md for guidelines.
Running open-weight frontier models locally provides unlimited coding assistance without API costs or usage limits. Popular tools for local deployment include Cline (VS Code extension with Plan/Act modes and MCP support), Aider (command-line assistant with built-in Git integration), and Continue.dev (open-source VS Code extension supporting 200+ models). All work seamlessly with Ollama to run frontier models like Devstral (24B parameters, optimized for agentic coding), Qwen3-Coder, DeepSeek Coder V2, Codestral, and GLM-4.5.
Note: Frontier models require substantial RAM/VRAM. In particular, for Qwen3‑Coder‑480B the Ollama‑friendly GGUF is ~150GB, and practical local inference can require ~150GB of unified memory (RAM+VRAM), which makes it hard on typical laptops; the 30B quant commonly needs ~18GB. See the Unsloth Qwen3‑Coder local guide for details (docs) and Simon Willison's article on running GLM‑4.5 AIR on his laptop to build Space Invaders for a practical example.
- Goal: Compare AI coding tools by their access to pro-grade models and free tier limits.
- What qualifies a model as "pro-grade"? Models must achieve ≥60% on SWE-bench Verified, demonstrating real-world software engineering capability. Current qualifying models: GPT-5 (74.9%), Claude Opus 4.1 (74.5%), Claude Sonnet 4 (72.7%), GPT-5 mini (71.0%), Qwen3-Coder-480B (69.6%), and Gemini 2.5 Pro (63.2%).
- Different limit types: Tools use various quota systems - requests, tokens, credits, chats - making direct comparison challenging. Check documentation for specifics.
- Real-world usage: Actual consumption varies dramatically based on coding style, task complexity, and tool implementation.
- Free LLM API Resources - Comprehensive list of free LLM APIs for building custom integrations