Cohere

Models

Command A, Command R+, Aya Expanse 32B

RPM

20

Context

128K

Best For

Lightweight tasks, embedding

Pricing: Free tier, then pay-as-you-go

Verdict: Limited free tier

✓ Good option

Google Gemini

Models

Gemini 2.5 Pro, Gemini 2.5 Flash, Gemini 2.0 Flash-Lite

RPM

15

Context

1M

Best For

Multimodal tasks, long context

Pricing: Free tier available

Verdict: Limited RPM on free tier

⚠️ Limited

Mistral AI

Models

Mistral Large 3, Small 3.1, Ministral 8B

RPM

Variable

Context

128K

Best For

European data, reasoning

Pricing: Free tier + paid

Verdict: Generous free tier

✓ Good option

Groq

Models

Llama 3.3 70B, Llama 4 Scout, Mixtral 8x7B

RPM

30 (free)

Context

8K-32K

Best For

Speed-critical applications

Pricing: Very generous free tier

Verdict: Best free tier for speed

✓ Good option

Hugging Face

Models

Llama 3.3 70B, Qwen2.5 72B, Mistral 7B

RPM

Rate limited

Context

Varies

Best For

Experiments, testing

Pricing: Free with rate limits

Verdict: Can be unreliable

⚠️ Limited

OpenRouter

Models

DeepSeek R1, Llama 3.3 70B, GPT-4o

RPM

20 (free)

Context

Varies by model

Best For

Model diversity

Pricing: Free credits + paid

Verdict: Good variety, limited free

⚠️ Limited

Cerebras

Models

Llama 3.3 70B, Qwen3 235B

RPM

30 (free)

Context

8K

Best For

High-volume, speed-critical

Pricing: Extremely generous free tier

Verdict: Best pure free option

✓ Good option
Provider RPM RPD Context Window Latency Reliability Verdict
Cohere
20 1,000 128K Fast Good Limited free tier
Google Gemini
15 100-1,500 1M Fast Excellent Limited RPM on free tier
Mistral AI
Variable 1B tokens/mo 128K Fast Good Generous free tier
Groq
30 (free) 14,400 8K-32K Very Fast Good Best free tier for speed
Hugging Face
Rate limited Unknown Varies Variable Unreliable Can be unreliable
OpenRouter
20 (free) 50 Varies by model Variable Good Good variety, limited free
Cerebras
30 (free) 14,400 8K Ultra Fast Excellent Best pure free option
MiniMax
Unlimited* Unlimited* 1M Fast Excellent Best value for production

* MiniMax offers pay-as-you-go pricing with no monthly commitment. Use code CXWzfLSdF5 for 10% off.

Ready for Production-Ready AI?

Stop fighting rate limits and hello to reliable, fast AI inference.

Get Started Free

* No credit card required • 10% referral discount