LLM APIs with permanent free tiers for text inference.
All endpoints are OpenAI SDK-compatible unless noted. Each link points to the provider's API key page.
Contents
Provider APIs
APIs run by the companies that train or fine-tune the models themselves.
Cohere π¨π¦
Free "Trial" API key, no credit card. 1,000 API calls/month. Non-commercial use only.
Base URL: https://api.cohere.com/v2
Model Name
Context
Max Output
Modality
Rate Limit
Command A (111B)
256K
4K
Text
20 RPM
Command R+
128K
4K
Text
20 RPM
Command R
128K
4K
Text
20 RPM
Command R7B
128K
4K
Text
20 RPM
Embed 4
β
β
Embeddings (Text + Image)
2,000 inputs/min
Rerank 3.5
β
β
Reranking
10 RPM
Google Gemini πΊπΈ
Free tier unavailable in EU/UK/Switzerland. Free-tier prompts may be used by Google to improve products. 1
Base URL: https://generativelanguage.googleapis.com/v1beta
Model Name
Context
Max Output
Modality
Rate Limit
Gemini 2.5 Flash