Features
Everything in one reconciled API: model catalogue, price history, recommendations, natural-language query endpoints, streaming updates, and webhook automation.
Endpoints:
- /v1/models
- /v1/models/:id
- /v1/providers
- /v1/compare
- /v1/changes
- /v1/models/:id/history
- /v1/context
- /v1/recommend
- /v1/ask
- /v1/stream/changes
- /v1/webhooks
- /v1/webhooks/:id

Frequently Asked
How are prices verified?
Data is scraped from multiple sources (OpenRouter, LiteLLM, provider docs) and reconciled. Values that diverge by more than 5% are flagged for manual operator review, so no price is published on guesswork.
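The >5% divergence rule could be sketched as a simple median check. This is a hypothetical illustration (the actual reconciliation logic is not published); source names and prices are examples only.

```python
from statistics import median

def flag_divergent(prices: dict[str, float], threshold: float = 0.05) -> list[str]:
    """Return the sources whose price diverges from the cross-source
    median by more than `threshold` (5% by default)."""
    mid = median(prices.values())
    return [src for src, p in prices.items() if abs(p - mid) / mid > threshold]

# Example: one source still reports a stale, higher price.
sources = {"openrouter": 2.50, "litellm": 2.50, "provider_docs": 3.00}
flag_divergent(sources)  # → ["provider_docs"]
```

Flagged entries would then be routed to a human reviewer rather than published automatically.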
How fast do changes appear?
Price updates are ingested continuously and usually surface through API endpoints and the SSE stream within 60 seconds of confirmation.
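Consuming the SSE stream might look like the following sketch. It assumes /v1/stream/changes emits standard `data: {...}` lines; the base URL and auth header name are illustrative, not confirmed by the docs.

```python
import json
import urllib.request

def parse_sse_data(line: str):
    """Decode the JSON payload of an SSE `data:` line; return None for
    comments, blank lines, and other field types."""
    if line.startswith("data:"):
        return json.loads(line[len("data:"):].strip())
    return None

def stream_changes(url: str, token: str):
    """Yield decoded change events from an SSE endpoint."""
    req = urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})
    with urllib.request.urlopen(req) as resp:
        for raw in resp:  # HTTPResponse iterates line by line
            event = parse_sse_data(raw.decode("utf-8"))
            if event is not None:
                yield event

# for event in stream_changes("https://api.llmrates.example/v1/stream/changes", "YOUR_TOKEN"):
#     print(event)
```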
Can I use this in production agents?
Yes. Our /v1/context and /v1/ask endpoints return lean JSON snapshots optimized for injecting pricing context into prompts and for low-latency agent tool use.
What kind of questions can I ask the /v1/ask endpoint?
You can ask natural language questions like 'Cheapest model for summarization?', 'Models under $1/M input?', or 'Which providers dropped prices recently?'.
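A call to /v1/ask might be built like the sketch below. The `question` field name, base URL, and bearer-token auth are assumptions for illustration; check the API reference for the real request shape.

```python
import json
import urllib.request

API_BASE = "https://api.llmrates.example"  # illustrative base URL

def build_ask_request(question: str, token: str) -> urllib.request.Request:
    """Build a POST to /v1/ask carrying a natural-language question."""
    body = json.dumps({"question": question}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/v1/ask",
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

# req = build_ask_request("Cheapest model for summarization?", "YOUR_TOKEN")
# answer = json.load(urllib.request.urlopen(req))
```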
How do GPT-4o and Claude 3.5 pricing compare?
GPT-4o is currently more aggressive on input pricing ($2.50 vs $3.00 per 1M tokens), while output costs are highly competitive across both. Use our comparison tool for live deltas.
Are webhook payloads signed?
Yes. Webhook events are signed with HMAC-SHA256 using your secret key so recipients can verify they originated from LLMRates.
What do 'input' and 'output' prices mean?
Input price is the cost per 1M tokens sent in your prompt. Output price is the cost per 1M tokens generated by the model in response.
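Putting the two together, the cost of a single request is a simple weighted sum. The token counts and per-1M prices below are example values, not live rates.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Total USD cost of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

# e.g. a 10k-token prompt and a 2k-token completion at $2.50 / $10.00 per 1M:
request_cost(10_000, 2_000, 2.50, 10.00)  # → 0.045 (i.e. 4.5 cents)
```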