chore: update model catalog from bot issues#785
Conversation
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
|
Codex (@codex) review |
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 71bef1504a
ℹ️ About Codex in GitHub
Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "Codex (@codex) review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".
| "input_cost_per_mil_tokens": 0.3, | ||
| "output_cost_per_mil_tokens": 1.2, |
There was a problem hiding this comment.
Add cached-input pricing for Together MiniMax M3
For Together requests that return cached prompt tokens, this entry cannot apply the provider's discounted cache-read rate because it only records input/output pricing. The Together serverless catalog lists MiniMaxAI/MiniMax-M3 with $0.06 per 1M cached input tokens, matching the adjacent M2.7 entry, so add input_cache_read_cost_per_mil_tokens: 0.06 to avoid overstating/omitting cache pricing for this new model.
Useful? React with 👍 / 👎.
| "input_cost_per_mil_tokens": 0.95, | ||
| "output_cost_per_mil_tokens": 4, |
There was a problem hiding this comment.
Add cached-input pricing for new Fireworks models
For Fireworks requests with cached prompt tokens, the newly added serverless entries in this block lose the discounted cached-input price because they record only input/output rates. The Fireworks serverless pricing table lists cached-input rates for these same models (Kimi K2.7 Code $0.19, Qwen 3.7 Plus $0.08, and MiniMax M3 $0.06 per 1M), so please add input_cache_read_cost_per_mil_tokens to each new Fireworks entry.
Useful? React with 👍 / 👎.
71bef15 to
c468ad9
Compare
Codex Review ResponseSummaryAddressed both Codex suggestions by adding missing Changes Made1. Together MiniMax M3 (
|
Automated daily batch of model catalog updates from bot issues.
Included issues
Summary
grok-3-betagrok-4-latestgrok-3-betagrok-4-latest2
3
accounts/fireworks/models/kimi-k2p7-codeaccounts/fireworks/models/qwen3p7-plusaccounts/fireworks/models/minimax-m3accounts/fireworks/models/kimi-k2p7-codeaccounts/fireworks/models/qwen3p7-plusaccounts/fireworks/models/minimax-m32
MiniMaxAI/MiniMax-M3MiniMaxAI/MiniMax-M32
Verified metadata
#777: [BOT ISSUE] xAI: update pricing for grok-3-beta and grok-4-latest (now grok-4.3 aliases)
Verification notes
Verification
sync_models vs proposed update
sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.
#783: [BOT ISSUE] Fireworks: add missing Kimi K2.7 Code, Qwen 3.7 Plus, and MiniMax M3 models
Verification notes
No LLM verification step ran; model metadata was already complete in the issue.
sync_models vs proposed update
sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.
#784: [BOT ISSUE] Together: add missing MiniMaxAI/MiniMax-M3 model
Verification notes
No LLM verification step ran; model metadata was already complete in the issue.
sync_models vs proposed update
sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.