Back to AI information
GPT-5.1 officially opened APIs, adding gpt-5.1-codex and codex-mini for long-term coding tasks

GPT-5.1 officially opened APIs, adding gpt-5.1-codex and codex-mini for long-term coding tasks

AI information Admin 254 views

OpenAI announced that GPT-5.1 is now available to developers through an API, with pricing aligned with the existing GPT-5 model and available for all paid tiers. This means that without increasing the unit price of model calls, developers can directly switch existing GPT-5 integration to GPT-5.1 to obtain stronger reasoning and instruction following capabilities without adjusting the cost structure or quota configuration.

At the same time, GPT-5.1-Codex and GPT-5.1-Codex-Mini, which are specifically designed for long-term coding and agent-based development scenarios, are also launched, both of which are optimized for long-running code generation, refactoring, and automated development processes. While the base price remains unchanged, OpenAI has also extended the Prompt cache retention time to a maximum of 24 hours for GPT-5.1 and its Codex variants, which can reuse the same long context across multiple rounds of long sessions or ongoing tasks, significantly reducing comprehensive fees and reducing first-round cold start delays.

FAQsQ

: What is the price change of GPT-5.1 in the API?

A: OpenAI has made it clear that GPT-5.1 is billed the same as GPT-5, using the original unit price and rate limit, which is an iteration of "capability upgrades but prices remain unchanged".

Q: What are gpt-5.1-codex and gpt-5.1-codex-mini mainly used for?

A: These two models are optimized for long-running coding tasks and are more suitable for scenarios such as code proxies, automatic refactoring, and large-scale project transformation, and are more focused on the stability and sustainability of engineering workflows than GPT-5.1.

Q: What is the use of extending the prompt cache to 24 hours?

A: In complex projects, developers can cache long system prompts or large codebase contexts as prompts and call them repeatedly within 24 hours without repeatedly paying for them, significantly reducing the context cost of long sessions and long tasks while reducing request latency.

Q: Does 24-hour caching only work for GPT-5.1?

A: The extended prompt cache duration is currently mainly for GPT-5.1 and its related family models, including gpt-5.1-codex and gpt-5.1-codex-mini, and the specific scope of application is subject to the official documentation.

GPT5.1API price is in line with GPT5 The developers seamlessly switched GPT5 to GPT5.1 GPT5.1 inference and instruction following capabilities have been upgraded GPT5.1 is open to all paid tiers GPT5.1codex long-time encoding scenario optimization description GPT5.1CodexMini is suitable for automating the development process A long-running code agent task model Large-scale project code restructuring and transformation are more suitable for models GPT5.1 capability upgrade but API price remains unchanged strategy Developers do not need to adjust the cost structure and limits Prompt cache duration extended to 24 hours Long context caching reduces synthetic call costs Reuse the same long prompt to reduce cold start delays GPT5.1 is suitable for complex multi-round long session scenarios GPT5.1codex focuses on engineering workflow stability Specially optimized for long-term coding agent development The development team can directly replace the original GPT5 interface Improve model performance without increasing unit price Maintain the original rate limit and billing tier settings The new version of GPT5.1 API is suitable for building intelligent agents The cost of long task code generation and reconstruction is significantly reduced Support continuous automation development pipeline integration GPT5.1codex is suitable for long-term code review processes Developers can improve product intelligence within the same budget Prompt caching mechanism reduces context duplication The delay in the first round of response for long-session agent tasks is reduced GPT5.1CodexMini is suitable for lightweight proxy bots Enterprises can use GPT5.1 to build large-scale coding assistants GPT5.1 API upgrade is friendly to the transformation of existing projects Prompt caching strategy designed for long-chain calls Developers can centrally cache system prompts and codebases GPT5.1 is conducive to enhancing the stability of engineering-grade reasoning Enhanced long-term session support for agent-based development API call limits and cost controls are easier to manage GPT5.1 brings free performance upgrades to existing applications Support for multiple reuse of long prompts in the same session gpt5.1codex is suitable for automatic code repair in CI pipelines Running code agents for long periods of time reduces the need for manual intervention The GPT5.1 API upgrade is of great significance to SaaS product iteration GPT5.1 and codex descriptions have been added to the developer documentation Use GPT5.1 to build complex automated O&M scripts Prompt caching is 24 hours a day, suitable for daily development GPT5.1codex improves the depth of code understanding in large repositories The new API is more conducive to implementing end-to-end coding agents Long session caching reduces the overhead of multi-module collaboration projects Developers can smoothly migrate legacy model configurations and quotas The GPT5.1 family model enjoys the cache optimization policy in a unified manner Automatic refactoring and code auditing are better left to codex Extending the prompt cache can help reduce peak hashrate GPT5.1 API upgrade reflects a balance between cost performance and performance

Recommended Tools

More