Sun. Dec 22nd, 2024
Introducing More Enterprise-Grade Features for API Customers

To help organizations scale their AI usage without straining their budgets, we’ve added two new ways to reduce costs on consistent and asynchronous workloads.

  • Usage discounts on committed throughput: Customers with a sustained level of tokens per minute (TPM) usage on GPT-4 or GPT-4 Turbo can request access to provisioned throughput and receive discounts ranging from 10% to 50% based on the size of their commitment.
  • Cost savings for asynchronous workloads: Customers can use the new Batch API to run non-urgent workloads asynchronously. Batch API requests are priced at 50% off standard prices, offer substantially higher rate limits, and return results within 24 hours. This is ideal for use cases such as model evaluation, offline classification, summarization, and synthetic data generation (see the sketch after this list for how a batch might be submitted).
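
For illustration, here is a minimal sketch of how an asynchronous batch might be submitted with the OpenAI Python SDK. The file name, model, and prompts are placeholders chosen for this example, and the API documentation remains the authoritative reference for the exact request format.

```python
import json

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Hypothetical input: a small JSONL file of chat completion requests.
requests = [
    {
        "custom_id": f"request-{i}",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4-turbo",
            "messages": [{"role": "user", "content": prompt}],
        },
    }
    for i, prompt in enumerate(
        ["Summarize this support ticket in one sentence.",
         "Classify this review as positive or negative."]
    )
]
with open("batch_requests.jsonl", "w") as f:
    f.write("\n".join(json.dumps(r) for r in requests))

# Upload the file and submit the batch; results are returned within the 24-hour window.
batch_file = client.files.create(file=open("batch_requests.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)
print(batch.id, batch.status)  # poll client.batches.retrieve(batch.id) until completed
```

Once the batch completes, its output file can be downloaded and matched back to the original requests by custom_id.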

We plan to continue adding new features with a focus on enterprise-grade security, administrative controls, and cost management. To learn more about these releases, please see our API documentation or contact our team to discuss a custom solution for your enterprise.