OpenAI announced today that its new flagship model, GPT-4o mini, has been launched on Azure AI, supporting text processing capabilities, with plans to introduce image, audio, and video functionalities in the future.

image.png

GPT-4o mini is significantly smarter than GPT-3.5Turbo—scoring 82% on the Massive Multitask Language Understanding (MMLU) benchmark compared to GPT-3.5Turbo's 70%—and is over 60% cheaper. This model offers an expanded 128K context window and integrates the enhanced multilingual capabilities of GPT-4o. The Azure OpenAI Studio Playground allows free trials of GPT-4o mini.

Microsoft Azure AI brings default security to GPT-4o mini, along with extended data residency and service availability upgrades. Customers can expect enhanced performance and features on Azure AI, particularly for streaming scenarios such as assistants, code interpreters, and retrieval.

Azure AI has announced global pay-as-you-go options with the highest throughput limits for GPT-4o mini. Now, customers can pay flexibly based on consumed resources, with traffic routed globally for higher throughput, and have control over the storage location of their data. The global pay-as-you-go deployment option offers a throughput of 15 million tokens per minute (TPM) and 99.99% availability for GPT-4o mini, at the same industry rates as OpenAI.

GPT-4o mini will be available on Azure AI this month and will be offered in Batch services. Batch delivers high-throughput jobs at a 50% discount rate within 24 hours by utilizing off-peak capacity. This is only possible with Microsoft running on Azure AI, enabling Microsoft Azure AI to provide off-peak capacity to customers.

Microsoft Azure AI will also release fine-tuning capabilities for GPT-4o mini this month, allowing customers to further customize the model according to specific use cases and scenarios. Following last month's update to token-based training billing, Microsoft Azure AI has reduced hosting costs by 43%. Combined with its low inference prices, this makes Azure OpenAI service fine-tuning deployments the most cost-effective option for customers with production workloads.

Key Highlights:

⭐ GPT-4o mini launches on Azure AI, supporting fast and comprehensive text processing functionalities

⭐ The new model is smarter than its predecessor, over 60% cheaper, and offers a broader context window and multilingual capabilities

⭐ Azure AI provides a global pay-as-you-go deployment option for GPT-4o mini, offering high throughput and 99.99% availability