
Image: Getty Images/ For illustrative purposes
Core42 has made OpenAI’s latest open-weight AI models, including gpt-oss-20B and gpt-oss-120B, available on its AI Cloud platform, with instant access through the Core42 Compass API.
The deployment allows enterprises, researchers and developers to run the models on a choice of silicon platforms with sovereign, scalable and high-performance capabilities.
Integrated into the Compass API, Core42 said it delivers inference speeds of up to 3,000 tokens per second per user, enabling real-time AI at global scale while matching workloads with optimal infrastructure for price-performance and scalability.
The deployment is aimed at low-latency inference workloads and applications, underscoring the company’s focus on secure and optimised sovereign-enabled AI infrastructure.
“Core42 AI Cloud, powered by silicon-diverse infrastructure, delivers the flexibility and performance needed for today’s AI workloads,” said Kiril Evtimov, CEO of Core42 and group CTO of G42. “Through the Compass API, organisations can access the latest open-weight AI models and choose the optimal platform to scale transformation, optimise performance and cost, and drive progress across global markets.”
Key benefits of the open-weight deployment on Core42’s AI cloud
-
Enterprise-scale performance for automation, decision-making and real-time AI at global scale.
-
Sovereign-ready scalability for secure, in-country operations in regulated sectors such as healthcare, finance and national security.
-
Optimised performance for committed infrastructure agreements, ensuring predictable cost and capacity.
-
Cost-efficient agentic AI capabilities for in-country, sovereign-controlled deployments in cost-sensitive use cases.
Available now through the Compass API, the models can be run and adapted locally or in the cloud with options for transparency, fine-tuning and sovereign deployment.
The launch marks a step toward enterprise AI autonomy, giving businesses more control to adapt AI to specific needs and scale innovation.
The announcement follows G42 milestones including plans for a 5GW US-UAE AI campus, the launch of the 1GW Stargate UAE facility as Phase 1 of the project, and Microsoft’s $1.5bn investment in 2024, moves that reinforce the UAE’s position as a growing AI hub.