AI hyperscaler Nscale launches Serverless Inference Platform
)
The platform’s token-based, pay-as-you-go pricing ensures users only pay for what they consume, eliminating idle capacity costs and reducing financial barriers to experimenting and deploying generative AI models.
Users can immediately access popular generative AI models such as Meta's Llama, Alibaba's Qwen, and DeepSeek through OpenAI-compatible APIs or via Nscale’s intuitive web console. The broader Nscale platform provides comprehensive functionality, including Slurm and Kubernetes orchestration, observability, and multi-tenant security. These features deliver the reliability, performance, and compliance required for enterprise AI workloads.
Register to get instant access to a range of AI models in the Nscale ecosystem today: https://console.nscale.com/auth/signup?_gl=1*902a80*_gcl_au*Mjk4OTkxMTc5LjE3NDE2MDg2OTU.*FPAU*Mjk4OTkxMTc5LjE3NDE2MDg2OTU.