Deepinfra Secures $107M Series B Led by 500 Global
Deepinfra Inc., a dedicated inference cloud provider, has secured $107 million in a Series B funding round. The financing was led by 500 Global, with participation from prominent investors including Georges Harik, Nvidia Corp., Samsung Next, Supermicro Computer Inc., A.Capital Ventures, Crescent Cove, Felicis, Peak6, and Upper90. This capital injection is intended to expand Deepinfra’s global capacities and enhance its specialised infrastructure for AI workloads.
Deepinfra aims to address inefficiencies in AI model deployment caused by general-purpose cloud platforms. Co-founded by Nikola Borisov and former engineers of the imo messaging platform, the startup is focused on providing a cloud service optimised for the demands of AI inference—where AI models are operationalised in real-time. Unlike traditional platforms influenced by the high latency of "spot" capacity rentals, Deepinfra operates its own hardware across eight U.S. data centers, using Nvidia's advanced GPUs to achieve cost efficiency that it claims is up to 20 times greater than competitors.
The move is strategically aligned with the ongoing shift from experimental AI chatbots to more complex, autonomous agent systems. These systems require robust and reliable infrastructure due to their resource-intensive nature. Deepinfra aims to deliver a "token factory" approach, treating AI inference as a primary service, to meet this growing demand. The company currently supports over 190 open-source AI models and adheres to a zero-data retention policy to alleviate the concerns of enterprises wary of cloud data storage.
The funding arrives as demand for AI inference capabilities surges. According to Tony Wang of 500 Global, the industry increasingly requires faster and more adaptable infrastructure solutions, making Deepinfra's offerings critical to the next AI growth phase. The company's focus on agentic AI reflects an industry-wide trend towards more autonomous systems, which, according to Borisov, are now the dominant AI workload driver.
Looking ahead, the funding will enhance Deepinfra’s ability to scale its infrastructure and support an anticipated expansion in AI workloads. As the cloud infrastructure market evolves to accommodate these new computing demands, Deepinfra's development trajectory will test whether specialized infrastructure can outperform traditional cloud providers in this burgeoning segment. Regulatory considerations and further technical evolution will be key areas to monitor as the company progresses.
This transaction is classified in Cloud Infrastructure with a reported deal value of $107M. Figures and status may change as sources update.