The Sequence Radar #795: The New Inference Kids

Ali NematiAli NematiJan 2544 sec read12 views

The latest developments include BaseTen raising $300M at a $5B valuation for enterprise model serving, Inferact securing $150M to commercialize vLLM, RadixArk spinning out of SGLang with a $400M valuation focused on KV-cache optimization, and Quadric receiving $30M for on-device inference. Funding rounds also saw Sequoia investing in Anthropic, Bolna raising $6.3M for voice AI orchestration, LiveKit obtaining $100M to expand real-time audio/video capabilities, and World Labs negotiating a new funding round at a $5B valuation. Policy updates include Google launching Gemini-powered SAT practice, China preparing for NVIDIA H200 orders, and Google investing in Tokyo-based Sakana AI for Japanese language model development. Additionally, notable tech releases involve Z.ai open sourcing GLM-4.7-Flash and Liquidi releasing LFM2.5 Thinking for on-device reasoning. Research

Read the full article at TheSequence


Want to create content about this topic? Use Nemati AI tools to generate articles, social posts, and more.

12
Comments
Ali Nemati
Ali NematiWritten by Ali
View all posts

Related Articles

The Sequence Radar #795: The New Inference Kids | OSLLM.ai