The latest developments include BaseTen raising $300M at a $5B valuation for enterprise model serving, Inferact securing $150M to commercialize vLLM, RadixArk spinning out of SGLang with a $400M valuation focused on KV-cache optimization, and Quadric receiving $30M for on-device inference. Funding rounds also saw Sequoia investing in Anthropic, Bolna raising $6.3M for voice AI orchestration, LiveKit obtaining $100M to expand real-time audio/video capabilities, and World Labs negotiating a new funding round at a $5B valuation. Policy updates include Google launching Gemini-powered SAT practice, China preparing for NVIDIA H200 orders, and Google investing in Tokyo-based Sakana AI for Japanese language model development. Additionally, notable tech releases involve Z.ai open sourcing GLM-4.7-Flash and Liquidi releasing LFM2.5 Thinking for on-device reasoning. Research
Read the full article at TheSequence
Want to create content about this topic? Use Nemati AI tools to generate articles, social posts, and more.





