Rather than large platform acquisitions, Google is focusing on narrowly scoped teams and technologies that fill gaps in its Gemini model ecosystem, while using minority investments and licensing arrangements to expand reach and reduce integration risk.
Spatial and Voice Intelligence
Google acquired Common Sense Machines, a startup specializing in AI-generated 3D imagery, according to The Information. The company had been developing models that convert 2D images into structured 3D assets, brings technology that can be used to improve spatial reasoning and visual consistency in generative systems.
Generating stable 3D representations from limited visual input is a step toward world models that understand physical space rather than just pixels. That capability has implications for robotics simulation, augmented reality, video generation and training data efficiency. Integrating this technology into Gemini could reduce hallucination in visual outputs and improve coherence across frames and viewpoints.
Google has also moved to strengthen its voice and emotional intelligence capabilities by hiring the core team behind Hume AI, a startup known for models that analyze tone, prosody and emotional cues in speech. Under the deal, reported by TechCrunch, Hume licensed its technology to Google while its founders and several engineers joined DeepMind. Hume continues to operate independently.
The arrangement mirrors a growing pattern in AI dealmaking, where large platforms prioritize talent and research expertise over full acquisitions.
Advertisement: Scroll to Continue
Investing in Sakana AI Signals Regional Model Strategy
Beyond acquisitions, Google has taken an equity stake in Sakana AI, a Tokyo-based startup developing models inspired by collective intelligence and evolutionary systems, according to Bloomberg. The investment is intended to strengthen Gemini’s presence in Japan, where demand for domestically aligned AI models is growing among enterprises and government agencies.
Sakana’s work focuses on training methods that combine multiple smaller models into adaptive systems, rather than relying on a single monolithic architecture. Smaller models can be tuned to local language, cultural norms and data constraints, while still benefiting from shared learning.
Google’s investment reflects a recognition that global AI adoption will not be driven solely by U.S.-trained models deployed everywhere. By partnering with regional labs, Google can localize Gemini more quickly, reduce reliance on centralized compute, and compete more effectively with domestic AI providers in markets such as Japan.
Creative Output
Google’s dealmaking is occurring alongside demonstrations of its AI capabilities. Google DeepMind debuted an AI-assisted animated short film at the Sundance Film Festival, using its internal video and image generation models.
While the film itself was a creative project, it served a strategic purpose. It showcased advances in temporal consistency, visual control and human-in-the-loop workflows, areas critical for enterprise video tools, advertising and content production. For executives, the message was that Google’s models are moving beyond experimental demos toward systems that can support professional-grade outputs.
Google’s recent acquisitions, talent deals and investments show a company refining its AI strategy. Rather than chasing scale for its own sake, Google is selectively adding capabilities that improve multimodal reasoning, voice interaction and regional relevance. The emphasis on small teams, licensing structures and minority stakes suggests a focus on speed and flexibility.