Story Details

  • How Google built its Gemini robotics models

    Posted: 2025-04-02 14:47:38

    Google's Gemini robotics models are built by combining Gemini's large language models with visual and robotic data. This approach allows the robots to understand and respond to complex, natural language instructions. The training process uses diverse datasets, including simulation, videos, and real-world robot interactions, enabling the models to learn a wide range of skills and adapt to new environments. Through imitation and reinforcement learning, the robots can generalize their learning to perform unseen tasks, exhibit complex behaviors, and even demonstrate emergent reasoning abilities, paving the way for more capable and adaptable robots in the future.

    Summary of Comments ( 68 )
    https://news.ycombinator.com/item?id=43557310

    Hacker News commenters generally express skepticism about Google's claims regarding Gemini's robotic capabilities. Several point out the lack of quantifiable metrics and the heavy reliance on carefully curated demos, suggesting a gap between the marketing and the actual achievable performance. Some question the novelty, arguing that the underlying techniques are not groundbreaking and have been explored elsewhere. Others discuss the challenges of real-world deployment, citing issues like robustness, safety, and the difficulty of generalizing to diverse environments. A few commenters express cautious optimism, acknowledging the potential of the technology but emphasizing the need for more concrete evidence before drawing firm conclusions. Some also raise concerns about the ethical implications of advanced robotics and the potential for job displacement.