Story Details

  • Create and edit images with Gemini 2.0 in preview

    Posted: 2025-05-07 16:06:44

    Google's Gemini 2.0 now offers advanced image generation and editing capabilities in a limited preview. Users can create realistic images from text prompts, modify existing images with text instructions, and even expand images beyond their original boundaries using inpainting and outpainting techniques. This functionality leverages Gemini's multimodal understanding to accurately interpret and execute complex requests, producing high-quality visuals with improved realism and coherence. Interested users can join a waitlist to access the preview and explore these new creative tools.

    Summary of Comments ( 97 )
    https://news.ycombinator.com/item?id=43917461

    Hacker News commenters generally expressed excitement about Gemini 2.0's image generation and editing capabilities, with several noting its impressive speed and quality compared to other models. Some highlighted the potential for innovative applications, particularly in design and creative fields. A few commenters questioned the pricing and access details, while others raised concerns about the potential for misuse, such as deepfakes. Several people also drew comparisons to other generative AI models like Midjourney and Stable Diffusion, discussing their relative strengths and weaknesses. One recurring theme was the rapid pace of advancement in AI image generation, with commenters expressing both awe and apprehension about future implications.