• Google Whisk uses images as inputs instead of text-based prompts
• It is built on Google’s Imagen 3 generative AI model
• The experimental tool is free to try for users in the US

Google’s new AI tool makes it easier to create and remix your visual ideas. Instead of asking you to describe what’s in your mind’s eye, Whisk lets you enter three image prompts: one for subject, one for scene and one for style. Whisk takes care of the rest, making it a more intuitive way to experiment with different ideas.

While many of the best AI image generators require you to write a detailed prompt, Whisk handles that behind the scenes. When you drop images into the web-based Whisk interface as inspiration, Google’s Gemini model automatically analyzes them and writes a detailed caption for each. These captions are then fed into the Imagen 3 model to create a matching image.

