Re: Synthetic Media & Deepfakes News and Discussions
Posted: Sun May 29, 2022 9:36 am
>
Google claims its text-to-image AI delivers 'unprecedented photorealism'
https://www.engadget.com/google-imagen- ... 05123.html
Google claims its text-to-image AI delivers 'unprecedented photorealism'
https://www.engadget.com/google-imagen- ... 05123.html
The examples are curated and it's not publicly available. Still needs improvement in social biases and limitations of large language models.Google has shown off an artificial intelligence system that can create images based on text input. The idea is that users can enter any descriptive text and the AI will turn that into an image. The company says the Imagen diffusion model, created by the Brain Team at Google Research, offers "an unprecedented degree of photorealism and a deep level of language understanding."
This isn't the first time we've seen AI models like this. OpenAI's DALL-E (and its successor) generated headlines as well as images because of how adeptly it can turn text into visuals. Google's version, however, tries to create more realistic images.
To assess Imagen against other text-to-image models (including DALL-E 2, VQ-GAN+CLIP and Latent Diffusion Models), the researchers created a benchmark called DrawBench. That's a list of 200 text prompts that were entered into each model. Human raters were asked to assess each image. They "prefer Imagen over other models in side-by-side comparisons, both in terms of sample quality and image-text alignment," Google said.