OpenAI Announces DALL-E 3 After Weeks of Speculation (And It Looks Incredible)
DALL-E 3 is coming soon and looks to be way ahead of its predecessor. It produces higher-quality images, keeps track of context, and handles text inside images a lot better. It's shaping up to be a viable competitor to Midjourney and will offer direct integration with ChatGPT.
Justin Gluska
Updated September 20, 2023
Reading Time: 3 minutes
OpenAI today revealed DALL-E 3, the latest iteration of its AI image generation model. The new system is a significant step forward in text-to-image synthesis, promising images that follow the user's description far more closely.
DALL-E 3, currently in research preview, is set to become available to ChatGPT Plus and Enterprise customers in October, with availability through the API and in Labs planned for later this fall. One feature that immediately stands out is DALL-E 3's ability to render legible text inside an image, as seen in the first image OpenAI showcased:
One of the key challenges with modern text-to-image systems has been their tendency to overlook nuances and details in user prompts, often requiring users to write super complex prompts.
DALL-E 3 aims to address this issue by enhancing its understanding of textual descriptions, ensuring that the generated images closely align with the provided text.
DALL-E 3 is built natively on ChatGPT, allowing users to treat ChatGPT as a brainstorming partner and prompt refiner. With the new system, users can simply express their ideas, ranging from a simple sentence to a detailed paragraph, and ChatGPT will automatically write tailored, detailed prompts that DALL-E 3 turns into images.
Users can also make quick tweaks to generated images with just a few words, giving them more creative control. OpenAI CEO Sam Altman tweeted a video hinting that DALL-E 3 can keep styles and characters consistent across multiple images:
Compared to its predecessor, DALL-E 2, DALL-E 3 demonstrates remarkable improvements in image generation. Even when given the same prompt, it consistently produces images that are more faithful to the user's intent, offering greater precision and detail. Poor prompt fidelity was something many people disliked about DALL-E 2, and it even pushed some of them to Midjourney.
DALL-E 3 will also include safeguards to limit the generation of violent, adult, or hateful content. Additionally, it will decline requests that ask for a public figure by name, as part of ongoing efforts to mitigate harmful biases and ensure responsible AI use.
OpenAI is also exploring ways to help users identify AI-generated images, including the development of a provenance classifier. This tool is meant to determine whether an image was created by DALL-E 3, with the goal of improving transparency around AI-generated content. This comes after OpenAI shut down its classifier for AI-written text in July, citing its low accuracy.
Creators will also be able to opt their images out of the training data for future image models, giving them greater control over their creations.
As DALL-E 3 gets ready for an official release in October, anticipation is building among ChatGPT Plus and Enterprise customers, who are looking forward to using it within their existing ChatGPT workflow.