Listen to this story
|
Stability AI has announced Stable Diffusion 3 in early preview, its most capable text-to-image model with greatly improved performance in multi-subject prompts, image quality, and spelling abilities.
While the model is currently in an early preview phase and not yet widely available, the waitlist has been opened for those interested in exploring its capabilities. This preview phase is crucial for gathering insights to enhance the model’s performance and safety before its open release. Interested individuals can sign up for the waitlist to get early access.
The Stable Diffusion 3 suite of models currently range from 800M to 8B parameters.Combining a diffusion transformer architecture and flow matching, Stable Diffusion 3 is poised to provide a variety of options for users seeking both scalability and quality. A detailed technical report on the model is expected to be published soon.
Google Pauses Gemini Image Generation
Meanwhile, Google announced it is pausing its Gemini artificial intelligence image generation feature after saying it offers “inaccuracies” in historical pictures.
Gemini-generated pictures went viral on social media recently, leading to widespread ridicule and anger. Some users criticized Google, claiming that the company is overly concerned with being socially aware, even if it means sacrificing truth and accuracy.
Users on social media had been complaining that the AI tool generates images of historical figures — like the U.S. Founding Fathers — as people of color, calling this inaccurate.
Google, in a post on platform X, stated that its AI feature has the capability to “generate a wide range of people,” which is generally beneficial for users worldwide. However, the company acknowledged a current deficiency in the software feature, noting that it is “missing the mark here.”
Google affirmed its commitment to prompt improvement, stating that the tech giant is “working to enhance these kinds of depictions immediately.”