AI already knew how to create images. OpenAI says it has found the missing piece with the new ChatGPT Images 2.0
Over the last few years, image generators have become more spectacular, faster and more popular. The problem is that a striking image is not always a useful one to work with. It is one thing to ask for an astronaut cat and quite another to obtain a usable marketing poster, a coherent comic panel or a graphic that respects what we asked for. That’s where OpenAI now wants to move the conversation with its new model: not so much towards the pretty image as towards the useful one.
The answer. What OpenAI proposes goes in that direction. The company led by Sam Altman maintains that its new model is built not only to generate attractive images, but to solve visual assignments with more intention and less trial and error. In the presentation it went so far as to state that “images are a language, not decoration”, a fairly clear way of summarizing where it wants to take the product at a time of considerable competition. The thesis is that asking for an image in ChatGPT is less like launching a creative prompt and more like commissioning a piece that we can actually use.
The missing piece. If the firm wants us to talk about something more than showy images, it had to improve exactly the points where these models usually fail. Here it promises important changes on three very specific fronts: following complex instructions more precisely, organizing elements within the image better and reproducing dense text more reliably. In other words, the aim is not only more beautiful results, but also less ambiguous and more controllable ones.
Think before you draw. One of the novelties that OpenAI highlights most strongly is that this is its first image model with reasoning capabilities.
Translated into practical terms, the company maintains that, when a model with “thinking” is chosen within ChatGPT, the system can take more time, structure the task better, rely on the web to search for up-to-date information and review its own results before delivering the image. We tried it, asking for an image of two people walking along Gran Vía, in Madrid, near Cines Callao, and some notes on activities to do in Spain during May. The results are the ones shown in the cover image.
The keys. OpenAI talks about game prototyping, storyboards, marketing creatives, comics, social graphics and other materials where both content and form matter. To sustain that ambition, the company says it has improved on two delicate fronts: the handling of non-Latin text, with advances especially in Japanese, Korean, Chinese, Hindi and Bengali, and the more faithful reproduction of very distinctive visual styles. It also expands the possible formats, with aspect ratios of up to 3:1 and 1:3, resolution of up to 2K and, in certain modes, the possibility of generating up to ten images within the same request with continuity between characters and objects.
The competitive context. This announcement cannot be read as if OpenAI had suddenly discovered a new market. Midjourney has already become a clear reference for work with a strong artistic character, Nano Banana has attracted attention for its conversational editing capabilities and FLUX 2 has grown strong in photorealism. With that board in front of it, the company seems to be looking for another angle. Rather than contesting each terrain separately, it tries to present ChatGPT as an environment where the image is not generated in isolation, but as part of a broader workflow, something that on paper can be attractive if it really delivers what it promises.
It’s already starting to roll out. One of the keys to the announcement is that OpenAI says the model is not staying in the showcase phase, but is beginning to reach the product. The company places the rollout in ChatGPT for all users, including Free and Go, and associates the most advanced results with Plus and Pro, as also reported by Engadget. It is also bringing the model to the API and Codex, a sign that it does not want to limit it to casual use within the chat. If its strategy involves turning the image into one more work tool, it made sense for the rollout to start precisely there.
Images | Xataka with ChatGPT Images 2.0 | OpenAI