OpenAI upgraded its ChatGPT chatbot with groundbreaking technology designed to generate images from highly detailed and complex instructions.
Table of Contents
From Text to Elaborate Images

Image Credits: @btibor91/X (Tibor Blaho), OpenAI
For example, if you insert the following image and prompt the ChatGPT “turn this into a triple A video games made with a 4k game engine and add some User interface as overlay from a mystery RPG where we can see a health bar and a minimap at the top as well as spells at the bottom with consistent and iconography”, The ChatGPT generates the following.
Before

Image Credits: OpenAI
After

Image Credits: OpenAI
Previous versions of ChatGPT could generate images, but they lacked the ability to combine diverse and intricate concepts. This latest update marks a significant shift in artificial intelligence capabilities.
Once focused purely on text generation, chatbots are evolving into multi-functional tools that integrate conversational abilities with other advanced functions.
Introducing GPT-4o: Multi-Modal Capabilities
4o image generation has arrived.
— OpenAI (@OpenAI) March 25, 2025
It's beginning to roll out today in ChatGPT and Sora to all Plus, Pro, Team, and Free users. pic.twitter.com/pFXDzKhh2t
The new version of ChatGPT, powered by GPT-4o, supports not only text-based interactions but also responds to voice commands, images, and videos — and it can even speak.
The original ChatGPT, launched in late 2022, learned by analyzing massive amounts of online text. It excelled at answering questions, crafting poetry, and generating code. However, it couldn’t create images.
About a year later, OpenAI introduced DALL-E — an image-generating model separate from ChatGPT. Now, OpenAI has merged these capabilities into one comprehensive system that learns from both text and images. This enables ChatGPT to generate visuals informed by its extensive knowledge base.
Similar Story: Google’s new AI model is used to remove watermarks
A Unified System for Text and Images
“This is a completely new kind of technology under the hood,” said Gabriel Goh, an OpenAI researcher. “We don’t break up image generation and text generation. We want it all to be done together.”
Historically, A.I. image generators struggled to produce visuals that diverged from existing images. For instance, creating an image of a bicycle with triangular wheels posed a challenge.
According to Goh, the latest ChatGPT can now handle such unusual requests seamlessly.
Available to Free and Paid Users
Starting Tuesday, this enhanced version of ChatGPT is accessible to both free and paid users. This includes ChatGPT Plus, priced at $20 per month, and ChatGPT Pro, a premium $200-a-month service offering access to OpenAI’s newest tools.
If you have enjoyed this article, consider sharing it with your friends and family, and subscribe to my newsletter and push notifications for FREE to stay updated with the latest tech news and gadgets. Thank you for reading this further.
FAQ – Frequently Asked Questions
What is the latest ChatGPT update about?
OpenAI’s latest update enables ChatGPT to generate images from complex descriptions using GPT-4o.
Can ChatGPT now handle voice and video?
Yes, the new version supports voice commands, images, and videos, alongside text.
Is the new ChatGPT available for free users?
Yes, both free and paid users can access the updated version starting Tuesday.
How is this update different from previous versions?
The new version integrates text and image generation into one system, enhancing performance.
What is GPT-4o?
GPT-4o is the technology powering the latest ChatGPT, enabling multi-modal capabilities.