ChatGPT Upgrades Image Generation with GPT-4o for More Accurate AI Creations

ChatGPT now has built-in image generation, in one of its biggest updates of the year. In a live stream on 25 March, OpenAI CEO Sam Altman announced that the company’s GPT-4o model can now create and modify images. Until now, the model itself could only generate and edit text.

This feature is currently available in ChatGPT and Sora for subscribers to the company’s $200-a-month Pro Plan. The company also announced that it will soon be available to Plus and free users, as well as developers using the company’s API service.

OpenAI announced the GPT-4o model in May 2024. The ‘o’ in the name stands for ‘omni’, reflecting the model’s ability to handle text, speech, and video. The company claims it spent a year training the model to generate realistic images, with the help of more than 100 human workers. The company told the Wall Street Journal:

“Today’s refined GPT-4o model makes it easier for consumers, and businesses, to create more life-like images and paragraphs of comprehensible text, and even company logos and slide decks.”

Replacing DALL-E 3

According to the company, the new model ‘thinks’ a bit longer than the previous image-generation model before producing an image. In exchange, it replaces DALL-E 3 and produces more accurate and detailed images. The feature also lets the chatbot edit pictures with people in them, including details such as the foreground and background.

Reinforcement Learning From Human Feedback

This new feature is based on ‘reinforcement learning from human feedback’ (RLHF), a technique widely used by AI companies to train their models. Because the chatbot has over 400 million weekly users, the work of these human trainers could significantly shape what people see.

In a review, GoDaddy’s Chief Data and Analytics Officer, Travis Muhlestein, said the chatbot is

“helping us embrace AI-driven content creation.”

GoDaddy uses the platform to create stock images and logos.

Artists’ Rights

Responding to artists’ concerns over copyright, the Chief Operating Officer of OpenAI said:

“We’re respecting of the artists’ rights in terms of how we do the output, and we have policies in place that prevent us from generating images that directly mimic any living artists’ work.”

Gemini 2.0 

The ChatGPT update follows Google’s launch of Gemini 2.0, which also includes an image-generation feature. That feature enabled users to remove watermarks and depict copyrighted characters.


Shahid Anwar
Shahid Anwar is a senior technology journalist at TECHi, specializing in artificial intelligence, emerging technologies, and the digital industry. With years of experience covering breakthroughs in AI, big tech innovations, and future-driven advancements, he delivers in-depth analysis, exclusive reports, and insightful coverage of the ever-evolving tech landscape.
