OpenAI has announced that its improved artificial intelligence (AI) model GPT-4 Turbo with Vision capabilities is now available in ChatGPT. This means that users will now be able to ask the AI chatbot to process and analyse multimedia inputs as the AI model can analyse images and provide textual responses to questions about them.
“Majorly improved GPT-4 Turbo model available now in the API and rolling out in ChatGPT,” the company said in a post on X (formerly Twitter).
It is to be noted that language model systems have been limited by taking in a single input modality, text. In layman’s words, users can largely communicate to AI chatbots by writing text commands but GPT-4 with Vision will allow users to input photos and videos to get a response about them.
For example, if you find a photo of a cute puppy but don’t know about its breed, you can upload the photo in ChatGPT and ask the AI chatbot to tell you about the breed and other answers to questions like maintenance, exercise needs, among others.
Updates in DALL-EThe development comes a few days after the ChatGPT-maker announced that its text-to-speech model DALL-E can now allow users to edit an image in more than one way. The company launched a ‘new’ editor interface that will enable users to minutely edit images by selecting an area of the image to edit and describing the changes in chat.
“You can also provide a prompt with your desired edit in the conversation panel, without using the selection tool,” the company said in a post. The DALL-E editor editor interface provides multiple options to highlight parts of your generated image that users want to update.
“Majorly improved GPT-4 Turbo model available now in the API and rolling out in ChatGPT,” the company said in a post on X (formerly Twitter).
It is to be noted that language model systems have been limited by taking in a single input modality, text. In layman’s words, users can largely communicate to AI chatbots by writing text commands but GPT-4 with Vision will allow users to input photos and videos to get a response about them.
For example, if you find a photo of a cute puppy but don’t know about its breed, you can upload the photo in ChatGPT and ask the AI chatbot to tell you about the breed and other answers to questions like maintenance, exercise needs, among others.
Updates in DALL-EThe development comes a few days after the ChatGPT-maker announced that its text-to-speech model DALL-E can now allow users to edit an image in more than one way. The company launched a ‘new’ editor interface that will enable users to minutely edit images by selecting an area of the image to edit and describing the changes in chat.
“You can also provide a prompt with your desired edit in the conversation panel, without using the selection tool,” the company said in a post. The DALL-E editor editor interface provides multiple options to highlight parts of your generated image that users want to update.
end of article
Source: ChatGPT is getting this ‘major’ improvement: Here’s how it will help users – Times of