2024-05-14 06:21:58

OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ChatGPT

Image used under license from Shutterstock.com

OpenAI debuts GPT-4o, a groundbreaking Omnimodel integrating audio, text, and video seamlessly into ChatGPT and API services. This innovative technology is highlighted for its capability to recognize emotions in live selfies, outperforming its predecessor in non-English languages.

The new GPT-4o model offers enhanced interactivity, realistic voice conversations, and real-time processing of audio and visual inputs, setting new benchmarks for AI accessibility and functionality. OpenAI's advancements aim to provide users with a more human-like and engaging AI experience, offering various interactions such as interviews, customer service, translations, and even playful responses to pets.

heise online
13. Mai 2024 um 17:14

OpenAI presents Omnimodel: No search, no GPT-5, but GPT-4o for ChatGPT | heise online

Technology
OpenAI presents GPT-4o as an Omnimodel that integrates audio, text, and video natively. The technology will be available in ChatGPT and through the API. Additionally, emotions in live selfies are said to be recognized.
THE DECODER
13. Mai 2024 um 18:35

OpenAI's new multimodal "GPT-4 omni" combines text, vision, and audio in a single model

Technology
OpenAI has announced the release of GPT-4o, a large multimodal model that combines text, vision, and audio processing in a single neural network. GPT-4o exhibits impressive audio capabilities, can analyze video in real time, and outperforms its predecessor in non-English languages. OpenAI emphasizes the model's efficiency and affordability compared to previous versions, making it available for free in ChatGPT.
marktechpost.com
14. Mai 2024 um 02:11

OpenAI Released GPT-4o for Enhanced Interactivity and Many Free Tools for ChatGPT Free Users - MarkTechPost

Technology
Economy
OpenAI released GPT-4o, a comprehensive AI model that integrates text, audio, and visual data processing capabilities into a unified framework, significantly reducing response latency and improving user interaction. The model demonstrates enhanced performance in multilingual contexts, audio inputs, and interactive exchanges, setting new benchmarks in AI technology accessibility and functionality.
Cointelegraph.com News
14. Mai 2024 um 00:32

OpenAI’s latest upgrade essentially lets users livestream with ChatGPT

Technology
OpenAI has introduced a major upgrade called GPT Omni, which enables the chatbot to interpret video and audio in real-time and respond more convincingly like a human. The latest AI model, GPT-4o, allows users to interact with the AI in various ways, including interview preparation, customer service interactions, jokes, translations, and even playful responses to pets.
CW

Account

Waiting list for the personalized area


Welcome!

InfoBud.news

infobud.news is an AI-driven news aggregator that simplifies global news, offering customizable feeds in all languages for tailored insights into tech, finance, politics, and more. It provides precise, relevant news updates, overcoming conventional search tool limitations. Due to the diversity of news sources, it provides precise and relevant news updates, focusing entirely on the facts without influencing opinion. Read moreExpand

Your World, Tailored News: Navigate The News Jungle With AI-Powered Precision!