2024-05-14 06:21:58

OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ChatGPT

OpenAI debuts GPT-4o, an "omnimodel" that natively integrates audio, text, and video and will be available in ChatGPT and through the API. The model is reported to recognize emotions in live selfies and to outperform its predecessor in non-English languages.

The new GPT-4o model offers enhanced interactivity, realistic voice conversations, and real-time processing of audio and visual inputs, setting new benchmarks for AI accessibility and functionality. OpenAI's advancements aim to provide users with a more human-like and engaging AI experience, offering various interactions such as interviews, customer service, translations, and even playful responses to pets.

heise online
13 May 2024 at 17:14

OpenAI presents Omnimodel: No search, no GPT-5, but GPT-4o for ChatGPT

Technology
OpenAI presents GPT-4o as an Omnimodel that integrates audio, text, and video natively. The technology will be available in ChatGPT and through the API. Additionally, emotions in live selfies are said to be recognized.
The Guardian
13 May 2024 at 18:59

New GPT-4o AI model is faster and free for all users, OpenAI announces

Technology
OpenAI announced the launch of its new GPT-4o AI model, offering faster and more accurate capabilities to free users. The updates also included improved language capabilities and the ability to analyze images, audio, and text documents. The company demonstrated the model's voice assistant and potential partnerships with Apple's iPhone operating system.
Webrazzi
13 May 2024 at 18:24

OpenAI's new model that can read people's emotions from facial expressions: GPT-4o

Technology
OpenAI introduced GPT-4o, the newest iteration of GPT-4, at its spring update event. GPT-4o works faster and more efficiently across text, audio, and video. In addition to its translation capabilities, the model is said to read people's emotions from video images.
The Verge
13 May 2024 at 18:56

ChatGPT Upgrades Voice Mode to Resemble Her's AI Assistant

Technology
Upgrades to ChatGPT's voice mode bring it closer to a responsive AI assistant like in the movie Her. OpenAI demonstrated the new capabilities, including reading facial expressions and translating spoken language in real time.
DER SPIEGEL
13 May 2024 at 18:35

ChatGPT can now sing - DER SPIEGEL

Technology
OpenAI presents the new language model GPT-4o, which can process text, audio, and images and switch between tones, voices, and up to 50 languages in its voice output. With the update, ChatGPT is expected to be more natural and faster, allowing fluid conversations. The new version of ChatGPT will be available free of charge to all users, while paying users will receive additional capabilities and improved voice output.

InfoBud.news

infobud.news is an AI-driven news aggregator that simplifies global news, offering customizable feeds in all languages for tailored insights into tech, finance, politics, and more. Drawing on a diverse range of news sources, it provides precise, relevant news updates that overcome conventional search tool limitations, focusing entirely on the facts without influencing opinion.

Your World, Tailored News: Navigate The News Jungle With AI-Powered Precision!