OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ...

2024-05-14 06:21:58

OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ChatGPT

Image used under license from Shutterstock.com

OpenAI debuts GPT-4o, a groundbreaking Omnimodel integrating audio, text, and video seamlessly into ChatGPT and API services. This innovative technology is highlighted for its capability to recognize emotions in live selfies, outperforming its predecessor in non-English languages.

The new GPT-4o model offers enhanced interactivity, realistic voice conversations, and real-time processing of audio and visual inputs, setting new benchmarks for AI accessibility and functionality. OpenAI's advancements aim to provide users with a more human-like and engaging AI experience, offering various interactions such as interviews, customer service, translations, and even playful responses to pets.

heise online

13. Mai 2024 um 17:14

OpenAI presents Omnimodel: No search, no GPT-5, but GPT-4o for ChatGPT | heise online

Technology

OpenAI presents GPT-4o as an Omnimodel that integrates audio, text, and video natively. The technology will be available in ChatGPT and through the API. Additionally, emotions in live selfies are said to be recognized.

The Guardian

13. Mai 2024 um 18:59

New GPT-4o AI model is faster and free for all users, OpenAI announces

Technology

OpenAI announced the launch of its new GPT-4o AI model, offering faster and more accurate capabilities to free users. The updates also included improved language capabilities and the ability to analyze images, audio, and text documents. The company demonstrated the model's voice assistant and potential partnerships with Apple's iPhone operating system.

Webrazzi

13. Mai 2024 um 18:24

OpenAI's new model that can read people's emotions from facial expressions: GPT-4o

Technology

OpenAI introduced the new GPT-4 iteration, GPT-4o, at the spring update event. GPT-4o works faster and more efficiently in the text, audio, and video fields. It was mentioned that the model can read people's emotions from video images in addition to its translation capabilities.

The Verge

13. Mai 2024 um 18:56

ChatGPT Upgrades Voice Mode to Resemble Her's AI Assistant

Technology

Upgrades to ChatGPT's voice mode bring it closer to a responsive AI assistant like in the movie Her. OpenAI demonstrated the new capabilities, including reading facial expressions and translating spoken language in real time.

DER SPIEGEL

13. Mai 2024 um 18:35

ChatGPT kann jetzt singen - DER SPIEGEL

Tecnología

OpenAI presenta el nuevo modelo de lenguaje GPT-4o, que puede procesar texto, audio e imágenes y cambiar entre tonalidades, voces y hasta 50 idiomas en la salida de voz. Con la actualización, se espera que ChatGPT sea más natural y rápido para permitir conversaciones sin problemas. La nueva versión de ChatGPT estará disponible de forma gratuita para todos los usuarios, mientras que los usuarios de pago recibirán capacidades adicionales y salida de voz mejorada.

Account

Waiting list for the personalized area

Welcome!

infobud.news is an AI-driven news aggregator that simplifies global news, offering customizable feeds in all languages for tailored insights into tech, finance, politics, and more. It provides precise, relevant news updates, overcoming conventional search tool limitations. Due to the diversity of news sources, it provides precise and relevant news updates, focusing entirely on the facts without influencing opinion. Read moreExpand

OpenAI presents Omnimodel: No search, no GPT-5, but GPT-4o for ChatGPT | heise online

New GPT-4o AI model is faster and free for all users, OpenAI announces

OpenAI's new model that can read people's emotions from facial expressions: GPT-4o

ChatGPT Upgrades Voice Mode to Resemble Her's AI Assistant

ChatGPT kann jetzt singen - DER SPIEGEL

OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ChatGPT

Account

Welcome!

Top Newsworthy Stocks

Front Page Figures

Global Hotspots

News

About

Legal

Contact

OpenAI Introduces GPT-4o: Enhancing Multimodal Capabilities in ChatGPT

Related news on that topic:

OpenAI Introduces GPT-4o Omnimodel for ChatGPT with Multimodal ... Capabilities

OpenAI's GPT-4o Revolutionizes Conversational AI with Real-time ... Interpretation

The press radar on this topic:

OpenAI presents Omnimodel: No search, no GPT-5, but GPT-4o for ChatGPT | heise online

New GPT-4o AI model is faster and free for all users, OpenAI announces

OpenAI's new model that can read people's emotions from facial expressions: GPT-4o

ChatGPT Upgrades Voice Mode to Resemble Her's AI Assistant

ChatGPT kann jetzt singen - DER SPIEGEL

Account

Welcome!

Top Newsworthy Stocks

Front Page Figures

Global Hotspots

News

About

Legal

Contact