2024-05-15 15:03:12

GPT-4o: Advancing AI's Understanding of Sound, Images, and Text

Image used under license from Shutterstock.com

OpenAI's unveiling of GPT-4o marks a significant leap in AI technology, showcasing an Omnimodel that comprehends audio, text, and images seamlessly. In contrast, Google faces challenges in conveying the essence of its complex AI products to the public.

Despite Elon Musk's criticism of GPT-4o, experts praise its human-like capabilities in processing various data types. Additionally, Google introduces 'Astra,' a multimodal AI project, aiming to create a universal AI assistant.

OpenAI's ChatGPT app for MacOS further enhances accessibility to advanced AI models, facilitating tasks like analyzing videos and interpreting facial expressions.

heise online
14. Mai 2024 um 15:06

GPT-4o: AI is supposed to understand sound, images, and text equally – without translation | heise online

Technology
OpenAI has announced GPT-4o, an Omnimodel for Artificial Intelligence. The model is supposed to understand audio, text, and images and has improved text and image competence. It can decipher handwriting, interpret images, and store the context of conversations.
heise online
15. Mai 2024 um 10:36

Google showcases cool products, but hardly anyone understands it | heise online

Technology
Economy
Google has a variety of AI products in its portfolio, which are difficult for most people to understand. Although the products are good, Google fails to present their innovations and ideas understandably at the Google I/O conference. In contrast, OpenAI has demonstrated an Omnimodel with GPT-4o, which can process text, audio, and vision natively and appeals to many people through concrete examples. Google struggles with naming its products, but is keen on supplementing the gigantic knowledge b..
marktechpost.com
15. Mai 2024 um 04:15

Excited about GPT-4o? Now Check out Google AI's New Project 'Astra': The Multimodal Answer to the New ChatGPT - MarkTechPost

Technology
Google's new Project Astra is a multimodal AI agent that aims to be a universal AI assistant, capable of seeing, talking, and understanding the world like humans. The project was introduced during Google's recent event, Google I/O '24, showcasing significant progress in AI technology.
Số hóa - VnExpress
15. Mai 2024 um 03:25

Elon Musk Criticizes OpenAI's GPT-4o Demo

Technology
Elon Musk criticizes the newly launched GPT-4o by OpenAI, describing the demo as cringeworthy. The updated AI model, GPT-4o, has impressed experts with its human-like capabilities, including real-time inference of images, sound, and text. Musk's comments on social media suggest that he is unimpressed with OpenAI's new product.
CW

Account

Waiting list for the personalized area


Welcome!

InfoBud.news

infobud.news is an AI-driven news aggregator that simplifies global news, offering customizable feeds in all languages for tailored insights into tech, finance, politics, and more. It provides precise, relevant news updates, overcoming conventional search tool limitations. Due to the diversity of news sources, it provides precise and relevant news updates, focusing entirely on the facts without influencing opinion. Read moreExpand

Your World, Tailored News: Navigate The News Jungle With AI-Powered Precision!