• Home
  • Blog
  • Android
  • Cars
  • Gadgets
  • Gaming
  • Internet
  • Mobile
  • Sci-Fi
Tech News, Magazine & Review WordPress Theme 2017
  • Home
  • Blog
  • Android
  • Cars
  • Gadgets
  • Gaming
  • Internet
  • Mobile
  • Sci-Fi
No Result
View All Result
  • Home
  • Blog
  • Android
  • Cars
  • Gadgets
  • Gaming
  • Internet
  • Mobile
  • Sci-Fi
No Result
View All Result
Blog - Creative Collaboration
No Result
View All Result
Home Android

OpenAI multimodal digital assistant could launch soon

May 11, 2024
Share on FacebookShare on Twitter

Edgar Cervantes / Android Authority

TL;DR

  • On Monday, OpenAI is holding an event that could see an announcement about a new multimodal digital assistant.
  • Being multimodal would allow the assistant to use images for prompts, such as identifying and translating a sign in the real world.
  • This would be a direct threat against Google’s digital assistants, namely Google Assistant and the newer Gemini.

Over the past few weeks, the rumor mill has been churning, suggesting that OpenAI — the company responsible for ChatGPT — could soon launch an AI-powered search engine, which would be a direct threat to Google’s core business. Given how prominent ChatGPT has become in such a short time, this would represent the first real threat to Google Search in decades.

However, it’s looking less likely that OpenAI has a search engine on the way (via The Information). Instead, new rumors suggest that OpenAI’s scheduled event on Monday could see the company announcing a multimodal digital assistant. While not a traditional search engine, it would still allow people to search for things using the power of AI, so it would still be a significant threat to Google.

Multimodal means the AI can handle multiple input forms, not just text. In the case of this rumored digital assistant, it would be able to link to a camera, process real-world information, and then speak back to you with more information on what it sees. For example, you could point a camera at a sign in a different language and ask ChatGPT to both identify and translate the sign for you, and the AI would speak to you in response.

If this sounds familiar, that’s because it’s something Google Lens, Google Assistant, and, most recently, Google Gemini already do. In fact, ChatGPT can already do this, too, but not through one interface. In other words, Monday’s launch could see the company announce an upgraded GPT model that offers faster, more accurate responses with both image input and audible responses packaged into an app. In other words, a direct competitor to Gemini (and, subsequently, Google Assistant and Apple’s Siri).

To be clear, this would almost certainly not be GPT-5, the long-awaited follow-up to GPT-4 and GPT-4 Turbo. The company has indicated that GPT-5 isn’t coming to this event. The Information suggests it will only land sometime late in 2024.

Got a tip? Talk to us! Email our staff at news@androidauthority.com. You can stay anonymous or get credit for the info, it’s your choice.

You might like

Next Post

Apple is reportedly about to change how we use Siri by adding generative AI

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

No Result
View All Result

Recent Posts

  • Upcoming Beat ’Em Ups in 2026 Worth Keeping an Eye On
  • No new phones coming in 2026
  • The complete list of winners at the 2026 Critics’ Choice Awards
  • TMNT Magic: The Gathering Booster Box Preorders Steeply Discounted
  • Award-Winning Sci-Fi Series Children Of Time Gets New Box Set, And Amazon’s Launch Discount Is Wild

Recent Comments

    No Result
    View All Result

    Categories

    • Android
    • Cars
    • Gadgets
    • Gaming
    • Internet
    • Mobile
    • Sci-Fi
    • Home
    • Shop
    • Privacy Policy
    • Terms and Conditions

    © CC Startup, Powered by Creative Collaboration. © 2020 Creative Collaboration, LLC. All Rights Reserved.

    No Result
    View All Result
    • Home
    • Blog
    • Android
    • Cars
    • Gadgets
    • Gaming
    • Internet
    • Mobile
    • Sci-Fi

    © CC Startup, Powered by Creative Collaboration. © 2020 Creative Collaboration, LLC. All Rights Reserved.

    Get more stuff like this
    in your inbox

    Subscribe to our mailing list and get interesting stuff and updates to your email inbox.

    Thank you for subscribing.

    Something went wrong.

    We respect your privacy and take protecting it seriously