Advanced Voice Mode is finally getting vision capabilities

OpenAI has released the real-time video capabilities for ChatGPT that it demoed nearly seven months ago. Using the ChatGPT app, users subscribed to ChatGPT Plus, Team, and Pro plans can point their phones at objects and have ChatGPT respond in near-real-time. The feature can also understand what’s on a device’s screen through screen sharing.

OpenAI demo of vision capabilities in Advanced Voice Mode
Image Credits:OpenAI / Screenshot via TechCrunch

If you missed the stream, you can catch up on the replay right here.


OpenAI CEO Sam Altman speaks during the Microsoft Build conference at the Seattle Convention Center Summit Building in Seattle, Washington on May 21, 2024.
December 5, 2024 – December 18, 2024

From the Storyline: Live Updates: 12 Days of OpenAI ChatGPT announcements and reveals

OpenAI’s end of the year event is here. The company is hosting “12 Days of OpenAI,” a series of daily…

Latest in AI