OpenAI has finally added long-awaited video and screen sharing to its advanced voice mode, allowing users to interact with the chatbot in multiple modalities.
Both capabilities are now available on the iOS and Android mobile apps for ChatGPT Team, Plus and Pro users, and will be rolled out to ChatGPT Enterprise and Edu subscribers in January. However, users in the EU, Switzerland, Iceland, Norway and Liechtenstein won’t be able to access advanced voice mode.
OpenAI first teased the feature in May, when the company unveiled GPT-4o and discussed ChatGPT learning to “watch” a game and explain what’s happening. Advanced voice mode was rolled out to users in September.
Users can start a video using new buttons on the advanced voice mode screen.
OpenAI’s video mode feels like a video call, similar to FaceTime, because ChatGPT responds in real time to what users show in the video. It can see what’s around the user, identify objects and even remember people who introduce themselves. In an OpenAI demo that was part of the company’s “12 Days of Shipmas” event, ChatGPT used the video feature to help brew coffee. ChatGPT saw the coffee paraphernalia, instructed when to put in a filter and critiqued the result.
It is also very similar to Google’s recently announced Project Astra, in which users can open a video chat and Gemini 2.0 will answer questions about what it sees, such as identifying a sculpture found on a London street. In many ways, these features are more advanced versions of what AI devices like the Humane Pin and the Rabbit r1 were marketed to do: have an AI voice assistant answer questions about what it’s seeing in a video.
Sharing a screen
The new screen-sharing feature brings ChatGPT out of the app and into the realm of the browser.
For screen share, a three-dot menu lets users navigate out of the ChatGPT app. They can open apps on their phones and ask ChatGPT questions about what it’s seeing. In the demo, OpenAI researchers triggered screen share, then opened the messages app to ask ChatGPT for help responding to a photo sent via text message.
However, the screen-sharing feature in advanced voice mode bears similarities to recently released features from Microsoft and Google.
Last week, Microsoft released a preview version of Copilot Vision, which lets Pro subscribers open a Copilot chat while browsing a webpage. Copilot Vision can look at photos on a store’s website and even help play the map guessing game GeoGuessr. Google’s Project Astra can also read browsers in the same way.
Both Google and OpenAI released screen-sharing AI chat features on phones to target the consumer base that may be using ChatGPT or Gemini more on the go. But these features may also signal a way for enterprises to collaborate more with AI agents, since the agent can see what a person is looking at onscreen. It could be a precursor to models that use computers, like Anthropic’s Computer Use, where the AI model isn’t only looking at a screen but is actively opening tabs and programs for the user.
Ho ho ho, ask Santa a question
In a bid for levity, OpenAI also rolled out “Santa Mode” in advanced voice mode. The new preset voice sounds much like the jolly old man in a red suit.
Unlike the new features restricted to specific users, “Santa Mode” is available until early January to users with access to advanced voice mode on the mobile app, the web version of ChatGPT and the Windows and macOS apps.
Chats with Santa, though, will not be saved in chat history and will not affect ChatGPT’s memory.
Even OpenAI is feeling the Christmas spirit.