
ChatGPT Beta Code Hints at Upcoming Live Camera Feature for Voice Mode
Recent code discoveries suggest ChatGPT's vision capabilities are nearing release. Android Authority found references to a "Live camera" feature in ChatGPT's beta version v1.2024.317, specifically in the Advanced Voice Mode section.
The code includes a safety warning about not using Live camera "for live navigation or decisions that may impact your health or safety," along with instructions to "Tap the camera icon to let ChatGPT view and chat about your surroundings."
This development builds on OpenAI's GPT-4o demonstration from last May, where the system showed ability to process visual information using mobile or desktop cameras. During the demo, GPT-4o successfully identified and remembered details about subjects, including recognizing a dog named "Bowser" playing with a tennis ball.
While vision capabilities have been limited to alpha testers since the initial demonstration, OpenAI has made progress in other areas, launching Advanced Voice Mode for ChatGPT Plus and Team users in September.
OpenAI continues to expand ChatGPT's capabilities, recently introducing ChatGPT Search for real-time web information access. Reports suggest the company is also developing an agent capable of performing complex multi-step tasks, including code writing and web browsing, with a potential January release date.
The implementation of vision features would complete the suite of GPT-4o capabilities previewed last year, marking another significant advancement in ChatGPT's evolution.