Thanks to the new feature, ChatGPT now “sees” and “hears” in real time

Alexander16.12.2024

0 214 1 minute read

OpenAI company announced about the launch of a new feature for ChatGPT that significantly expands the possibilities of voice mode. The new option, called Advanced Voice, allows the chatbot to work with images, videos and respond to voice commands in real time.

Users will now be able to activate their device’s camera and show ChatGPT objects or events around them. This opens up the possibility to ask questions or get explanations based on what the bot “sees” in the frame.

During the presentation, OpenAI demonstrated the functionality of the new option. For example, a set for making coffee was placed on the table, and ChatGPT explained the preparation process step by step, providing detailed instructions. During the demonstration, the bot also answered clarifying questions.

Another interesting novelty is the ability to share the device’s screen. ChatGPT can now analyze the information on the screen and help with specific tasks. For example, if the user opens the messenger, the bot can offer replies to messages or help with text editing.

The innovation will be available to users of paid ChatGPT Plus and Pro tariff plans as early as next week. For businesses and educational institutions, the function will be available from the beginning of 2025.

The expansion of ChatGPT functions significantly increases the convenience of its use, both for household and professional tasks. The ability of the bot to respond to visual and voice information makes it even more interactive and useful in situations where a quick response to the context is important.