Voice and image-based discussion features are coming to OpenAI's ChatGPT.


The addition of voice and image-based talking functionalities to ChatGPT was announced by OpenAI on Monday. ChatGPT will become a conversational companion thanks to the voice capability, which enables users to have lively talks with it. Numerous uses for this feature are possible, such as asking bedtime stories and settling disputes at the dinner table.

The new voice functionality makes use of a cutting-edge text-to-speech technology that can produce audio that sounds remarkably like humans using text input and a small audio sample. The user experience is further improved by OpenAI's use of Whisper, their open-source speech recognition system, which guarantees correct transcription of spoken words into text.

To create a collection of five unique voices, OpenAI worked with experienced voice actors. Users can opt into voice discussions by opening the mobile app, going to "Settings" > "New Features," and then tapping the headphone symbol in the top-right corner of the home screen to choose from one of five voices for a customized experience.

The addition of image interactions in ChatGPT allows users to share and debate visual information. Users can, for instance, take images of famous places while traveling and have live conversations about their distinctive characteristics. Users can take pictures of their cupboard and refrigerator at home to help with meal planning and even get step-by-step culinary instructions.

Over the next two weeks, these functionalities will be made available to Plus and Enterprise users. Users can choose to use voice on both iOS and Android platforms by accessing their settings. On all systems, image support will be accessible.

Post a Comment

Previous Post Next Post