Odaily Planet Daily News OpenAI announced that its product ChatGPT will undergo a major upgrade, adding voice and image interaction functions. Users can now have voice conversations with ChatGPT and search using images.
The speech feature is powered by a new text-to-speech model that generates human-like sounds from text and seconds of sampled speech. OpenAI said it worked with well-known voice actors to create five different voices, and its open-source Whisper speech recognition system was used to transcribe spoken words into text.
Additionally, Spotify, a launch partner, has launched a new feature that allows podcasters to translate their shows from English to other languages while retaining the original voice.
The new features will begin rolling out to paid Plus and Enterprise subscribers over the next two weeks. (TechCrunch)