D-ID launches a web application that provides ChatGPT with a synthetic human face and voice
Generative artificial intelligence chatbot ChatGPT joins the conversation with both face and voice thanks to synthetic media startup D-ID’s new chat.D-ID web app. D-ID’s text-to-video technology creates a realistic digital human appearance and converts ChatGPT responses into speech; opens up new possibilities for interacting with OpenAI’s popular conversation engine.
Video Chat GPT
The ChatGPT API acts as the brain of chat.D-ID, using D-ID’s streaming text-to-video technology to provide the audio and visual components of a photorealistic synthetic human with which users can engage in face-to-face conversation. and listening instead of writing and reading the answers. D-ID envisions the web app as extending access to ChatGPT to people who may not be able to see written text or who prefer a more human approach to conversation.
When you connect to the web application, D-ID’s synthetic host “Alice” is there to greet you and answer questions you type in the text field or speak after clicking the microphone icon. The app is still in beta, with Alice as the only face and voice, but D-ID plans to add more digital characters as an option. Finally, the company allows users to upload their own image as a personal ChatGPT face, although celebrities or public figures cannot appear. Otherwise, the concept is similar to how D-ID first gained notoriety by turning photos into videos of still images of people moving and talking.
“Our technology opens up a side of artificial intelligence that the world has never seen before,” said Gil Perry, CEO of D-ID. “Moving from a text-based interface to face-to-face speech makes the experience more impactful, enjoyable and engaging, and helps people better understand the information you’re providing. With chat.D-ID, conversations with AI will be accessible to a much wider audience, including children, the elderly, the disabled and billions of people worldwide outside of the tech community.”
D-ID released the web app shortly after introducing a new chat API for real-time streaming with generative AI tools. The API is part of the company’s Creative Reality Studio, which it launched last year to let customers design their own video avatars based on uploaded photos or synthetically generated images produced by Stable Diffusion’s text-to-image engine. The avatar can execute a script written by the user or compiled by OpenAI’s GPT-3 text generator. The chat API opens the door to real-time interactions using responses streamed from generative AI chatbots.
Follow @voicebotaiFollow @erichschwartz
D-ID launches Chat API to give face and voice to generative AI chatbots
Enterprise Virtual Being Startup Codebaby Integrates ChatGPT into Customer Service Bots
Synthetic video producer D-ID combines stable diffusion and GPT-3 on multimodal generative AI platform