Site icon The Data Exchange

AI and the Future of Speech Technologies

Yishay Carmiel on Generative AI for audio, voice cloning, real-time speech translation, and more.


Subscribe: AppleSpotify OvercastGoogleAntennaPodPodcast AddictAmazon •  RSS.

Yishay Carmiel is the CEO of Meaning1, a startup at the forefront of building real-time speech applications for enterprises. We discuss the state of AI for speech and audio, including trends in Generative AI, automatic speech recognition, diarization, restoration, voice cloning, speech synthesis and more.
 

Subscribe to the Gradient Flow Newsletter

Yishay Carmiel will be speaking at the AI Conference in San Francisco (Sep 26-27). Use the discount code FriendsofBen18 to save 18% on your registration.



Interview highlights – key sections from the video version:

  1. Generative AI for Audio (text-to-speech; text-to-music; speech synthesis)
  2. Speech Translation
  3. Automatic Speech Recognition and other models that use audio inputs
  4. Speech Emotion Recognition
  5. Restoration
  6. Similarities in recent trends in NLP and Speech
  7. Diarization (speaker identification), and implementation challenges
  8. Voice cloning and risk mitigation


Exit mobile version