• HappyFrog@lemmy.blahaj.zone
    link
    fedilink
    arrow-up
    8
    ·
    6 days ago

    Don’t get me wrong, I like the channel, I just find the voice AI produces to be very grading. I like that he can produce the videos he likes, and I’d rather have a AI voice than no voice at all.

    • ruffsl@programming.devOP
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      6 days ago

      I’ve been using TTS systems for decades with accessibility use cases, so other than quality audio books that necessitate a skilled performing narrator, I no longer mind.

      In fact, I prefer legacy Bayesian phonetic models over the newer convolutional and recurrent neural networks, as their hard consonants and robotic consistency in pronunciations and intonation are much easier to listen and discern at higher words per minute, like at 3x or 4x natural speech rates for everyday blind reading, as compared to modern mumbling/slurring of syllables or artificial stridor and other breathy sounds.