• ruffsl@programming.devOP
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      3
      ·
      2 days ago

      I recall the author saying they’re not a native English speaker, and preferring international intelligibility over regional voice-over, plus the production convenience while traveling and script writing without a quiet audio recording environment. See around 7 min mark:

      • HappyFrog@lemmy.blahaj.zone
        link
        fedilink
        arrow-up
        8
        ·
        2 days ago

        Don’t get me wrong, I like the channel, I just find the voice AI produces to be very grading. I like that he can produce the videos he likes, and I’d rather have a AI voice than no voice at all.

        • ruffsl@programming.devOP
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          1
          ·
          2 days ago

          I’ve been using TTS systems for decades with accessibility use cases, so other than quality audio books that necessitate a skilled performing narrator, I no longer mind.

          In fact, I prefer legacy Bayesian phonetic models over the newer convolutional and recurrent neural networks, as their hard consonants and robotic consistency in pronunciations and intonation are much easier to listen and discern at higher words per minute, like at 3x or 4x natural speech rates for everyday blind reading, as compared to modern mumbling/slurring of syllables or artificial stridor and other breathy sounds.