• ruffsl@programming.devOP
    English
    arrow-up
    11
    arrow-down
    3
    ·
    7 days ago

    I recall the author saying they’re not a native English speaker and that they prefer international intelligibility over a regional voice-over. There’s also the production convenience: they can write scripts while traveling, without needing a quiet environment for recording audio. See around the 7 min mark:

    • HappyFrog@lemmy.blahaj.zone
      arrow-up
      8
      ·
      7 days ago

      Don’t get me wrong, I like the channel. I just find the voice the AI produces to be very grating. I like that he can produce the videos he likes, and I’d rather have an AI voice than no voice at all.

      • ruffsl@programming.devOP
        English
        arrow-up
        4
        arrow-down
        1
        ·
        7 days ago

        I’ve been using TTS systems for decades for accessibility use cases, so apart from quality audiobooks that call for a skilled narrator, I no longer mind.

        In fact, I prefer legacy Bayesian phonetic models over the newer convolutional and recurrent neural networks. Their hard consonants and robotic consistency in pronunciation and intonation are much easier to listen to and discern at higher words per minute, such as the 3x or 4x natural speech rates used for everyday blind reading, compared to the mumbled or slurred syllables, artificial stridor, and other breathy sounds of modern voices.