https://gitlab.com/christosangel/sapo3

  • Sapo3 is a suite of scripts-tools that can help the user convert a text file to an audio file.

  • It uses the tts-edge API for text-to-speech conversion.

  • Big txt files can be easily converted to audio books, using a wide range of customization capabilities.

When the user runs Sapo3, they will be presented with a menu of options:

  • o option: Fix name pronunciation with Fix Names

  • c option: Split text to chapters with Chapterize

  • v option: Convert File to audio

  • f option: Check every sentence outcome with Fix Audio option.

  • m option: Merging Audio Files

  • p option: Configuring Preferences

  • lime!
    link
    fedilink
    English
    arrow-up
    2
    ·
    11 hours ago

    shouldn’t there at least be an option to use speech-dispatcher?

      • lime!
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        7 hours ago

        i believe that’s what speech-dispatcher is; a uniform interface for tts systems.

        • christos@lemmy.worldOP
          link
          fedilink
          arrow-up
          1
          ·
          7 hours ago

          speech-dispatcher

          If you are referring to locally generated speech synthesis, the respecting outcome as far as I am concerned generally sounds generally poorer, and is more difficult to manage. However you can check out the original project https://gitlab.com/christosangel/sapo, where the audio files are generated locally.

          • lime!
            link
            fedilink
            English
            arrow-up
            3
            ·
            6 hours ago

            well speech-dispatcher has no synthesis component, you can plug in any tts engine that follows the interface. it’s nice to have a choice in engine just by implementing the support. personally i use piper which i feel gives a pretty good performance.

            • christos@lemmy.worldOP
              link
              fedilink
              arrow-up
              3
              ·
              6 hours ago

              piper

              Indeed piper performs very well. Thank you for the input, I will most certainly consider adding the option to select tts engine in the near future, piper sounds totally worth it.