floyo logo
Workflows
Pricing
floyo logo
Workflows
Pricing
Last updated
2026-05-19

A custom node integration for ComfyUI, the TTS Audio Suite enables local multi-engine and multi-language Text-to-Speech (TTS) and voice conversion. It supports a variety of engines, including RVC, Echo-TTS, and Qwen3-TTS, offering extensive audio processing capabilities like character support, SRT timing, and advanced audio editing tools.

  • Supports multiple TTS engines, allowing users to select from a wide range of voice synthesis options tailored to various languages and styles.
  • Includes advanced subtitle processing features, enabling the generation and timing of SRT files, which enhances multimedia projects by ensuring accurate synchronization of audio and text.
  • Offers powerful audio editing capabilities, including emotion control, voice conversion, and intelligent chunking for long texts, improving the overall quality and flexibility of audio outputs.

Context

The TTS Audio Suite is an extension designed for ComfyUI that integrates multiple Text-to-Speech engines, allowing users to generate audio from text in various languages and styles. Its purpose is to enhance audio production workflows by providing tools for voice conversion, audio editing, and subtitle generation, all within a unified interface.

Key Features & Benefits

The suite's key features include support for various TTS engines, enabling users to choose the most suitable voice synthesis technology based on their needs. Additionally, it offers advanced subtitle processing, allowing users to generate and edit SRT files directly, which is crucial for projects that require precise audio and text synchronization. The audio editing capabilities, such as emotion control and intelligent chunking, significantly enhance the quality and expressiveness of the generated audio.

Advanced Functionalities

The TTS Audio Suite includes advanced functionalities such as iterative voice conversion and real-time voice conversion through RVC, allowing users to refine audio outputs progressively. It also features a comprehensive character switching system, enabling seamless transitions between different voice profiles in a single project. The integration of emotion control provides the ability to convey nuanced expressions in speech, making the audio outputs more engaging and lifelike.

Practical Benefits

This tool streamlines the workflow for audio production within ComfyUI, allowing users to maintain control over various aspects of TTS generation, including voice selection, timing, and emotional expression. By facilitating the creation of high-quality audio outputs with precise timing and character management, the TTS Audio Suite improves efficiency and enhances the overall quality of multimedia projects.

Credits/Acknowledgments

The TTS Audio Suite is developed by Diogod, building upon the original ChatterBox Voice project by ShmuelRonen. It is released under the MIT License, allowing for open-source collaboration and contributions from the community.

Inner Nodes

ASRPunctuationTruecaseNode
AudioAnalyzerNode
AudioAnalyzerOptionsNode
CharacterVoicesNode
ChatterBoxAudioAnalyzer
ChatterBoxAudioAnalyzerOptions
ChatterBoxEngineNode
ChatterBoxF5TTSEditOptions
ChatterBoxF5TTSEditVoice
ChatterBoxOfficial23LangEngineNode
ChatterBoxVoiceCapture
CosyVoice Engine
CosyVoiceEngineNode
EchoTTSEngineNode
F5TTSEngineNode
GraniteASREngineNode
HiggsAudioEngineNode
IndexTTS Engine
IndexTTSEmotionOptionsNode
IndexTTSEngineNode
LoadRVCModelNode
MergeAudioNode
MossTTSEngineNode
MouthMovementAnalyzer
PhonemeTextNormalizer
Qwen3TTSEngineNode
Qwen3TTSVoiceDesignerNode
QwenEmotionNode
RVCDatasetPrepNode
RVCEngineNode
RVCPitchOptionsNode
RVCTrainingConfigNode
RefreshVoiceCacheNode
SRTAdvancedOptionsNode
Step Audio EditX Engine
StepAudioEditXAudioEditorNode
StepAudioEditXEngineNode
StringMultilineTagEditor
TextToSRTBuilderNode
UnifiedASRTranscribeNode
UnifiedModelTrainingNode
UnifiedTTSSRTNode
UnifiedTTSTextNode
UnifiedVoiceChangerNode
VibeVoiceEngineNode
VisemeDetectionOptionsNode
VocalRemovalNode
VoiceFixerNode