Fish Speech: Open source text-to-speech synthesis
Transformer-based TTS with voice cloning from reference audio.
Fish Speech is an open-source text-to-speech synthesis system that generates natural speech audio from text input using transformer-based neural network architectures. The system implements voice cloning capabilities by analyzing reference audio samples to extract speaker characteristics, which are then applied during the synthesis process to reproduce the target voice. It processes text through multiple stages including linguistic analysis, acoustic feature prediction, and neural vocoding to produce waveform output. The architecture separates the text-to-acoustic-feature generation from the vocoding stage, allowing for modular optimization of each component in the speech synthesis pipeline.
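To make the two-stage split concrete, here is a minimal, purely illustrative PyTorch sketch (toy module names and sizes, not Fish Speech's actual architecture or classes): one module maps text tokens to frame-level acoustic features, and a separate module turns those features into a waveform, so each stage can be developed and optimized on its own.

import torch
import torch.nn as nn

class AcousticModel(nn.Module):
    """Toy stand-in for the text-to-acoustic-feature stage."""
    def __init__(self, vocab_size=256, hidden=128, n_mels=80):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model=hidden, nhead=4, batch_first=True),
            num_layers=2,
        )
        self.to_mel = nn.Linear(hidden, n_mels)

    def forward(self, token_ids):
        # token_ids: (batch, text_len) -> acoustic features (batch, text_len, n_mels)
        return self.to_mel(self.encoder(self.embed(token_ids)))

class Vocoder(nn.Module):
    """Toy stand-in for the neural vocoding stage (features -> waveform)."""
    def __init__(self, n_mels=80, upsample=256):
        super().__init__()
        self.proj = nn.Linear(n_mels, upsample)

    def forward(self, mels):
        # mels: (batch, frames, n_mels) -> waveform (batch, frames * upsample)
        return self.proj(mels).flatten(start_dim=1)

tokens = torch.randint(0, 256, (1, 32))        # pretend tokenized text
waveform = Vocoder()(AcousticModel()(tokens))  # stage 1 feeds stage 2
print(waveform.shape)                          # torch.Size([1, 8192])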

Transformer-based architecture
Uses transformer models for semantic token prediction combined with VQVAE quantization, enabling efficient discrete representation of speech content.
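As a rough illustration of what the discrete representation means (the codebook size, dimensions, and names below are invented for the example, not taken from the Fish Speech codebase), vector quantization replaces each continuous frame embedding with the index of its nearest codebook entry:

import torch

# Illustrative vector quantization step (the "VQ" in VQ-VAE): each continuous
# frame embedding is replaced by the index of its nearest codebook vector,
# giving a compact discrete token sequence for the speech content.
codebook = torch.randn(1024, 64)           # 1024 codes, 64 dims each (made-up sizes)
frames = torch.randn(200, 64)              # 200 continuous frame embeddings

distances = torch.cdist(frames, codebook)  # (200, 1024) pairwise L2 distances
token_ids = distances.argmin(dim=1)        # one discrete token per frame
quantized = codebook[token_ids]            # vectors the decoder actually sees
print(token_ids.shape, quantized.shape)    # torch.Size([200]) torch.Size([200, 64])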
Emotional speech control
Supports multiple emotional markers and tone specifications during synthesis, allowing fine-grained control over prosody and expression in generated speech.
Voice cloning from samples
Enables speaker adaptation through reference audio input, allowing synthesis in arbitrary speaker voices without requiring extensive speaker-specific training data.
from fish_speech import TextToSpeech

tts = TextToSpeech()  # construct the synthesizer
audio = tts.synthesize("Hello, this is a test of Fish Speech synthesis.")  # generate audio from text
audio.save("output.wav")  # write the waveform to disk
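Building on the basic usage above, the following hypothetical sketch shows how reference-based voice cloning and emotion markers might be expressed through the same TextToSpeech-style wrapper; the reference_audio argument and the inline "(excited)" marker syntax are assumptions made for illustration, not the project's documented API.

from fish_speech import TextToSpeech

tts = TextToSpeech()

# Hypothetical: supply a short reference clip so the output mimics that speaker.
cloned = tts.synthesize(
    "Thanks for calling, how can I help you today?",
    reference_audio="speaker_sample.wav",  # assumed parameter name, for illustration only
)
cloned.save("cloned_voice.wav")

# Hypothetical: inline emotional markers steering prosody and tone.
expressive = tts.synthesize("(excited) We just shipped the new release!")
expressive.save("expressive.wav")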
Releases

v1.5.1
Final stable release before the next model version; no breaking changes or migration steps documented.
- Pin to v1.5.1 if you need stability before the upcoming model architecture changes.
- Release notes do not specify bug fixes, feature additions, or compatibility requirements.
v1.5.0
Fish Speech 1.5 completes both the inference and fine-tuning pipelines; release notes do not specify breaking changes or upgrade requirements.
- Verify that inference and fine-tuning workflows function as expected in your environment after upgrading to v1.5.0.
- Consult the repository documentation for API changes or new dependencies, as the release notes omit migration details.
v1.4.3
Final stable release in the 1.4 series before major version 1.5; pin dependencies now if you need stability.
- Pin to v1.4.3 in production to avoid breaking changes expected in the upcoming 1.5 release.
- Release notes do not specify breaking changes, new requirements, or migration steps for this version.
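If you do pin a version as suggested above, one option (assuming you install directly from the project's GitHub repository and that its release tags match the version numbers listed here) is to reference the tag explicitly in a pip requirements file:

git+https://github.com/fishaudio/fish-speech.git@v1.4.3  # or @v1.5.1 once you are ready to upgrade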
Related Repositories
LightRAG
Graph-based retrieval framework for structured RAG reasoning.
AI-Trader
LLM agent benchmarking framework for autonomous market trading.
whisper.cpp
Lightweight, cross-platform speech recognition engine delivering real-time transcription with minimal dependencies and optimized inference performance.
nanoGPT
Minimal PyTorch implementation for training GPT models.
openvino
Convert and deploy deep learning models across Intel hardware.