# text-to-speech

このトピックのトレンドリポジトリ（5件）

AIモデルの実行も学習もブラウザ画面ひとつで完結！最大2倍速・VRAM70%削減の万能ツール — unsloth

Unslothは、Qwen、DeepSeek、Gemma、LlamaなどのオープンソースAIモデルを自分のパソコンで動かしたり、追加学習（ファインチューニング）したりできる統合ツールです。ブラウザから操作できるWeb画面（Unsloth S

agentdeepseekdeepseek-r1fine-tuninggemmagemma3gpt-ossllamallama3llmllmsmistralopenaiqwenqwen3reinforcement-learningtext-to-speechttsunslothvoice-cloning

OpenBMB/VoxCPM

OpenBMB/VoxCPMOtherPython

30.0k7回登場

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

audiodeeplearningminicpmmultilingualpythonpytorchspeechspeech-synthesistext-to-speechttstts-modelvoice-cloningvoice-designvoxcpm

calesthio/OpenMontage

calesthio/OpenMontageOtherPython

24.4k10回登場

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

agentagentic-aiaiclaudecopilotcursorelevenlabsffmpegfluximage-generationopen-sourceopenaipythonremotionstable-diffusiontext-to-speechtext-to-videovideo-generationvideo-production

supertone-inc/supertonic

supertone-inc/supertonicOtherSwift

8.0k6回登場

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

cppcsharpfluttergoiosjavalightweightmultilingualnodejson-deviceonnxonnxruntimepythonrustspeech-synthesisswifttext-to-speechttswebwebgpu

OpenMOSS/MOSS-TTS

OpenMOSS/MOSS-TTSOtherPython

2.7k2回登場

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

audioaudio-tokenizerllmmultimodaltext-to-speechvoice-cloning