返回顶部
w

whisper-piper-voice

Set up and run a local voice pipeline combining Whisper STT (speech-to-text) and Piper TTS (text-to-speech) as a single HTTP server. Use when asked to set up voice capabilities, transcribe audio, generate speech, configure STT/TTS, or build a voice assistant pipeline. Handles both directions — audio-to-text and text-to-audio — on a single port. Runs fully offline on CPU or GPU (NVIDIA CUDA). NOT for cloud-based TTS (ElevenLabs, Google TTS) — this is 100% local and free.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.0
安全检测
已通过
70
下载量
0
收藏
概述
安装方式
版本历史

whisper-piper-voice

# Whisper + Piper Voice Pipeline Local STT (speech-to-text) and TTS (text-to-speech) as a single HTTP server. Zero cloud dependencies. ## Architecture ``` Audio In → POST /transcribe → Whisper (faster-whisper) → JSON {text, language} Text In → POST /speak → Piper TTS → ffmpeg → audio/ogg (Opus) ``` Both endpoints run in one Python process on one port (default: 9998). ## Quick Start 1. Install dependencies: ```bash python3 -m venv ~/whisper-env && source ~/whisper-env/bin/activate pip install faster-whisper apt install ffmpeg # or brew install ffmpeg on macOS ``` 2. Download Piper + a voice: ```bash mkdir -p ~/piper && cd ~/piper wget https://github.com/rhasspy/piper/releases/latest/download/piper_linux_x86_64.tar.gz tar xzf piper_linux_x86_64.tar.gz mkdir voices && cd voices wget https://huggingface.co/rhasspy/piper-voices/resolve/main/de/de_DE/thorsten_emotional/medium/de_DE-thorsten_emotional-medium.onnx wget https://huggingface.co/rhasspy/piper-voices/resolve/main/de/de_DE/thorsten_emotional/medium/de_DE-thorsten_emotional-medium.onnx.json ``` 3. Run the server (`scripts/voice-server.py`): ```bash python3 voice-server.py --port 9998 \ --whisper-model small --whisper-device cpu \ --piper-bin ~/piper/piper/piper \ --piper-model ~/piper/voices/de_DE-thorsten_emotional-medium.onnx ``` ## API **Transcribe** (audio → text): ```bash curl -X POST -F "file=@message.ogg" http://HOST:9998/transcribe # {"text": "Hallo Welt", "language": "de"} ``` **Speak** (text → audio): ```bash curl -X POST -H "Content-Type: application/json" \ -d '{"text": "Hallo Welt", "speaker": "4"}' \ http://HOST:9998/speak -o response.ogg ``` ## Configuration | Flag | Default | Description | |------|---------|-------------| | `--port` | 9998 | Server port | | `--whisper-model` | small | tiny/base/small/medium/large-v3 | | `--whisper-device` | cpu | cpu or cuda | | `--piper-bin` | (required) | Path to piper binary | | `--piper-model` | (required) | Path to .onnx voice file | | `--piper-speaker` | 4 | Speaker ID (multi-speaker models) | | `--speed` | 0.9 | TTS speed (lower = faster) | ## Choosing Models **Whisper:** `small` for CPU (good balance), `medium` for GPU (best quality without large-v3 overhead). **Piper voices:** Browse https://rhasspy.github.io/piper-samples/ — download .onnx + .onnx.json files. ## Full Setup Guide Read `references/setup-guide.md` for systemd service config, all voice options, model comparison table, and OpenClaw integration details.

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 whisper-piper-voice-1776051073 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 whisper-piper-voice-1776051073 技能

通过命令行安装

skillhub install whisper-piper-voice-1776051073

下载 Zip 包

⬇ 下载 whisper-piper-voice v1.0.0

文件大小: 5.45 KB | 发布时间: 2026-4-14 11:13

v1.0.0 最新 2026-4-14 11:13
Initial release: Combined Whisper STT + Piper TTS HTTP server. Single-port voice pipeline, runs fully local on CPU or GPU. Includes voice-server.py script, setup guide, and model recommendations.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部