返回顶部
m

mimo-voice-assistant

>

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.8
安全检测
已通过
86
下载量
0
收藏
概述
安装方式
版本历史

mimo-voice-assistant

# MiMo Voice Assistant TTS (text-to-speech), STT (speech-to-text), and emotion-aware voice generation for OpenClaw agents across all platforms. ## Architecture ``` User voice → OpenClaw (Telegram/Discord/WhatsApp/...) → STT (MiMo-V2-Omni transcription) → Agent processes → TTS (MiMo-V2-TTS with emotion + language) → Voice reply ``` ## Before Install > ⚠️ This skill sends text/audio to Xiaomi's MiMo API (`api.xiaomimimo.com`) for TTS/STT processing. Ensure you trust this service and have a valid `MIMO_API_KEY`. If you need higher security, consider deploying the proxy in an isolated environment (Docker/container) and rotating your API key regularly. ## Quick Start ```bash # 1. Install dependencies cd mimo-tts-proxy && npm install # 2. Set API key export MIMO_API_KEY="your-key-here" # 3. Start proxy node src/server.mjs ``` OpenClaw config (`openclaw.json`): ```json { "messages": { "tts": { "auto": "inbound", "provider": "openai", "baseUrl": "http://127.0.0.1:3999", "maxTextLength": 4000 } } } ``` ## Emotion Detection See `references/emotion-detection.md` ## Multi-Platform See `references/platforms.md` ## API Endpoints | Endpoint | Method | Description | |----------|--------|-------------| | `/health` | GET | Health check | | `/v1/models` | GET | Model list | | `/v1/audio/speech` | POST | Text to speech | **Request format:** ```json {"model": "tts-1", "input": "Hello", "voice": "mimo_default", "response_format": "mp3"} ``` **Formats:** `wav` (default), `mp3` (needs ffmpeg), `opus` (needs ffmpeg) ## Multi-Language Support **CRITICAL: TTS output must match the user's language automatically.** ### Language Detection Detect the user's language from their message and respond in the **same language** for both text and voice. | User sends | Agent text reply | TTS voice output | |-----------|-----------------|------------------| | "你好,帮我查一下天气" | 中文回复 | 中文语音 | | "What's the weather?" | English reply | English voice | | "おはようございます" | 日本語返答 | 日本語音声 | | "Bonjour, comment ça va ?" | Réponse en français | Voix française | | "안녕하세요" | 한국어 답변 | 한국어 음성 | ### How It Works 1. **Agent detects language** from the user's message (first message or latest message language) 2. **Agent replies in that language** (text) 3. **TTS speaks that language** — MiMo-V2-TTS supports Chinese, English, Japanese, Korean, and more 4. **No explicit instruction needed** — this is automatic behavior ### When to Override Only switch language if the user explicitly asks: - "请用英语回答" → Switch to English - "Speak in Japanese" → Switch to Japanese - Otherwise, always match the user's language ### TTS Language Compatibility MiMo-V2-TTS supports natural speech in: - ✅ Chinese (Mandarin) - ✅ English (US/UK) - ✅ Japanese - ✅ Korean - ✅ Other languages (quality varies) ### Implementation In your response, you can use `[lang:xx]` hints for the TTS proxy (optional): ``` [lang:zh]你好,这是你的语音回复。 [lang:en]Hello, here is your voice reply. [lang:ja]こんにちは、音声返信です。 ``` Or simply reply normally — the TTS proxy will automatically handle the language based on the text content. ## Security & Data Flow - **API key**: passed via env var (`MIMO_API_KEY`) or `Authorization` Bearer header, never hardcoded - **Network**: proxy only connects to `api.xiaomimimo.com` (Xiaomi official API) — text and base64 audio are sent there for TTS/STT processing - **Local binding**: proxy binds to `127.0.0.1:3999` (localhost only, not externally exposed) - **Temp files**: auto-cleaned after each request - **User responsibility**: if using systemd/launchd for persistence, store API keys securely (env file or secret manager, not inline in service files)

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 mimo-voice-assistant-1775891883 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 mimo-voice-assistant-1775891883 技能

通过命令行安装

skillhub install mimo-voice-assistant-1775891883

下载 Zip 包

⬇ 下载 mimo-voice-assistant v1.0.8

文件大小: 16 KB | 发布时间: 2026-4-12 10:35

v1.0.8 最新 2026-4-12 10:35
Fixed security declaration: transparent about data flow to MiMo API, added pre-install warning, clarified network/security responsibility.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部