返回顶部
s

speech-to-text

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation, multi-language, timestamps. Use for: meeting transcription, subtitles, podcast transcripts, voice notes. Triggers: speech to text, transcription, whisper, audio to text, transcribe audio, voice to text, stt, automatic transcription, subtitles generation, transcribe meeting, audio transcription, whisper ai

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 0.1.5
安全检测
已通过
2,546
下载量
0
收藏
概述
安装方式
版本历史

speech-to-text

# Speech-to-Text Transcribe audio to text via [inference.sh](https://inference.sh) CLI. ![Speech-to-Text](https://cloud.inference.sh/u/4mg21r6ta37mpaz6ktzwtt8krr/01jz025e88nkvw55at1rqtj5t8.png) ## Quick Start ```bash curl -fsSL https://cli.inference.sh | sh && infsh login infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://audio.mp3"}' ``` > **Install note:** The [install script](https://cli.inference.sh) only detects your OS/architecture, downloads the matching binary from `dist.inference.sh`, and verifies its SHA-256 checksum. No elevated permissions or background processes. [Manual install & verification](https://dist.inference.sh/cli/checksums.txt) available. ## Available Models | Model | App ID | Best For | |-------|--------|----------| | Fast Whisper V3 | `infsh/fast-whisper-large-v3` | Fast transcription | | Whisper V3 Large | `infsh/whisper-v3-large` | Highest accuracy | ## Examples ### Basic Transcription ```bash infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://meeting.mp3"}' ``` ### With Timestamps ```bash infsh app sample infsh/fast-whisper-large-v3 --save input.json # { # "audio_url": "https://podcast.mp3", # "timestamps": true # } infsh app run infsh/fast-whisper-large-v3 --input input.json ``` ### Translation (to English) ```bash infsh app run infsh/whisper-v3-large --input '{ "audio_url": "https://french-audio.mp3", "task": "translate" }' ``` ### From Video ```bash # Extract audio from video first infsh app run infsh/video-audio-extractor --input '{"video_url": "https://video.mp4"}' > audio.json # Transcribe the extracted audio infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "<audio-url>"}' ``` ## Workflow: Video Subtitles ```bash # 1. Transcribe video audio infsh app run infsh/fast-whisper-large-v3 --input '{ "audio_url": "https://video.mp4", "timestamps": true }' > transcript.json # 2. Use transcript for captions infsh app run infsh/caption-videos --input '{ "video_url": "https://video.mp4", "captions": "<transcript-from-step-1>" }' ``` ## Supported Languages Whisper supports 99+ languages including: English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and many more. ## Use Cases - **Meetings**: Transcribe recordings - **Podcasts**: Generate transcripts - **Subtitles**: Create captions for videos - **Voice Notes**: Convert to searchable text - **Interviews**: Transcription for research - **Accessibility**: Make audio content accessible ## Output Format Returns JSON with: - `text`: Full transcription - `segments`: Timestamped segments (if requested) - `language`: Detected language ## Related Skills ```bash # Full platform skill (all 150+ apps) npx skills add inference-sh/skills@inference-sh # Text-to-speech (reverse direction) npx skills add inference-sh/skills@text-to-speech # Video generation (add captions) npx skills add inference-sh/skills@ai-video-generation # AI avatars (lipsync with transcripts) npx skills add inference-sh/skills@ai-avatar-video ``` Browse all audio apps: `infsh app list --category audio` ## Documentation - [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI - [Audio Transcription Example](https://inference.sh/docs/examples/audio-transcription) - Complete transcription guide - [Apps Overview](https://inference.sh/docs/apps/overview) - Understanding the app ecosystem

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 speech-to-text-1776159601 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 speech-to-text-1776159601 技能

通过命令行安装

skillhub install speech-to-text-1776159601

下载 Zip 包

⬇ 下载 speech-to-text v0.1.5

文件大小: 2.2 KB | 发布时间: 2026-4-15 10:31

v0.1.5 最新 2026-4-15 10:31
- Updated documentation for clear setup instructions using inference.sh CLI.
- Detailed available Whisper model options, usage examples, and input formats.
- Added new sections on extracting audio from video, translation, and video subtitle workflows.
- Enhanced guidance for supported languages and output structure.
- Improved 'Related Skills' for easy access to complementary AI tools.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部