返回顶部
🇺🇸 English
🇨🇳 简体中文
🇨🇳 繁體中文
🇺🇸 English
🇯🇵 日本語
🇰🇷 한국어
🇫🇷 Français
🇩🇪 Deutsch
🇪🇸 Español
🇷🇺 Русский
s

step-asr

Transcribe audio files to text via Step ASR streaming API (HTTP SSE). Supports Chinese and English, multiple audio formats (PCM, WAV, MP3, OGG/OPUS), real-time streaming output, and terminology correction prompts.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.0
安全检测
已通过
310
下载量
免费
免费
1
收藏
概述
安装方式
版本历史

step-asr

# Step ASR - Streaming Speech-to-Text Transcribe audio files using the Step (StepFun) ASR API with HTTP SSE streaming. ## Quick start ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav ``` ## Usage examples Basic transcription (Chinese, streaming output): ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav ``` Specify language and save to file: ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.mp3 --language en --out /tmp/transcript.txt ``` Use a prompt for terminology correction: ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.pcm --prompt "Related terms: OpenClaw, StepFun, ASR" ``` Output as JSON (includes usage stats): ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.ogg --json ``` Non-streaming mode (only print final result): ```bash python3 {baseDir}/scripts/transcribe.py /path/to/audio.wav --no-stream ``` Specify audio format explicitly (for raw PCM files without extension): ```bash python3 {baseDir}/scripts/transcribe.py /path/to/raw_audio --format-type pcm --sample-rate 16000 ``` ## Supported audio formats | Format | Extensions | Notes | |--------|-----------|-------| | PCM | `.pcm`, `.raw` | Raw PCM, default codec `pcm_s16le` | | WAV | `.wav` | WAV container format | | MP3 | `.mp3` | | | OGG/OPUS | `.ogg`, `.opus` | | ## All options | Flag | Default | Description | |------|---------|-------------| | `--language` | `zh` | Language code (`zh` or `en`) | | `--model` | `step-asr` | ASR model name | | `--out` | *(stdout)* | Save transcription to file | | `--prompt` | *(none)* | Hint text to improve accuracy for domain-specific terms | | `--format-type` | *(auto)* | Audio format: `pcm`, `mp3`, `ogg` (auto-detected from extension) | | `--sample-rate` | `16000` | Audio sample rate in Hz | | `--no-stream` | `false` | Only print the final complete result | | `--json` | `false` | Output as JSON with usage statistics | | `--no-itn` | `false` | Disable inverse text normalization | | `--no-rerun` | `false` | Disable second-pass error correction | ## API key Set `STEPFUN_API_KEY` environment variable, or configure it in `~/.openclaw/openclaw.json`: ```json5 { skills: { "step-asr": { apiKey: "YOUR_STEPFUN_API_KEY" } } } ``` Get your API key from [Step Platform](https://platform.stepfun.com/).

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 step-asr-1776287449 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 step-asr-1776287449 技能

通过命令行安装

skillhub install step-asr-1776287449

下载

⬇ 下载 step-asr v1.0.0(免费)

文件大小: 4 KB | 发布时间: 2026-4-16 16:53

v1.0.0 最新 2026-4-16 16:53
Initial release of step-asr.

- Transcribe audio files to text using the Step ASR streaming API (HTTP SSE).
- Supports Chinese and English languages.
- Handles multiple audio formats: PCM, WAV, MP3, OGG/OPUS.
- Real-time streaming output and non-streaming (final result) modes.
- Allows terminology correction via prompt.
- Optional JSON output with usage stats.
- Requires STEPFUN_API_KEY environment variable for authentication.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部