返回顶部
b

browser-audio-capture

Capture audio from any browser tab — meetings, YouTube, podcasts, courses, webinars — and stream to any AI agent. Zero API keys, works with any framework.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.1.0
安全检测
已通过
425
下载量
0
收藏
概述
安装方式
版本历史

browser-audio-capture

# Browser Audio Capture **Give any AI agent ears for the browser.** One Chrome extension captures audio from any tab — meetings, YouTube, podcasts, webinars, courses, earnings calls — and streams it to your AI pipeline. ## Why Use This Your AI agent can't hear anything happening in your browser. This skill fixes that. Capture audio from any Chrome tab and stream it to your agent — no API keys, no OAuth, no per-platform integrations. **Use cases:** meeting summaries, YouTube/podcast notes, competitive intel from earnings calls, auto-notes from online courses, customer call analysis — anything that plays audio in a browser tab. **Works with any AI agent** — Claude, ChatGPT, OpenClaw, LangChain, CrewAI, or your own. If your agent can run shell commands or receive HTTP, it gets browser audio. ## Prerequisites Chrome with remote debugging: ```bash # macOS /Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome \ --remote-debugging-port=9222 --user-data-dir=$HOME/.chrome-debug-profile & ``` Python 3.9+ with aiohttp: `pip install aiohttp` ## Quick Start ### CLI (any agent that can exec) ```bash # List tabs — meetings flagged with 🎙️ python3 -m browser_capture.cli tabs # Auto-detect and capture meeting tab python3 -m browser_capture.cli capture # Continuous watch mode python3 -m browser_capture.cli watch --interval 15 # Stop python3 -m browser_capture.cli stop ``` ### Chrome Extension (one-click, persistent) 1. `chrome://extensions/` → Developer mode → Load unpacked → `scripts/extension/` 2. Join a meeting → click Percept icon → Start Capturing 3. Close popup — capture continues in background ## Supported Platforms Google Meet • Zoom (web) • Microsoft Teams • Webex • Whereby • Around • Cal.com • Riverside • StreamYard • Ping • Daily.co • Jitsi • Discord — plus any future platform that runs in a browser. ## Audio Output Streams to `http://127.0.0.1:8900/audio/browser` as JSON: ```json { "sessionId": "browser_1709234567890", "audio": "<base64 PCM16>", "sampleRate": 16000, "format": "pcm16", "tabUrl": "https://meet.google.com/...", "tabTitle": "Weekly Standup" } ``` Configure endpoint in `scripts/extension/offscreen.js` (`PERCEPT_URL`). Point it at Whisper, Deepgram, NVIDIA Riva, or any transcription service. ## Troubleshooting - **No tabs**: Chrome needs `--remote-debugging-port=9222` - **Button won't click**: Remove + re-add extension (MV3 caches aggressively) - **Audio not arriving**: Check receiver on port 8900. Extension sends to `/audio/browser`

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 browser-audio-capture-1776296438 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 browser-audio-capture-1776296438 技能

通过命令行安装

skillhub install browser-audio-capture-1776296438

下载 Zip 包

⬇ 下载 browser-audio-capture v1.1.0

文件大小: 14.74 KB | 发布时间: 2026-4-16 16:01

v1.1.0 最新 2026-4-16 16:01
- Expanded audio capture support from just meetings to any browser tab, including YouTube, podcasts, webinars, and courses.
- Updated description and documentation to highlight broader use cases beyond meetings.
- Revised "Why Use This" and "Use cases" sections for clarity and to reflect new capabilities.
- No code changes; documentation (SKILL.md) updated only.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部