返回顶部
o

ocr_document

OCR document extraction - extract text from scanned documents, photos, and images using OCR. Use when reading scanned PDFs, photographed pages, handwritten notes, or any document that needs optical character recognition.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.0
安全检测
已通过
269
下载量
0
收藏
概述
安装方式
版本历史

ocr_document

# OCR Document - Extract Text from Scanned Documents and Images Extract text from scanned documents and images using OCR via MinerU Open API. No API key required. ## Quick Start ```bash # OCR a scanned PDF mineru-open-api flash-extract scanned.pdf # OCR an image of a document mineru-open-api flash-extract page-photo.jpg # OCR from URL (no download needed) mineru-open-api flash-extract https://example.com/scanned.pdf # Specify language for better accuracy mineru-open-api flash-extract scanned.pdf --language en # Save OCR result to file mineru-open-api flash-extract scanned.pdf -o ./output/ ``` ## Language Rule You MUST reply to the user in the SAME language they use. This is non-negotiable. ## Capabilities - OCR for scanned PDFs, photographed documents, images - Supports PDF, PNG, JPG, WebP, BMP, TIFF - Supports both local files and URLs directly - Language hint with `--language` (default: `ch`, use `en` for English) - No API key, no signup, no authentication - Max 10MB / 20 pages per document ## When to Use - User asks to "OCR" a document or image - User has a scanned PDF that needs text extraction - User shares a photo of a page and wants the text - User mentions "scan", "handwriting", or "recognize text" ## CLI Reference Run `mineru-open-api flash-extract --help` for all available options. ## Data Privacy - `flash-extract` uploads the document to MinerU's cloud API for processing and returns the result. No account or API key is required. - Documents are processed in real-time and are not stored after extraction. - For details, see https://mineru.net ## Notes - Best results with clear, high-resolution scans - For higher precision OCR with full layout preservation, use `mineru-open-api extract --ocr` (requires auth via `mineru-open-api auth`) - If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 ocr-document-1776032494 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 ocr-document-1776032494 技能

通过命令行安装

skillhub install ocr-document-1776032494

下载 Zip 包

⬇ 下载 ocr_document v1.0.0

文件大小: 1.78 KB | 发布时间: 2026-4-13 11:15

v1.0.0 最新 2026-4-13 11:15
- Initial release of ocr_document skill for OCR text extraction from scanned documents, images, and handwritten notes.
- Supports PDF, PNG, JPG, WebP, BMP, and TIFF formats from local files or URLs.
- No API key, signup, or authentication required.
- Language selection available for improved accuracy; replies always match the user's language.
- Maximum file size is 10MB or 20 pages per document.
- Powered by the MinerU Open API CLI; installation guides provided for npm, uv, go, and direct download.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部