返回顶部
o

ocr-pro

Professional-grade OCR for PDFs and images using MinerU. Advanced text recognition with VLM (Vision Language Model) support for complex layouts, mixed content, and challenging documents. Features: high-accuracy OCR for PDFs and images (.png, .jpg, .jpeg, .webp). VLM mode for complex visual layouts with mixed text, tables, and figures. Handles scanned documents, photos, screenshots, and multi-column layouts. Multiple output formats. Use when you need to: OCR a document with high accuracy, extract

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 0.4.0
安全检测
已通过
124
下载量
0
收藏
概述
安装方式
版本历史

ocr-pro

# Ocr Pro Convert and extract content from .pdf / images (.png/.jpg/.jpeg/.jp2/.webp/.gif/.bmp) using MinerU (`mineru-open-api`). ## Install ```bash npm install -g mineru-open-api # or via Go (macOS/Linux): go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest ``` ## Quick Start ```bash # Extraction (requires token: mineru-open-api auth) mineru-open-api extract scanned.pdf -o ./out/ # From URL mineru-open-api extract https://example.com/scanned.pdf -o ./out/ # Specify language mineru-open-api extract scanned.pdf --language en -o ./out/ ``` ## Authentication Token required for `extract` and `crawl`: ```bash mineru-open-api auth # Interactive token setup export MINERU_TOKEN="your-token" # Or via environment variable ``` Create token at: https://mineru.net/apiManage/token ## Capabilities - Supports local files and URLs - Requires token (`mineru-open-api auth` or `MINERU_TOKEN` env) - Supported input: .pdf / images (.png/.jpg/.jpeg/.jp2/.webp/.gif/.bmp) - Language hint with `--language` (default: `ch`, use `en` for English) - Page range with `--pages` (where applicable) ## Notes - OCR is only available via `extract` with token. Use `--ocr` flag. For complex layouts use `--model vlm`. - Output goes to stdout by default; use `-o <dir>` to save to file - Binary formats (docx) require `-o` flag (cannot stream to stdout) - All progress/status messages go to stderr - MinerU is an open-source project by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 ocr-pro-1775899817 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 ocr-pro-1775899817 技能

通过命令行安装

skillhub install ocr-pro-1775899817

下载 Zip 包

⬇ 下载 ocr-pro v0.4.0

文件大小: 1.98 KB | 发布时间: 2026-4-12 10:45

v0.4.0 最新 2026-4-12 10:45
SEO: expand description for better ClawHub vector search discovery

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部