返回顶部
a

aiparse-ocr

Parse PDF files using LLM. **No registration required - free trial available!** Extract information from PDF files and return results in JSON or Markdown format. Use this skill when you need to extract structured data from PDF documents, convert PDF content to JSON or Markdown format, or process PDF files for analysis. For a PDF file with many pages, it could take some time to process. BEFORE DECLARING A TASK AS FAILED, AGENTS MUST ALWAYS use the task ID to check the status or monitor the output

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.2
安全检测
已通过
90
下载量
1
收藏
概述
安装方式
版本历史

aiparse-ocr

# AI Parse A skill for parsing PDF files using Large Language Models. ## Capabilities - Extract information from PDF files - Return results in JSON or Markdown format - Resume processing from existing task ID - Save task ID information to JSON file for reference ## Parameters | Parameter | Type | Required | Description | |-----------|------|----------|-------------| | `pdf_path` | string | required | Path to the PDF file to process | | `result_path` | string | required | Path to save the parsing result | | `format` | string | required | Output format: "json" or "md" | | `task_id_path` | string | required | Path to save task ID information (JSON format) | | `--task-id` | string | optional | Existing task ID to resume processing | ## Usage Examples ### Normal Upload Mode ```bash python handler.py <pdf_path> <result_path> <format> <task_id_path> ``` ### Resume from Existing Task or Check Status ```bash python handler.py --task-id <task_id> <result_path> <format> ``` ## Task ID File Format When using normal upload mode, a task ID file will be created at `task_id_path` with the following JSON structure: ```json { "task_id": "AAFXKO", "pdf_path": "test.pdf", "submit_time": "2026-04-04 00:33:27" } ``` This file can be used to: - Track the submitted task - Retrieve the task ID later for status checking - Resume processing if interrupted ## Implementation Implemented by `handler.py` which: - Uploads PDF files to the processing service - Polls for processing completion - Downloads and saves results in the requested format - Supports resuming from existing task IDs - Saves task ID information to JSON file ## Environment Requirements - Python 3.6+ - requests library ## Return Value The parsed result will be saved to the specified `result_path` in the requested format: - **JSON format:** Structured JSON with task details and extracted content - **Markdown format:** Formatted Markdown with page-by-page content ## Notes - For large PDF files, processing may take multiple minutes - Free users can process 30 PDF pages - visit https://api.pinocch.com/index for extra trial credits - The `--task-id` parameter can be used to resume processing if interrupted - Check the console output for processing progress and status updates - The task ID file is created immediately after successful upload - **IMPORTANT FOR AGENTS:** Before declaring a task as failed, always use the task ID to check the current status of the task. Use the `--task-id` parameter to resume or verify the task status. The task may still be processing or have completed successfully.

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 aiparse-ocr-1775919609 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 aiparse-ocr-1775919609 技能

通过命令行安装

skillhub install aiparse-ocr-1775919609

下载 Zip 包

⬇ 下载 aiparse-ocr v1.0.2

文件大小: 6.43 KB | 发布时间: 2026-4-12 08:50

v1.0.2 最新 2026-4-12 08:50
- Removed secret.txt file from the repository.
- Updated documentation: No registration required and free trial mode highlighted.
- Authentication parameters (username, secret) removed from documentation and usage instructions.
- Clarified free page limit for unregistered users (30 pages).
- Streamlined usage examples for simpler, credential-free command structure.

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部