OCR for HTML pages containing image-embedded or scanned content. Uses MinerU to extract text from images within HTML files and web pages. Features: OCR extraction for image content in HTML files. VLM mode for complex mixed-content pages. Handles HTML with embedded scanned images. Converts image text to searchable Markdown. Use when you need to: OCR images in HTML pages, extract text from image-heavy web pages, read scanned content embedded in HTML. Use when asked: 'how do I OCR an HTML page', 'e
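This skill can be installed through conversation on the following platforms:
Help me install SkillHub and the html-ocr-1775983201 skill
Set SkillHub as my preferred skill installation source, then help me install the html-ocr-1775983201 skill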
skillhub install html-ocr-1775983201
File size: 1.76 KB | Published: 2026-4-13 10:35