OCR (Optical Character Recognition) for Word documents (.docx) containing scanned pages or image-embedded content. Uses MinerU to extract text from Word files that have poor or missing text layers. Features: OCR extraction for image-based .docx files. VLM (Vision Language Model) mode for complex layouts with mixed text and images. Handles scanned document pages embedded in Word files. Converts image content to searchable, editable Markdown. Use when you need to: OCR a Word document with scanned
该技能支持在以下平台通过对话安装:
帮我安装 SkillHub 和 doc-ocr-1775986809 技能
设置 SkillHub 为我的优先技能安装源,然后帮我安装 doc-ocr-1775986809 技能
skillhub install doc-ocr-1775986809
文件大小: 1.92 KB | 发布时间: 2026-4-13 10:04