返回顶部
v

vision-analysis

>

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.0.0
安全检测
已通过
158
下载量
0
收藏
概述
安装方式
版本历史

vision-analysis

# Vision Analysis Analyze images using the MiniMax `MiniMax_understand_image` MCP tool available in the MiniMax Token Plan. ## Prerequisites - MiniMax Token Plan subscription with valid `MINIMAX_API_KEY` - MiniMax MCP configured (`MiniMax_understand_image` tool available) ### If MCP tool is not configured **Step 1:** The agent should fetch setup instructions from: **https://platform.minimaxi.com/docs/token-plan/mcp-guide** **Step 2:** Detect the user's environment (OpenCode, Cursor, Claude Code, etc.) and output the exact commands needed. Common examples: **OpenCode** — add to `~/.config/opencode/opencode.json` or `package.json`: ```json { "mcp": { "MiniMax": { "type": "local", "command": ["uvx", "minimax-coding-plan-mcp", "-y"], "environment": { "MINIMAX_API_KEY": "YOUR_TOKEN_PLAN_KEY", "MINIMAX_API_HOST": "https://api.minimaxi.com" }, "enabled": true } } } ``` **Claude Code**: ```bash claude mcp add -s user MiniMax --env MINIMAX_API_KEY=your-key --env MINIMAX_API_HOST=https://api.minimaxi.com -- uvx minimax-coding-plan-mcp -y ``` **Cursor** — add to MCP settings: ```json { "mcpServers": { "MiniMax": { "command": "uvx", "args": ["minimax-coding-plan-mcp"], "env": { "MINIMAX_API_KEY": "your-key", "MINIMAX_API_HOST": "https://api.minimaxi.com" } } } } ``` **Step 3:** After configuration, tell the user to restart their app and verify with `/mcp`. **Important:** If the user does not have a MiniMax Token Plan subscription, inform them that the `understand_image` tool requires one — it cannot be used with free or other tier API keys. ## Analysis Modes | Mode | When to use | Prompt strategy | |---|---|---| | `describe` | General image understanding | Ask for detailed description | | `ocr` | Text extraction from screenshots, documents | Ask to extract all text verbatim | | `ui-review` | UI mockups, wireframes, design files | Ask for design critique with suggestions | | `chart-data` | Charts, graphs, data visualizations | Ask to extract data points and trends | | `object-detect` | Identify objects, people, activities | Ask to list and locate all elements | ## Workflow ### Step 1: Auto-detect image The skill triggers automatically when a message contains an image file path or URL with extensions: `.jpg`, `.jpeg`, `.png`, `.gif`, `.webp`, `.bmp`, `.svg` Extract the image path from the message. ### Step 2: Select analysis mode and call MCP tool Use the `MiniMax_understand_image` tool with a mode-specific prompt: **describe:** ``` Provide a detailed description of this image. Include: main subject, setting/background, colors/style, any text visible, notable objects, and overall composition. ``` **ocr:** ``` Extract all text visible in this image verbatim. Preserve structure and formatting (headers, lists, columns). If no text is found, say so. ``` **ui-review:** ``` You are a UI/UX design reviewer. Analyze this interface mockup or design. Provide: (1) Strengths — what works well, (2) Issues — usability or design problems, (3) Specific, actionable suggestions for improvement. Be constructive and detailed. ``` **chart-data:** ``` Extract all data from this chart or graph. List: chart title, axis labels, all data points/series with values if readable, and a brief summary of the trend. ``` **object-detect:** ``` List all distinct objects, people, and activities you can identify. For each, describe what it is and its approximate location in the image. ``` ### Step 3: Present results Return the analysis clearly. For `describe`, use readable prose. For `ocr`, preserve structure. For `ui-review`, use a structured critique format. ## Output Format Example For describe mode: ``` ## Image Description [Detailed description of the image contents...] ``` For ocr mode: ``` ## Extracted Text [Preserved text structure from the image] ``` For ui-review mode: ``` ## UI Design Review ### Strengths - ... ### Issues - ... ### Suggestions - ... ``` ## Notes - Images up to 20MB supported (JPEG, PNG, GIF, WebP) - Local file paths work if MiniMax MCP is configured with file access - The `MiniMax_understand_image` tool is provided by the `minimax-coding-plan-mcp` package

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 minimax-vision-analysis-1775911212 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 minimax-vision-analysis-1775911212 技能

通过命令行安装

skillhub install minimax-vision-analysis-1775911212

下载 Zip 包

⬇ 下载 vision-analysis v1.0.0

文件大小: 2.98 KB | 发布时间: 2026-4-12 10:36

v1.0.0 最新 2026-4-12 10:36
Initial release of vision-analysis skill.

- Enables image analysis via the MiniMax_understand_image MCP tool for general description, OCR, UI review, chart data extraction, and object detection modes.
- Automatically triggers on image file uploads/links or relevant analysis requests.
- Includes guidance for MiniMax MCP setup across OpenCode, Cursor, and Claude Code environments.
- Provides mode-specific prompt strategies and structured output formats for clear results.
- Supports common image formats and local file paths (with appropriate MCP configuration).

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部