llms-txt-sniffer
# llms-txt-sniffer: The Smart Document Radar
This skill streamlines documentation ingestion by locating the most AI-optimized version of a site's content.
## 🧠 Why llms.txt?
It provides a high-density, Markdown-based index designed for LLMs to map entire sites instantly and save tokens.
## 🚀 Discovery Strategy (Two-Stage)
### Stage 1: Quick Jump Probes (Instructional)
1. **URL + /llms.txt**: Probe `{input_url}/llms.txt` using `curl -I`.
2. **Domain Root**: Probe `https://{domain}/llms.txt` using `curl -I`.
### Stage 2: Advanced Sniffing (Tool-based)
If Stage 1 fails, run the companion sniffer script located in this skill's directory:
`python3 sniffer.py $ARGUMENTS`
## 📜 Behavioral Rules
- **User-Initiated Only**: Only invoke this skill when the user explicitly provides a documentation URL. Do not autonomously scan domains.
- **Switch to High-Speed Mode**: Once an index is found, prioritize its links over manual scraping.
- **Index Summary**: Always present a brief structure overview.
- **Fallback**: Use `sitemap.xml` parser results if `llms.txt` is missing.
标签
skill
ai