AI SummaryParse PDF, Office (Word/PPT/Excel), and image files into clean Markdown — with LaTeX formulas, tables, images, and OCR. One zero-dependency script, two backends, No , no API key. The free Agent API handles files ≤ 10 MB / ≤ 20 pages.
Install
Copy this and paste it into Claude Code, Cursor, or any AI assistant:
I want to install the "mineru" skill in my project. Please run this command in my terminal: # Install skill into your project mkdir -p .claude/skills/MinerU-Skill && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/MinerU-Skill/SKILL.md "https://raw.githubusercontent.com/Nebutra/MinerU-Skill/main/SKILL.md" Then restart Claude Code (or reload the window in Cursor) so the skill is picked up.
Description
An AI-Native skill for parsing PDF / Office / image files into clean Markdown with MinerU — a fast, zero-config document parser for AI agents. Works with NO token via the lightweight Agent API and auto-upgrades to the Standard API (token) for large files, batches, and DOCX/HTML/LaTeX export. Use when: (1) Converting PDF/Word/PPT/Excel/image to Markdown, (2) Extracting text, tables, formulas, or running OCR on scanned docs, (3) Batch-parsing a folder in parallel, (4) Piping parsed Markdown straight back to an agent or into Obsidian.
Deliver into your tools (`--to`)
Parse once and push the Markdown into content tools via each one's official path: `bash python3 scripts/mineru.py paper.pdf --to obsidian --to notion --to feishu ` Targets: obsidian logseq siyuan notion linear yuque coda slack feishu confluence onenote ticktick dingtalk airtable wecom (all zero-dependency), plus roam and wps via optional extras. Each reads its config from env vars (run --list-sinks). Per-target auth, fidelity, and image notes: references/integrations.md.
MinerU PDF Parser
Parse PDF, Office (Word/PPT/Excel), and image files into clean Markdown — with LaTeX formulas, tables, images, and OCR. One zero-dependency script, two backends, automatic routing.
Parse a local file or URL — the Agent API needs no login
python3 scripts/mineru.py paper.pdf
Pipe the Markdown straight back to an agent
python3 scripts/mineru.py paper.pdf --stdout
Discussion
Health Signals
My Fox Den
Community Rating
Sign in to rate this booster