Skip to content
Skill

mineru

by Nebutra

AI Summary

Parse PDF, Office (Word/PPT/Excel), and image files into clean Markdown — with LaTeX formulas, tables, images, and OCR. One zero-dependency script, two backends, No , no API key. The free Agent API handles files ≤ 10 MB / ≤ 20 pages.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to install the "mineru" skill in my project.

Please run this command in my terminal:
# Install skill into your project
mkdir -p .claude/skills/MinerU-Skill && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/MinerU-Skill/SKILL.md "https://raw.githubusercontent.com/Nebutra/MinerU-Skill/main/SKILL.md"

Then restart Claude Code (or reload the window in Cursor) so the skill is picked up.

Description

An AI-Native skill for parsing PDF / Office / image files into clean Markdown with MinerU — a fast, zero-config document parser for AI agents. Works with NO token via the lightweight Agent API and auto-upgrades to the Standard API (token) for large files, batches, and DOCX/HTML/LaTeX export. Use when: (1) Converting PDF/Word/PPT/Excel/image to Markdown, (2) Extracting text, tables, formulas, or running OCR on scanned docs, (3) Batch-parsing a folder in parallel, (4) Piping parsed Markdown straight back to an agent or into Obsidian.

Deliver into your tools (`--to`)

Parse once and push the Markdown into content tools via each one's official path: `bash python3 scripts/mineru.py paper.pdf --to obsidian --to notion --to feishu ` Targets: obsidian logseq siyuan notion linear yuque coda slack feishu confluence onenote ticktick dingtalk airtable wecom (all zero-dependency), plus roam and wps via optional extras. Each reads its config from env vars (run --list-sinks). Per-target auth, fidelity, and image notes: references/integrations.md.

MinerU PDF Parser

Parse PDF, Office (Word/PPT/Excel), and image files into clean Markdown — with LaTeX formulas, tables, images, and OCR. One zero-dependency script, two backends, automatic routing.

Parse a local file or URL — the Agent API needs no login

python3 scripts/mineru.py paper.pdf

Pipe the Markdown straight back to an agent

python3 scripts/mineru.py paper.pdf --stdout

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted Today
Active
AdoptionUnder 100 stars
62 ★ · Niche
DocsREADME + description
Well-documented

GitHub Signals

Stars62
Forks2
Issues3
UpdatedToday
View on GitHub
MIT License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Claude Code
Cursor