2 boosters for "mineru" — open source, verified from GitHub, ready to install
Parse PDF, Office (Word/PPT/Excel), and image files into clean Markdown, with LaTeX formulas, tables, images, and OCR. One zero-dependency script, two backends, no API key. The free Agent API handles files ≤ 10 MB / ≤ 20 pages.
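As a rough illustration of the free-tier limits quoted above, a pre-flight check might look like the sketch below. The function and constant names are hypothetical and not part of the booster, and it assumes "10 MB" means 10 × 1024 × 1024 bytes and that the caller already knows the page count.

```python
# Hypothetical pre-flight check; names are illustrative, not the booster's API.
FREE_MAX_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" read as 10 MiB
FREE_MAX_PAGES = 20                # page limit quoted in the description


def fits_free_tier(size_bytes: int, page_count: int) -> bool:
    """Return True if a file is within both quoted free-tier limits."""
    return size_bytes <= FREE_MAX_BYTES and page_count <= FREE_MAX_PAGES


if __name__ == "__main__":
    # A 4 MiB, 12-page file is within both limits.
    print(fits_free_tier(4 * 1024 * 1024, 12))
    # A 25-page file exceeds the page limit regardless of size.
    print(fits_free_tier(1024, 25))
```

Files that fail this check would presumably need the other backend or splitting into smaller parts first; the listing does not say which.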
Turn a folder of raw files into a Markdown vault that an LLM can grep, and then answer questions over that vault responsibly. source file, carrying retrieval frontmatter (abstract / tags / synonyms) + a