13 boosters for "ocr" — open source, verified from GitHub, ready to install
A PDF processing skill that enables users to read, extract, merge, split, and manipulate PDF files through an AI coding assistant. Developers and business users benefit from automated PDF workflows without writing complex code.
This booster enables Claude Code to process, convert, and extract data from documents (PDF, DOCX, XLSX, PPTX, HTML, images) using the Nutrient DWS API, including OCR, editing, signing, and form-filling capabilities. Developers building document automation workflows benefit from seamless integration with multiple file formats.
mcp-markdownify-server: MCP Markdownify Server, a Model Context Protocol server for converting almost anything to Markdown. Author: @zcaceres (@zachcaceres | zach.dev).
ZAI CLI is a command-line tool that brings AI-powered capabilities—including vision analysis, web search, page reading, and GitHub exploration—directly to developers and AI coding assistants. It's useful for developers who need programmatic access to vision, search, and code discovery features.
This MCP server integrates Nutrient's DWS Processor API with Claude, enabling AI agents to perform advanced document processing tasks including PDF handling, OCR, digital signatures, and redaction. Developers building document automation workflows and AI-powered applications benefit from integration through the Model Context Protocol.
Give Claude eyes and hands: screen capture and interaction for full-auto workflows. Kyle Zobell (https://gitlab.com/3spky5u).
swift-study-skills: Swift/iOS learning skills covering Socratic tutoring, adaptive quizzes, and note-taking. Keywords: swift, ios, learning, quiz, study.
Memento helps users search their computer history—screenshots, keystrokes, and OCR text—to recall forgotten information and context from past activity. Developers and knowledge workers benefit from quick retrieval of previously seen or typed information.
This MCP server enables extraction of text from documents, PDF manipulation, and OCR on images, making it useful for developers building document processing workflows in Claude Desktop, Claude Code, and Cursor.
A practical integration guide for enhancing OCR text extraction in Caption Extractor with vision and language LLMs run locally through Ollama. Developers working on document processing, image analysis, and text correction workflows benefit from this reusable agent framework.