8 boosters for "ocr" — AI-graded, open source, ready to install
Heuristic scoring (no AI key configured).
A comprehensive developer tools hub that guides developers through debugging, code review, refactoring, git operations, and quality checks using Socratic questioning and specialized agents. Ideal for developers seeking structured, educational support across the entire development workflow.
A specialized document classification agent that automatically analyzes OCR-extracted documents and categorizes them into predefined administrative document types, helping users streamline document submission and reduce back-and-forth communication with administrative staff.
A Cursor rules booster that enables developers to build iOS apps for translating burned-in video subtitles using Vision OCR and Apple's translation API. Useful for developers working with multilingual video content and social media platforms like RedNote.
This document explains how to use the AI agent features in Caption Extractor, which enhance OCR processing with visual LLM models using local Ollama.
This MCP server enables extraction of text from documents, PDF manipulation, and OCR on images, making it useful for developers building document processing workflows in Claude Desktop, Claude Code, and Cursor.
A specialized PDF processing agent that handles text extraction, metadata retrieval, and OCR operations using unpdf and PaddleOCR technologies. Ideal for developers building document processing pipelines, data extraction workflows, and document management systems.