30 boosters for "vision" — AI-graded, open source, ready to install
Expert guidance for developing, debugging, and optimizing Azure AI Document Intelligence applications, covering architecture, security, integrations, and deployment patterns. Essential for developers building document processing solutions on Azure.
Human MCP enables Claude coding agents to leverage human-like capabilities including vision, debugging, and multimodal interactions through the Model Context Protocol. Developers building AI coding assistants and agents will benefit from enhanced human-centered debugging and visual analysis features.
Dinox MCP Server enables Claude to understand images through advanced object detection, localization, and captioning by connecting to the DINO-X vision model. Developers building multimodal AI applications benefit from seamless integration of real-world visual perception into LLM workflows.
Ag Bridge is a LAN-only MCP server that provides agent supervision capabilities for Claude Desktop and Claude Code, enabling developers to monitor and control AI agent behavior in local environments.
Teaches developers to write JavaScript leveraging Brendan Eich's core design principles—first-class functions, prototypes, and dynamic typing—for more idiomatic and powerful code. Best for intermediate to advanced developers wanting to deepen their JavaScript fundamentals.
ZAI CLI is a command-line tool that brings AI-powered capabilities—including vision analysis, web search, page reading, and GitHub exploration—directly to developers and AI coding assistants. It's useful for developers who need programmatic access to vision, search, and code discovery features.
A strategic project leadership agent that interviews users, analyzes project requirements, and assembles specialized teams by recruiting other agents. Ideal for organizations using multi-agent systems who need automated project orchestration and team coordination.
Enables AI assistants to control and automate Android devices via ADB with real-time visual feedback, perfect for testing mobile apps, automating repetitive tasks, and building phone-based workflows.
This MCP server enables autonomous AI agents to reason more effectively through a dual-cycle metacognitive framework that combines fast intuitive reasoning with slower deliberative verification. It's designed for developers building sophisticated autonomous agents that need robust self-monitoring, loop detection, and belief revision capabilities.
This MCP server integrates the Contabo VPS API into Claude and Cursor, enabling programmatic cloud infrastructure provisioning and management directly through AI assistants. It benefits developers and DevOps engineers who want to automate VPS deployment and configuration workflows.
A Cursor-specific prompt containing detailed coding standards and project structure rules for building an OpenCV-based customer detection system for jewelry stores using YOLOv8. Developers working on computer vision projects in Cursor will benefit from these standardized guidelines.
A Cursor rules booster that enables developers to build iOS apps for translating burned-in video subtitles using Vision OCR and Apple's translation API. Useful for developers working with multilingual video content and social media platforms like RedNote.
The plan-generator transforms high-level product requirements into executable project blueprints (genesis.xml files) with structured task DAGs and agent assignments. It's invaluable for cofounders and product teams who need to bridge strategic vision with concrete execution plans.
A coordinator agent that handles high-level project decisions, priorities, and delegation across specialized team members for monorepo package management. Ideal for project leads managing complex multi-agent workflows who need strategic oversight without hands-on implementation.
A Chief Technology Officer agent that guides enterprise technology strategy decisions, including investment evaluation, technical vision setting, and architectural planning. Ideal for organizations needing structured CTO-level guidance on technology roadmaps and innovation initiatives.
This Copilot prompt automates PXE server setup on Ubuntu 24.04, guiding users through configurable TFTP, DHCP, NFS, and HTTP services for network booting and OS deployment. DevOps engineers and system administrators benefit from its modular, idempotent approach to infrastructure provisioning.
This MCP server integrates Google's Gemini vision model to enable Claude with fast, reliable image and video analysis capabilities. Developers and Claude users benefit from direct visual content understanding without external API calls.
John is a Product Manager agent that generates comprehensive PRDs and strategic product documentation by translating business vision into actionable development requirements. Product teams, startup founders, and business stakeholders benefit from structured, professional product planning without needing extensive PM expertise.
FaceSwap Copilot Instructions provides a structured guide for developing a React/TypeScript face-swapping web application with real-time ML capabilities. It benefits developers building entertainment, creative, or research-focused face manipulation tools.
Zeus is a thematic multi-agent coordinator agent designed for strategic decision-making and performance tracking, but lacks concrete implementation details and practical use cases.
OpenDarts is a self-hosted dart application with computer vision-based auto-scoring, enabling players to track games and practice with automated score detection via their phone camera. It benefits dart enthusiasts and competitive players who want accurate scoring and game management without manual entry.
CodeVibing lets Claude Code users share their work to a social network with automatic account provisioning and zero setup friction. It's designed for developers who want to post and connect with other Claude Code users effortlessly.
Peepit MCP Server enables AI agents to capture and analyze macOS screenshots with smart window targeting and AI-powered image analysis, solving the critical problem of giving Claude visual perception of the desktop environment.
A1-Vision is a vision-aware system prompt that enables AI assistants to understand screenshots, UI elements, and on-screen text while leveraging memory retrieval and structured reasoning. It's useful for developers building vision-capable AI agents across multiple platforms (Claude, ChatGPT, Cursor, Windsurf).