92 boosters for "image" — open source, verified from GitHub, ready to install
A PDF processing skill that enables users to read, extract, merge, split, and manipulate PDF files through an AI coding assistant. Developers and business users benefit from automated PDF workflows without writing complex code.
The docx skill enables AI assistants to create, read, edit, and manipulate Word documents with professional formatting, tables of contents, images, and tracked changes. It's useful for developers and users who need to programmatically generate or process .docx files as part of their workflows.
Carousel Growth Engine autonomously transforms any website into viral TikTok and Instagram carousels, analyzing content, generating images via Gemini, publishing directly to feeds, and optimizing through analytics feedback. Ideal for social media managers, content creators, and marketing teams seeking to automate carousel production at scale.
Use this skill when the quality of the work depends on art direction, hierarchy, restraint, imagery, and motion rather than component count. Goal: ship interfaces that feel deliberate, premium, and current. Default toward award-level composition: one big idea, strong imagery, sparse copy, rigorous s
This skill enables users to generate and edit images directly within Claude Code using the OpenAI Image API, supporting use cases from product mockups to concept art. Developers and designers benefit by automating image creation workflows without leaving their coding environment.
A performance optimization skill that identifies and fixes loading speed, rendering, animations, images, and bundle size issues to create faster, smoother user experiences. Developers building web applications benefit from automated performance diagnostics and improvements.
Train object detection, image classification, and SAM/SAM2 segmentation models on managed cloud GPUs. No local GPU setup required—results are automatically saved to the Hugging Face Hub. Use this skill when users want to: Helper scripts use PEP 723 inline dependencies. Run them with :
Transformers.js enables running state-of-the-art machine learning models directly in JavaScript, both in browsers and Node.js environments, with no server required. Use this skill when you need to: The pipeline API is the easiest way to use models. It groups together preprocessing, model inference,
Implementation patterns for Runway prod checklist — AI video generation platform. See related Runway skills for more workflows.
A web search MCP server powered by Brave Search API, enabling Claude and other AI tools to search the web, find local businesses, and retrieve images in real-time. Useful for developers building AI applications that need current information beyond training data.
Nuxt SEO is a meta-module that simplifies SEO configuration for Nuxt applications, enabling developers to manage robots.txt, sitemaps, OG images, and structured data from a single integration.
Meigen is a visual creative expert MCP server that enables Claude to search design inspiration, enhance AI image prompts, and generate images through intelligent workflow orchestration. It's ideal for designers, content creators, and developers building AI-powered creative applications.
Given a URL, return its main content as clean Markdown — headings, links, images, lists, code blocks all preserved. Always try one method per URL — don't cascade blindly. Pick the right one upfront. is the directory where this SKILL.md lives. Resolve it before calling the script.
"name": "nano-banana-2-skill-marketplace", "url": "https://github.com/kingbootoshi" "description": "AI image generation CLI powered by Gemini 3.1 Flash (default) and Gemini 3 Pro. Multi-resolution, aspect ratios, cost tracking, green screen transparency, reference images, style transfer.",
Generate images using ModelScope's Tongyi-MAI/Z-Image-Turbo model. Optional: Set to use your own API key.
17 Google Maps tools for AI agents — geocode, search, directions, weather, air quality, map images via MCP server or standalone CLI
A Chinese-language system prompt that transforms simple descriptions into detailed video generation prompts for AI video models, designed for use in Claude, ChatGPT, Cursor, and Windsurf environments.
Build with . Never run the commands, the user will do it manually. Lint with the for TypeScript and for Biome. Do not use as it will start a long running watch commands - just use . For every string, add the localization to and use the function to retrieve it. Do not hardcode any strings in th
Run this bash block first, before any analysis: This is the first time clearshot is running — no config exists yet. Before doing any analysis, tell the user to run the onboarding setup. Say something brief like: "clearshot needs a quick first-run setup (two questions, arrow keys + enter):"
"name": "minimax-mcp-js", "version": "0.0.17", "description": "Official MiniMax Model Context Protocol (MCP) JavaScript implementation that provides seamless integration with MiniMax's powerful AI capabilities including image generation, video generation, text-to-speech, and voice cloning APIs.",
Dinox MCP Server enables Claude to understand images through advanced object detection, localization, and captioning by connecting to the DINO-X vision model. Developers building multimodal AI applications benefit from seamless integration of real-world visual perception into LLM workflows.
Nuxt SEO is a meta-module that streamlines SEO configuration, sitemap generation, OG image creation, and structured data management for Nuxt applications. Developers building Nuxt sites need this to ensure proper search engine visibility and social media optimization.
A skill that removes backgrounds from images using FAL.ai's BiRefNet model, enabling developers to quickly extract subjects and create transparent PNGs. Useful for anyone building image processing features into Claude-based applications.
This skill encodes the complete design specification for professional business presentations — a consultant-grade PowerPoint framework based on McKinsey design principles. It includes: All specifications have been refined through iterative production feedback to ensure visual consistency, profession