21 boosters for "audio" — open source, verified from GitHub, ready to install
Podcast Strategist is a content strategy expert for Chinese podcast creators, offering guidance on show positioning, production, audience growth, and monetization across platforms like Xiaoyuzhou and Ximalaya. It's ideal for podcast creators and audio content entrepreneurs looking to build sustainable brands in China's audio market.
A hands-on short-video editing coach that guides users through the complete post-production pipeline across CapCut Pro, Premiere Pro, DaVinci Resolve, and Final Cut Pro. Ideal for content creators, video editors, and filmmakers seeking professional-grade editing expertise and workflow optimization.
Game Audio Engineer is an interactive audio specialist agent that helps developers design and implement adaptive music systems, spatial audio, and FMOD/Wwise integration across game engines. It benefits game audio engineers, sound designers, and full-stack game developers working on audio implementation.
Transcribe audio and video files to text with optional speaker identification and diarization. Useful for developers building transcription features, interview processing tools, or meeting documentation systems.
Generates spoken audio from text using OpenAI's TTS API, supporting single clips, batch operations, and accessibility reads. Developers building voiceovers, IVR systems, or accessible content will find this directly useful.
Transformers.js runs state-of-the-art machine learning models directly in JavaScript, in both browsers and Node.js, with no server required. The pipeline API is the easiest way to use models: it groups together preprocessing, model inference, …
YouTube Audio is a browser extension that strips video from YouTube streams, preserving only audio to save battery and bandwidth. It's useful for users who consume YouTube content primarily for audio (music, podcasts, lectures) and want to reduce data usage.
A fully autonomous AI research agent that ingests sources into Google NotebookLM, runs deep web research, synthesizes knowledge through cited Q&A and 9 downloadable artifact types, creates polished content drafts, and optionally publishes to social platforms.
Cursor rules for WiiM Home Assistant integration development that enforce architectural decisions and prevent technical debt by mandating fixes in pywiim rather than workarounds in the HA layer. Developers working on this Home Assistant integration follow these rules to maintain code quality and proper separation of concerns.
Copilot instructions for developing and maintaining a Go application that syncs reading progress and book metadata between AudiobookShelf and Hardcover platforms. Developers building or contributing to this sync tool benefit from clear architectural guidance and API integration patterns.
ultraplan is a CLI tool that records multi-modal context (audio, screenshots, clipboard, keystrokes) from work sessions and converts it into detailed prompts for Claude analysis. Developers and researchers benefit by capturing complete context from meetings, debugging, or research workflows for later AI-assisted analysis.
Cursor rules for managing VST3 & AU plugins within the Spotify Pedalboard Python library, enabling developers to integrate audio plugin scanning with crash recovery and SQLite storage. Useful for audio engineers and Python developers building plugin-based audio applications.
runway-api: Helps users integrate Runway's public API (https://docs.dev.runwayml.com/) into their projects. Analyzes codebase compatibility, guides API key setup, and provides hands-on integration assistance for video generation, image generation, audio, and file uploads.
An MCP server that integrates OpenAI's image and audio generation capabilities into Claude Desktop, Claude Code, and Cursor via a CLI wrapper. Developers building AI applications benefit from direct access to DALL-E and audio generation without managing API calls manually.
windows-pc-optimizer: Windows PC optimization toolkit for Claude Code.
An MCP server that integrates FFmpeg for multimedia processing with security, caching, and batch operations support, enabling developers to perform video and audio transformations seamlessly within Claude Desktop and Claude Code environments.
promptpilot-mcp: MCP server for PromptPilot.club that generates images, video, and audio via the Pollinations API.
A Windsurf-integrated prompt for managing and developing an audio-to-subtitle video conversion application using Next.js and FastAPI with OpenAI Whisper. Developers building Korean audio processing tools benefit from structured server management and development workflows.
pagebolt-mcp: MCP server for PageBolt that takes screenshots, generates PDFs, creates OG images, inspects pages, and records demo videos with Audio Guide narration, from AI coding assistants like Claude, Cursor, and Windsurf.
@gpu-bridge/mcp-server: GPU-Bridge MCP Server exposing 30 AI services as MCP tools: LLM, image, video, audio, embeddings, reranking, PDF parsing, NSFW detection, and more. x402 native for autonomous agents.