AI SummaryTranscribe audio and video files to text with optional speaker identification and diarization. Useful for developers building transcription features, interview processing tools, or meeting documentation systems.
Install
Copy this and paste it into Claude Code, Cursor, or any AI assistant:
I want to install the "transcribe" skill in my project. Please run this command in my terminal: # Install skill into your project (7 files) mkdir -p .claude/skills/transcribe && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/SKILL.md "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/SKILL.md" && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/LICENSE.txt "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/LICENSE.txt" && mkdir -p .claude/skills/transcribe/agents && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/agents/openai.yaml "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/agents/openai.yaml" && mkdir -p .claude/skills/transcribe/assets && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/assets/transcribe-small.svg "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/assets/transcribe-small.svg" && mkdir -p .claude/skills/transcribe/assets && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/assets/transcribe.png "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/assets/transcribe.png" && mkdir -p .claude/skills/transcribe/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/references/api.md "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/references/api.md" && mkdir -p .claude/skills/transcribe/scripts && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/scripts/transcribe_diarize.py "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/scripts/transcribe_diarize.py" Then restart Claude Code (or reload the window in Cursor) so the skill is picked up.
Description
Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.
Audio Transcribe
Transcribe audio using OpenAI, with optional speaker diarization when requested. Prefer the bundled CLI for deterministic, repeatable runs.
Workflow
• Collect inputs: audio file path(s), desired response format (text/json/diarized_json), optional language hint, and any known speaker references. • Verify OPENAI_API_KEY is set. If missing, ask the user to set it locally (do not ask them to paste the key). • Run the bundled transcribe_diarize.py CLI with sensible defaults (fast text transcription). • Validate the output: transcription quality, speaker labels, and segment boundaries; iterate with a single targeted change if needed. • Save outputs under output/transcribe/ when working in this repo.
Decision rules
• Default to gpt-4o-mini-transcribe with --response-format text for fast transcription. • If the user wants speaker labels or diarization, use --model gpt-4o-transcribe-diarize --response-format diarized_json. • If audio is longer than ~30 seconds, keep --chunking-strategy auto. • Prompting is not supported for gpt-4o-transcribe-diarize.
Output conventions
• Use output/transcribe/<job-id>/ for evaluation runs. • Use --out-dir for multiple files to avoid overwriting.
Discussion
Health Signals
My Fox Den
Community Rating
Sign in to rate this booster