How do I install transcribe?

transcribe is a Skill hosted on GitHub at https://github.com/openai/skills. Visit the ImAiFox page at https://imaifox.com/boosters/openai-skills-transcribe for the AI-ready install prompt you can copy directly into Claude Code, Cursor, or Windsurf.

How popular is transcribe?

transcribe has 22,593 GitHub stars and 1,535 forks. It is actively maintained with recent commits.

Yes — transcribe is open source and free to use. The source code is publicly available on GitHub at https://github.com/openai/skills.

Skill

transcribe

Name: transcribe
Author: openai

by openai

AI Summary

Transcribe audio and video files to text with optional speaker identification and diarization. Useful for developers building transcription features, interview processing tools, or meeting documentation systems.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to install the "transcribe" skill in my project.

Please run this command in my terminal:
# Install skill into your project (7 files)
mkdir -p .claude/skills/transcribe && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/SKILL.md "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/SKILL.md" && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/LICENSE.txt "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/LICENSE.txt" && mkdir -p .claude/skills/transcribe/agents && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/agents/openai.yaml "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/agents/openai.yaml" && mkdir -p .claude/skills/transcribe/assets && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/assets/transcribe-small.svg "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/assets/transcribe-small.svg" && mkdir -p .claude/skills/transcribe/assets && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/assets/transcribe.png "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/assets/transcribe.png" && mkdir -p .claude/skills/transcribe/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/references/api.md "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/references/api.md" && mkdir -p .claude/skills/transcribe/scripts && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/transcribe/scripts/transcribe_diarize.py "https://raw.githubusercontent.com/openai/skills/main/skills/.curated/transcribe/scripts/transcribe_diarize.py"

Then restart Claude Code (or reload the window in Cursor) so the skill is picked up.

Description

Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.

Audio Transcribe

Transcribe audio using OpenAI, with optional speaker diarization when requested. Prefer the bundled CLI for deterministic, repeatable runs.

Workflow

• Collect inputs: audio file path(s), desired response format (text/json/diarized_json), optional language hint, and any known speaker references. • Verify OPENAI_API_KEY is set. If missing, ask the user to set it locally (do not ask them to paste the key). • Run the bundled transcribe_diarize.py CLI with sensible defaults (fast text transcription). • Validate the output: transcription quality, speaker labels, and segment boundaries; iterate with a single targeted change if needed. • Save outputs under output/transcribe/ when working in this repo.

Decision rules

• Default to gpt-4o-mini-transcribe with --response-format text for fast transcription. • If the user wants speaker labels or diarization, use --model gpt-4o-transcribe-diarize --response-format diarized_json. • If audio is longer than ~30 seconds, keep --chunking-strategy auto. • Prompting is not supported for gpt-4o-transcribe-diarize.

Output conventions

• Use output/transcribe/<job-id>/ for evaluation runs. • Use --out-dir for multiple files to avoid overwriting.

Discussion

0/2000

Loading comments...

Health Signals

MaintenanceCommitted 1mo ago

● Active

Adoption1K+ stars on GitHub

22.6k ★ · Popular

DocsREADME + description

Well-documented

GitHub Signals

Stars22.6k

Forks1.5k

Issues277

Updated1mo ago

View on GitHub

No License

My Fox Den

Community Rating

Works With

Claude Code

Related Skills

Openclaw MCP Server

MCP Server

test-driven-development

Skill

using-superpowers

Skill

dispatching-parallel-agents

Skill

View all Skills →