Skip to content
Skill

huggingface-lora-space-builder

by huggingface

AI Summary

Build and publish a Gradio demo on Hugging Face Spaces that runs inference with a user-provided LoRA. Use whenever someone asks to create, generate, ship, or publish "a Space", "a demo", "a Gradio app", or "a playground" for a LoRA — whether the base model is Qwen-Image, Qwen-Image-Edit, LTX, or ano

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to install the "huggingface-lora-space-builder" skill in my project.

Please run this command in my terminal:
# Install skill into your project (5 files)
mkdir -p .claude/skills/huggingface-lora-space-builder && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/huggingface-lora-space-builder/SKILL.md "https://raw.githubusercontent.com/huggingface/skills/main/skills/huggingface-lora-space-builder/SKILL.md" && mkdir -p .claude/skills/huggingface-lora-space-builder/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/huggingface-lora-space-builder/references/adapting-to-the-lora.md "https://raw.githubusercontent.com/huggingface/skills/main/skills/huggingface-lora-space-builder/references/adapting-to-the-lora.md" && mkdir -p .claude/skills/huggingface-lora-space-builder/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/huggingface-lora-space-builder/references/creative-mode.md "https://raw.githubusercontent.com/huggingface/skills/main/skills/huggingface-lora-space-builder/references/creative-mode.md" && mkdir -p .claude/skills/huggingface-lora-space-builder/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/huggingface-lora-space-builder/references/tasks.md "https://raw.githubusercontent.com/huggingface/skills/main/skills/huggingface-lora-space-builder/references/tasks.md" && mkdir -p .claude/skills/huggingface-lora-space-builder/references && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/skills/huggingface-lora-space-builder/references/zerogpu-and-publishing.md "https://raw.githubusercontent.com/huggingface/skills/main/skills/huggingface-lora-space-builder/references/zerogpu-and-publishing.md"

Then restart Claude Code (or reload the window in Cursor) so the skill is picked up.

Description

Build and publish a Gradio demo on Hugging Face Spaces for a user-provided LoRA. Use when someone asks to create, generate, ship, or publish a Space, demo, Gradio app, or playground for a LoRA — including LoRAs for Qwen-Image, Qwen-Image-Edit, LTX-Video, Wan, FLUX, SDXL, or other diffusion base models. Also triggers when someone describes a LoRA they trained or hosts on the Hub and wants to share it. Covers picking the right base pipeline and `diffusers` inference recipe, designing a UI tailored to the LoRA's task and inputs (Union/multi-task control, edit, video, image, etc.), respecting model-card recommendations (trigger words, steps, guidance, LoRA scale, example inputs), and shipping to ZeroGPU hardware as a private Space by default.

Gradio LoRA Space Builder

Build and publish a Gradio demo on Hugging Face Spaces that runs inference with a user-provided LoRA. Use whenever someone asks to create, generate, ship, or publish "a Space", "a demo", "a Gradio app", or "a playground" for a LoRA — whether the base model is Qwen-Image, Qwen-Image-Edit, LTX, or another diffusion model. Also use when someone describes a LoRA they trained or hosts on the Hub and wants to share it. The default target is ZeroGPU hardware and the default inference library is diffusers when the base model supports it. The output is a real, published Space (private by default) that the user can try in the browser, not a local script.

What "good" looks like for these demos

The demo should feel handcrafted for this specific LoRA, not a generic template with the LoRA bolted on. Two LoRAs that share a task can still need different demos: a pose-control video LoRA and an outpainting video LoRA both take video in and produce video out, but the inputs the user provides, the preprocessing, and the controls are completely different. Recognizing that is the central job here. Concretely, a good demo: • Loads fast and runs fast — minimal model loading, sensible step count, no wasted computation per call. • Has a UI with exactly the controls this LoRA needs and nothing else. Excess sliders are a cost, not a feature. • Shows the user what's happening — progress, intermediate outputs where useful, the seed used, a clear error when input is missing. • Honors the LoRA's own recommendations from its model card: trigger words, recommended step count, recommended guidance scale, recommended LoRA scale, example inputs. • Is creative where creativity helps — interactive canvases, before/after sliders, side-by-side previews of intermediate processing — and plain where plainness is right.

Workflow

Work through these phases in order. Information gathered in one phase decides the next. • Gather the LoRA info needed to pick a pipeline and design a UI. • Pick the base pipeline and inference recipe. • Design the UI for this specific LoRA's task and inputs. • Write app.py, requirements.txt, and README.md together; show all three to the user for one batched approval. • Publish the Space (private). Don't drip-feed questions across multiple turns. Batch them. ---

Phase 1 — Gather LoRA info

Required: a LoRA repo on the Hub (e.g. username/my-lora). First, try to read the repo without a token. If it succeeds, the repo is public — proceed. If it fails with 401/403, the repo is private/gated and you need an authenticated session to read it. Don't immediately ask for a token. Check first whether the user is already authenticated. `python from huggingface_hub import HfApi, get_token cached_token = get_token() # picks up HF_TOKEN env var or cached CLI login if cached_token: try: info = HfApi().whoami(token=cached_token) username = info["name"] # info also has fine-grained token scope info if applicable except Exception: cached_token = None # token exists but is invalid/expired ` Then: • If a valid cached token exists and it can read the repo, use it. No prompt needed. • If no cached token, or the cached token can't read this private repo, ask the user for a token — once, with the explanation below. When asking for a token (and only when you actually need to ask): > I need a Hugging Face access token with write scope (to read the LoRA if it's private/gated, and to publish the Space). Create one at https://huggingface.co/settings/tokens. Paste it here. The same token will be reused for publishing in the final phase, so this is a one-time ask. Then read what's in the repo: • List the repo files (huggingface_hub.HfApi().list_repo_files(repo_id)). Look for .safetensors, README.md, example images/videos, multiple checkpoints. • Fetch the model card (huggingface_hub.ModelCard.load(repo_id)). The data dict has structured fields; the text has the README body. • If multiple .safetensors files exist, pick the right one — see "Picking the LoRA weights file" in references/zerogpu-and-publishing.md. Briefly: README-recommended file wins, then pytorch_lora_weights.safetensors, then latest training checkpoint, otherwise ask. From the model card, try to determine: • Base model — the base_model field, or text mentions in the README. Usually present. Use it to pick the pipeline reference file (see Phase 2). • Task — pipeline_tag if set, otherwise inferred from the base model and README text. The five tasks this skill handles: text-to-image, image-to-image, text-to-video, image-to-video, video-to-video. • Trigger words — often called "trigger word", "instance prompt", "activation word"; sometimes embedded in example prompts. • Recommended inference recipe — step count, guidance scale, true CFG scale, LoRA scale, resolution. Many LoRA cards include a Python snippet; trust its parameters (steps, guidance, CFG, LoRA scale, dtype). For loading mechanics, see adapting-to-the-lora.md — prefer pipe.load_lora_weights(...) over whatever loading approach the snippet uses. • Example prompts and example media — use these as Gradio examples in the UI. • Sub-task / specific use case — for image edits and video LoRAs, "what does this LoRA actually do" matters as much as the task category. A relighting LoRA, a face-swap LoRA, and a style LoRA all might be image-to-image, but the UI for each is different. When something can't be inferred, ask the user — once, in a single batched message. Format the question to make answering trivial. For task category, list the five options as a numbered choice. For sub-task, give a one-line description ("what does this LoRA do? e.g. 'relight portraits', 'apply manga style', 'extend videos to wider aspect ratios'"). Don't ask if you can already infer it confidently from the base model or README. If the model card has nothing helpful at all — no base model, no task, no example — surface that clearly: "The model card has no usable info. I'll need you to tell me: (1) base model, (2) what this LoRA does, (3) recommended step count and guidance scale if you know them." ---

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted Yesterday
Active
Adoption1K+ stars on GitHub
10.7k ★ · Popular
DocsREADME + description
Well-documented

GitHub Signals

Stars10.7k
Forks703
Issues28
UpdatedYesterday
View on GitHub
Apache-2.0 License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Claude Code