Skip to content
Prompt

Curator — Cursor Rules

by NVIDIA-NeMo

AI Summary

A Cursor-integrated coding standards and linting configuration toolkit for NeMo Curator projects, helping teams enforce consistent data processing code quality with Ruff-based rules and exceptions.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to add the "Curator — Cursor Rules" prompt rules to my project.
Repository: https://github.com/NVIDIA-NeMo/Curator

Please read the repo to find the rules/prompt file, then:
1. Download it to the correct location (.cursorrules, .windsurfrules, .github/prompts/, or project root — based on the file type)
2. If there's an existing rules file, merge the new rules in rather than overwriting
3. Confirm what was added

Description

Scalable data pre processing and curation toolkit for LLMs

`examples/` directory

• No __init__.py required (INP001)

Linting and Formatting

The project uses Ruff for linting and formatting with line length of 119 characters.

Allowed Patterns

• ✅ Print statements (T20 ignored) • ✅ Boolean arguments in functions (FBT ignored) • ✅ df as variable name for DataFrames (PD901 ignored) • ✅ TODOs without author/link (TD002, TD003 ignored) • ✅ Long exception messages (TRY003 ignored) • ✅ Accessing private attributes (SLF001 ignored) • ✅ Branching after return (RET505-508 ignored)

Required Patterns

• ❌ No docstrings required (D ignored) • ❌ No pathlib enforcement (PTH ignored) • ❌ No logging enforcement (G ignored) • ✅ Type annotations for functions (except args, *kwargs, special methods)

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted 18d ago
Active
Adoption1K+ stars on GitHub
1.5k ★ · Popular
DocsMissing or thin
Undocumented

GitHub Signals

Stars1.5k
Forks253
Issues253
Updated18d ago
View on GitHub
Apache-2.0 License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Any AI assistant that accepts custom rules or system prompts

Claude
ChatGPT
Cursor
Windsurf
Copilot
+ more