CU

NVIDIA-NeMo/Curator

Scalable data pre processing and curation toolkit for LLMs

1.5k 252 +2/wk
GitHub
data data-curation data-prep data-preparation data-processing data-processing-pipelines data-quality datacuration datarecipes deduplication fast-data-processing fine-tuning
Trend 3

Star & Fork Trend (37 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

NVIDIA-NeMo/Curator has +2 stars this period . 7-day velocity: 0.5%.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric Curator Lumos nlp-lang coffee
Stars 1.5k 1.5k1.5k1.5k
Forks 252 11149575
Weekly Growth +2 -1+0+0
Language Python TypeScriptJavaPython
Sources 1 111
License Apache-2.0 MITApache-2.0Apache-2.0

Capability Radar vs Lumos

Curator
Lumos
Maintenance Activity 100

Last code push 1 days ago.

Community Engagement 83

Fork-to-star ratio: 16.7%. Active community forking and contributing.

Issue Burden 70

Issue data not yet available.

Growth Momentum 48

+2 stars this period — 0.13% growth rate.

License Clarity 95

Licensed under Apache-2.0. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.