
CLUEbenchmark/CLUECorpus2020

Large-scale pre-training corpus for Chinese (100G)

Stars: 1.0k | Forks: 83 | Weekly growth: +0
Source: GitHub
Topics: albert, bert, chinese, chinese-corpus, corpus, datasets, nlp, pretrain, roberta
Trend: 0

[Chart: Star & Fork Trend (16 data points); series: Stars, Forks]

Multi-Source Signals

Growth Velocity

CLUEbenchmark/CLUECorpus2020 gained +0 stars this period. Velocity data will be available once more historical data has been collected.

Metric          CLUECorpus2020   Prompt4ReasoningPapers   mlx-tune     PointLLM
Stars           1.0k             1.0k                     1.0k         999
Forks           83               67                       63           57
Weekly Growth   +0               +0                       +10          +1
Language        N/A              N/A                      Python       Python
Sources         1                1                        1            1
License         MIT              MIT                      Apache-2.0   N/A

Capability Radar vs Prompt4ReasoningPapers

[Radar chart; series: CLUECorpus2020, Prompt4ReasoningPapers]
Maintenance Activity 69

Last code push 61 days ago.

Community Engagement 41

Fork-to-star ratio: 8.3%. Lower fork ratio may indicate passive usage.
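
The 8.3% figure is consistent with the counts shown on this page. A minimal sketch of the arithmetic, assuming the rounded "1.0k" star count is about 1000 (the exact count is not given here); the function name `fork_to_star_ratio` is hypothetical and not part of any tool referenced on the page:

```python
def fork_to_star_ratio(forks: int, stars: int) -> float:
    """Return forks as a percentage of stars."""
    if stars <= 0:
        raise ValueError("stars must be positive")
    return 100.0 * forks / stars


# 83 forks is taken from the page; 1000 stars is an assumption,
# since the page only shows the rounded value "1.0k".
ratio = fork_to_star_ratio(83, 1000)
print(f"Fork-to-star ratio: {ratio:.1f}%")
```

Because the star count is rounded, the true ratio may differ slightly in the first decimal place.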

Issue Burden 70

Issue data not yet available.

Growth Momentum 30

No measurable growth in the current period (first-day cold start expected).

License Clarity 95

Licensed under MIT, a permissive license that allows commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.