CLUEbenchmark/CLUECorpus2020
Large-scale pre-training corpus for Chinese (100 GB)
Stars: 1.0k | Forks: 83 | Weekly growth: +0
Topics: albert, bert, chinese, chinese-corpus, corpus, datasets, nlp, pretrain, roberta
Star & Fork Trend (16 data points; series: Stars, Forks)
Growth Velocity
CLUEbenchmark/CLUECorpus2020 gained no stars (+0) this period. Velocity data will be available once more historical data has been collected.
| Metric | CLUECorpus2020 | Prompt4ReasoningPapers | mlx-tune | PointLLM |
|---|---|---|---|---|
| Stars | 1.0k | 1.0k | 1.0k | 999 |
| Forks | 83 | 67 | 63 | 57 |
| Weekly Growth | +0 | +0 | +10 | +1 |
| Language | N/A | N/A | Python | Python |
| Sources | 1 | 1 | 1 | 1 |
| License | MIT | MIT | Apache-2.0 | N/A |
Capability Radar: CLUECorpus2020 vs Prompt4ReasoningPapers
Maintenance Activity 69
Last code push 61 days ago.
Community Engagement 41
Fork-to-star ratio: 8.3% (83 forks against roughly 1,000 stars). A lower fork ratio may indicate passive usage.
Issue Burden 70
Issue data not yet available.
Growth Momentum 30
No measurable growth in the current period (first-day cold start expected).
License Clarity 95
Licensed under MIT, a permissive license that allows commercial use.
Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.
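The fork-to-star ratio quoted under Community Engagement can be reproduced from the counts in the comparison table. The following is a minimal sketch, not the site's actual scoring code: the function name `fork_to_star_ratio` is hypothetical, and the star counts shown as "1.0k" are assumed to be exactly 1000, which the page does not confirm.

```python
def fork_to_star_ratio(stars: int, forks: int) -> float:
    """Return forks as a percentage of stars."""
    if stars <= 0:
        raise ValueError("stars must be positive")
    return 100.0 * forks / stars


# (stars, forks) taken from the comparison table.
# Assumption: "1.0k" is treated as exactly 1000 stars.
repos = {
    "CLUECorpus2020": (1000, 83),
    "Prompt4ReasoningPapers": (1000, 67),
    "mlx-tune": (1000, 63),
    "PointLLM": (999, 57),
}

# Ratio per repository, rounded to one decimal place as on the page.
ratios = {
    name: round(fork_to_star_ratio(stars, forks), 1)
    for name, (stars, forks) in repos.items()
}

for name, ratio in ratios.items():
    print(f"{name}: {ratio}%")
```

Under these assumptions CLUECorpus2020 comes out at 8.3%, matching the figure reported above. Because the displayed star counts are rounded, the true ratios may differ slightly.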