CLUEbenchmark/CLUECorpus2020

Large-scale Pre-training Corpus for Chinese: 100 GB Chinese pre-training corpus

1.0k stars · 83 forks · +0/wk

GitHub

albert bert chinese chinese-corpus corpus datasets nlp pretrain roberta

Trend 0

CLUEbenchmark/CLUECorpus2020 gained +0 stars this period. Velocity data will be available once more historical data has been collected.

Metric	CLUECorpus2020	Prompt4ReasoningPapers	mlx-tune	PointLLM
Stars	1.0k	1.0k	1.0k	999
Forks	83	67	63	57
Weekly Growth	+0	+0	+10	+1
Language	N/A	N/A	Python	Python
Sources	1	1	1	1
License	MIT	MIT	Apache-2.0	N/A

Maintenance Activity 69

Last code push 61 days ago.

Community Engagement 41

Fork-to-star ratio: 8.3%. Lower fork ratio may indicate passive usage.
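
The ratio above can be reproduced from the listed counts. The following is a minimal sketch, not the site's actual scoring code: the function name `fork_to_star_ratio` is hypothetical, and the rounded "1.0k" star count is assumed to be exactly 1000 for illustration.

```python
def fork_to_star_ratio(forks: int, stars: int) -> float:
    """Return forks as a percentage of stars (0.0 when there are no stars)."""
    if stars <= 0:
        return 0.0
    return 100.0 * forks / stars


# Assumption: "1.0k" stars is treated as exactly 1000; 83 forks is taken from the listing.
ratio = fork_to_star_ratio(forks=83, stars=1000)
print(f"Fork-to-star ratio: {ratio:.1f}%")
```

Because the star count is rounded on the page, the true ratio may differ slightly from the displayed figure.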

Issue Burden 70

Issue data not yet available.

Growth Momentum 30

No measurable growth in the current period (first-day cold start expected).

License Clarity 95

Licensed under MIT, a permissive license that allows commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.