opendatalab/MinerU-HTML
MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.
Star & Fork Trend (18 data points)
Growth Velocity
opendatalab/MinerU-HTML gained 0 stars this period. Velocity data will be available after more historical data is collected.
| Metric | MinerU-HTML | NSP-BERT | bert-vocab-builder | rag-using-langchain-amazon-bedrock-and-opensearch |
|---|---|---|---|---|
| Stars | 229 | 230 | 230 | 228 |
| Forks | 24 | 38 | 48 | 45 |
| Weekly Growth | +0 | +0 | +0 | +0 |
| Language | Python | Python | Python | Python |
| Sources | 1 | 1 | 1 | 1 |
| License | Apache-2.0 | Apache-2.0 | N/A | MIT-0 |
Last code push 12 days ago.
Fork-to-star ratio: 10.5%. Active community forking and contributing.
Issue data not yet available.
No measurable growth in the current period; this is expected on the first day of tracking (cold start).
Licensed under Apache-2.0. Permissive — safe for commercial use.
Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.
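The fork-to-star ratio reported above can be reproduced from the comparison table. The sketch below assumes the ratio is simply forks divided by stars, expressed as a percentage and rounded to one decimal place; that formula is inferred from the reported 10.5% and is not documented on this page. The function name `fork_to_star_ratio` is hypothetical.

```python
# Star and fork counts copied from the comparison table.
REPOS = {
    "MinerU-HTML": (229, 24),
    "NSP-BERT": (230, 38),
    "bert-vocab-builder": (230, 48),
    "rag-using-langchain-amazon-bedrock-and-opensearch": (228, 45),
}


def fork_to_star_ratio(stars: int, forks: int) -> float:
    """Return forks as a percentage of stars, rounded to one decimal place.

    Assumption: the dashboard computes the ratio as forks / stars.
    """
    if stars <= 0:
        raise ValueError("stars must be positive")
    return round(100 * forks / stars, 1)


ratios = {name: fork_to_star_ratio(stars, forks) for name, (stars, forks) in REPOS.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio}%")
```

Under this assumption, MinerU-HTML comes out at 10.5%, matching the figure shown, and it has the lowest ratio of the four repositories in the table.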