TR

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

5.7k 352 +0/wk
GitHub
article-extractor corpus-builder corpus-tools crawler html-to-markdown html2text llm news-aggregator news-crawler nlp rag readability
Trend 3

Star & Fork Trend (35 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

adbar/trafilatura has +0 stars this period . 7-day velocity: 0.3%.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric trafilatura Baichuan-7B QOwnNotes freegpt-webui
Stars 5.7k 5.7k5.7k5.7k
Forks 352 5064921.2k
Weekly Growth +0 +0+1-1
Language Python PythonC++Python
Sources 1 111
License Apache-2.0 Apache-2.0GPL-2.0GPL-3.0

Capability Radar vs Baichuan-7B

trafilatura
Baichuan-7B
Maintenance Activity 0

Last code push 208 days ago.

Community Engagement 31

Fork-to-star ratio: 6.2%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 30

No measurable growth in the current period (first-day cold start expected).

License Clarity 95

Licensed under Apache-2.0. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.