HA

tianyi-lab/HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

336 9 +0/wk
GitHub
benchmark benchmarks gpt-4 gpt-4v hallucination large-language-models large-vision-language-models llava llm lmm vlms
Trend 0

Star & Fork Trend (18 data points)

Stars
Forks

Multi-Source Signals

Growth Velocity

tianyi-lab/HallusionBench has +0 stars this period . Velocity data will be available after more historical data is collected.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric HallusionBench NLPGNN gpt-j-api Open-WebUI-Functions
Stars 336 336335335
Forks 9 665452
Weekly Growth +0 +0+0+0
Language Python PythonPythonPython
Sources 1 111
License BSD-3-Clause MITMITApache-2.0

Capability Radar vs NLPGNN

HallusionBench
NLPGNN
Maintenance Activity 2

Last code push 176 days ago.

Community Engagement 63

Fork-to-star ratio: 2.7%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 30

No measurable growth in the current period (first-day cold start expected).

License Clarity 95

Licensed under BSD-3-Clause. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.