tianyi-lab/HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

336 9 +0/wk

GitHub

benchmark benchmarks gpt-4 gpt-4v hallucination large-language-models large-vision-language-models llava llm lmm vlms

Trend 0

Star & Fork Trend (18 data points)

Stars

Forks

Multi-Source Signals

GitHub

stars 336

forks 9

Growth Velocity

tianyi-lab/HallusionBench has +0 stars this period . Velocity data will be available after more historical data is collected.

Deep analysis is being generated for this repository.

Signal-backed technical analysis will be available soon.

Metric	HallusionBench	NLPGNN	gpt-j-api	Open-WebUI-Functions
Stars	336	336	335	335
Forks	9	66	54	52
Weekly Growth	+0	+0	+0	+0
Language	Python	Python	Python	Python
Sources	1	1	1	1
License	BSD-3-Clause	MIT	MIT	Apache-2.0

Capability Radar vs NLPGNN

HallusionBench

NLPGNN

Maintenance Activity 2

Last code push 176 days ago.

Community Engagement 63

Fork-to-star ratio: 2.7%. Lower fork ratio may indicate passive usage.

Issue Burden 70

Issue data not yet available.

Growth Momentum 30

No measurable growth in the current period (first-day cold start expected).

License Clarity 95

Licensed under BSD-3-Clause. Permissive — safe for commercial use.

Risk scores are computed from real-time repository data. Higher scores indicate healthier metrics.