community
active
leiden_hybrid_concepts
label: haiku
community:leiden_hybrid_concepts-run4-c13-c4

Multimodal chain-of-thought reasoning benchmarks

ScienceQA and related vision-language tasks evaluated via explicit reasoning steps, spanning 738M-parameter models with 89-95% accuracy ranges.

4 members. Each node is clickable.

Loading graph…

Bridges (3)

Other communities that share members with this one — cross-cutting threads or papers that sit at the seam between two themes.

Findings (4)