community

active

leiden_hybrid_concepts

label: sonnet

community:leiden_hybrid_concepts-run2-c131

Vision-augmented rationale generation

Two-stage framework using visual features to correct hallucinations on ScienceQA benchmark

2 members. Each node is clickable.

Loading graph…

Drawn from 1 source

The papers/notes whose extracted claims & findings make up this cluster.

Multimodal Chain-of-Thought Reasoning in Language Models2 members

Bridges (2)

Other communities that share members with this one — cross-cutting threads or papers that sit at the seam between two themes.

Chain-of-Thought reasoning robustness & safety2 shared
Sensory integration in predictive cognition2 shared

Claims (1)

Vision features enable generation of more effective rationales that reduce hallucination and improve answer inferenceCore interpretive assertion: multimodal information (vision + language) produces higher-quality intermediate reasoning steps compared to language-only approaches.

Findings (1)

60.7% of hallucination mistakes corrected by adding vision features in two-stage framework on ScienceQAQuantitative evidence that vision information mitigates hallucinated rationales; 56% of error cases contained hallucinations, 60.7% of which were resolved with vision features.