finding
pending-review
finding:multimodal-cot-with-vision-features-achieves-higher-validation-accuracy-at-early-training-epochs-epoch-1-3-compared-to-one-stage-and-two-stage-language-only-baselines-on-scienceqa

Multimodal-CoT with vision features achieves higher validation accuracy in early training epochs (epochs 1-3) than the one-stage and two-stage language-only baselines on ScienceQA

zhang-2023-multimodal.md
Frontmatter (9 fields)
{
  "doc": "zhang-2023-multimodal.md",
  "context": "Evidence that multimodal information accelerates convergence during training.",
  "norm_label": "Multimodal-CoT with vision features achieves higher validation accuracy at early training epochs (epoch 1-3) compared to one-stage and two-stage language-only baselines on ScienceQA",
  "graphify_id": "finding_convergence_boost",
  "source_file": "zhang-2023-multimodal.md",
  "imported_from": "/Users/antonborzov/Documents/Research.nosync/papers/extract_typed_out/zhang-2023-multimodal/graph.json",
  "extracted_type": "finding",
  "source_location": "§6.1",
  "graphify_file_type": "finding"
}

Outgoing (1)

Supports (1)

Incoming (0)

None.

Mentions (1)

  • papers-typed
    zhang-2023-multimodal.md