method
active
method:lora-sft

LoRA SFT

Lightweight fine-tuning method used in E2 to reduce the mismatch d_r.
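As a rough illustration of why LoRA counts as a light method: it leaves the pretrained weight W frozen and trains only a low-rank pair (A, B), applying W + (alpha / r) * B A at inference. The sketch below uses plain Python lists. The function names, layer sizes, rank, and alpha are all illustrative assumptions, since this entry does not state the hyperparameters used in E2.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def init_lora(d_out, d_in, r, seed=0):
    """Standard LoRA init: A is small random noise, B is all zeros.

    With B = 0 the adapter contributes nothing at step 0, so training
    starts from the unmodified pretrained layer.
    """
    rng = random.Random(seed)
    a = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(r)]  # r x d_in
    b = [[0.0] * r for _ in range(d_out)]                                # d_out x r
    return a, b


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, the weight the adapted layer applies."""
    r = len(a)                # rank = number of rows of A
    scale = alpha / r
    delta = matmul(b, a)      # (d_out x r) @ (r x d_in) -> d_out x d_in
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def trainable_params(d_out, d_in, r):
    """Number of parameters LoRA trains for one d_out x d_in layer."""
    return r * (d_in + d_out)
```

For a hypothetical 1024 x 1024 layer at rank 32, `trainable_params(1024, 1024, 32)` gives 65,536 trainable values against 1,048,576 in the full matrix, which is the sense in which the update is light.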

Neighborhood — ranked by edge-count

Concepts (2)

concept
  • Fine-tuning
    implements
    Parameter updates that reduce the mismatch d_r; another anchoring variant in UCCT.
  • Measures how tightly the target pattern PT clusters in representation space; one of three components of S

Methods (1)

method

Related by similarity (6)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Supervised fine-tuning to adapt model parameters.
  • LoRA+CoT · method · 0.704
    Fine-tuning with chain-of-thought rationales aiming to reduce d_r via procedural alignment.
  • Parameter-efficient fine-tuning method used for both SDF and expert iteration stages.
  • Specific fine-tuning implementation using LoRA rank 32, learning rate 2e-4, AdamW 8-bit optimizer
  • Parameter-efficient fine-tuning method used to implement SOO fine-tuning on LLMs
  • SFR-DeepResearch · framework · 0.665
    The paper's core contribution: an RL-based framework for training autonomous single-agent LLMs to perform deep research with web search, browsing, and code execution.
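
The selection rule behind this list (cosine at or above 0.65, no typed edge to the current entity) can be sketched as follows. This is a minimal sketch and not the page's actual implementation: the function names, the embedding dictionary, and the edge representation as (source, target) pairs are assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_candidates(target, embeddings, typed_edges, threshold=0.65):
    """List entities similar to `target` that have no typed edge to it.

    embeddings:  dict mapping entity id -> embedding vector (hypothetical store)
    typed_edges: iterable of (source, target) pairs, direction ignored here
    Returns (entity id, score) pairs, highest score first.
    """
    linked = set()
    for src, dst in typed_edges:
        if src == target:
            linked.add(dst)
        elif dst == target:
            linked.add(src)

    scored = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue  # skip the entity itself and anything already connected
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            scored.append((name, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Entities returned by such a filter are only candidates. Whether one deserves a new typed edge or is a duplicate still needs a manual check.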