method
active
method:lora-sftLoRA SFT
Light fine-tuning method used in E2 to reduce mismatch dr.
Neighborhood — ranked by edge-count
Concepts (2)
concept
- Fine-tuningimplementsParameter updates that reduce mismatch dr; another anchoring variant in UCCT.
- Target Pattern Cohesion (ρd)associated_withMeasures how tightly the target pattern PT clusters in representation space; one of three components of S
Methods (1)
method
- Quantitative study varying representational familiarity via numeral bases B10/B8/B9 at fixed computational complexity
Related by similarity (6)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Supervised fine-tuning to adapt model parameters.
- Fine-tuning with chain-of-thought rationales aiming to reduce dr via procedural alignment.
- Parameter-efficient fine-tuning method used for both SDF and expert iteration stages.
- Specific fine-tuning implementation using LoRA rank 32, learning rate 2e-4, AdamW 8-bit optimizer
- Parameter-efficient fine-tuning method used to implement SOO fine-tuning on LLMs
- The paper's core contribution: an RL-based framework for training autonomous single-agent LLMs to perform deep research with web search, browsing, and code execution.