method
active
method:lora-cotLoRA+CoT
Fine-tuning with chain-of-thought rationales aiming to reduce dr via procedural alignment.
Neighborhood — ranked by edge-count
Concepts (2)
concept
- Fine-tuningimplementsParameter updates that reduce mismatch dr; another anchoring variant in UCCT.
- Prior-Target Mismatch (dr)associated_withMeasures how far the target PT is from the prior P_prior; increases anchoring difficulty
Methods (1)
method
- Quantitative study varying representational familiarity via numeral bases B10/B8/B9 at fixed computational complexity
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- E2 finding showing CoT's limited benefit for OOD transfer, consistent with larger dr out of scope
- Parameter-efficient fine-tuning method used for both SDF and expert iteration stages.
- Specific fine-tuning implementation using LoRA rank 32, learning rate 2e-4, AdamW 8-bit optimizer
- Parameter-efficient fine-tuning method used to implement SOO fine-tuning on LLMs
- Light fine-tuning method used in E2 to reduce mismatch dr.
- A prompting technique that elicits intermediate reasoning steps before final answer inference in language models.
- A two-stage framework that separates rationale generation and answer inference by incorporating vision and language modalities.
- Named method for monitoring chain-of-thought text to detect when the model signals its answer, compared against activation probes