method
active
method:nltk-stemming-and-lemmatizationNLTK Stemming and Lemmatization
Used to normalize candidate instruction tokens in the instruction discovery experiment.
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- SAE latents that rise as correction approaches and peak after self-correction begins, complementing OTDs
- Out-of-context reasoning work directly related to synthetic document fine-tuning experiments
- An unsupervised method for generating natural language explanations of LLM activations through a verbalizer-reconstructor pair trained jointly with RL.
- Core unsupervised method for generating natural language explanations of LLM activations through a verbalizer-reconstructor pair trained with RL.
- Sharma et al. result supporting cross-modal alignment: language-only models implicitly encode visual structure
- Little evidence of steganography in NLAs; meaning-preserving transformations cause only small drops in FVEfinding0.679Quantitative evaluation showing NLAs do not heavily rely on covert encoding beyond overt language.
- Understanding how LMs learn linguistic behaviours may offer insights into fundamental properties of languagehypothesis0.675Forward-looking hypothesis linking LM mechanism analysis to linguistic theory
- Alternative data attribution approach using an LLM as a judge; compared against the probe-based method.