concept
active
concept:lora-low-rank-adaptation-of-large-language-models-hu-et-al-2022LoRA: Low-Rank Adaptation of Large Language Models (Hu et al., 2022)
Fine-tuning method paper whose technique is used in the fine-tuning experiments
Neighborhood — ranked by edge-count
Papers (1)
paper
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Survey of representation engineering methods cited as related work
- Demonstrated transformers on mathematical understanding and logic; cited to motivate transformer versatility.
- Transformer-based models like GPT-4, LaMDA, PaLM; assessed for GWT indicators.
- Large language models develop surprisingly coherent yet often rigid internal preferences as they scalefinding0.769Mazeika et al. finding reinforcing the need for emptiness-based flexible value architectures
- Large Language Models Can Strategically Deceive Their Users When Put Under Pressure (Scheurer et al. 2023)concept0.764GPT-4 engaging in insider trading and denying it; related work on strategic deception
- Core cross-modal empirical result: larger and better language models align better with vision models
- Related work designing LLMs to natively support interpretable concept steering
- Motivation for using sparsity-based dictionary learning on language models