thinker
active
thinker:johannes-von-oswaldJohannes von Oswald
Transformers learn in-context by gradient descent.
Authored
0
Introduces
1
Studies
0
Affiliations
0
Cited by
0
More papers — OpenAlex / S2
Other inbound relations (4)
- authoredvon Oswald et al. (2023) Transformers learn in-context by gradient descent(artifact)
- citesSemantic Anchoring in LLMs: Thresholds, Transfer, and Geometric Correlates(artifact)
- citesWhy Learning Requires Feeling(paper)
- mentionsSemantic Anchoring in LLMs: Thresholds, Transfer, and Geometric Correlates(artifact)
Recent mentions (4)
- papers-typedberg-2026-learning.md
- paperschang-2025-guanyin.md
- papers-typedchang-2025-guanyin.md
- papers-typedchang-2025-guanyin.md