thinker
active
thinker:johannes-von-oswald

Johannes von Oswald

Transformers learn in-context by gradient descent.

Authored
0
Introduces
1
Studies
0
Affiliations
0
Cited by
0

More papers — OpenAlex / S2

Recent mentions (4)