concept
active
concept:k-valuesK values
In attention, key vectors that advertise 'where in the future should look here?'
Neighborhood — ranked by edge-count
Papers (1)
paper
Artifacts (1)
artifact
- Original thread by janus explaining transformer information highways and introspection capabilities, posted on X.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Janus's interpretive claim about key vectors.
- In attention, query vectors that ask 'where in the past should I look?' given the current state.
- Probability of sensory input expected by an agent, aligning value maximization with surprise minimization.
- A dynamic programming method for computing optimal value functions and policies in known MDPs.
- In attention, value vectors that carry the information future positions should receive.
- Unsupervised feature-finding method using cluster centroid difference as feature direction
- Expected information gain about hidden states; drives curiosity and novelty-seeking; mutual information term in expected free energy.
- Negative of value, equated with free-energy and surprise.