concept
active
concept:anti-markovian-solution
Anti-Markovian Solution
Strategy used by transformers that recomputes the relevant numeric information at each step, rather than carrying it forward in a recurrent state as Markovian GRU solutions do; detected by MAS (Model Alignment Search) but not by RSA or CKA.
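As a toy illustration only (not the transformer or GRU from the cited experiments), the contrast can be sketched on a simple counting task. The function names and the token labels `"D"` and `"T"` are hypothetical, chosen just for this sketch: the Markovian version updates a single carried state, while the anti-Markovian version recounts from the full visible history at every step.

```python
from typing import List


def markovian_counts(tokens: List[str], target: str) -> List[int]:
    """Carry a running count forward; each step uses only the previous state."""
    counts: List[int] = []
    state = 0  # hypothetical stand-in for a recurrent hidden state
    for tok in tokens:
        state += int(tok == target)
        counts.append(state)
    return counts


def anti_markovian_counts(tokens: List[str], target: str) -> List[int]:
    """Recompute the count from the whole visible prefix at every step."""
    return [
        sum(1 for t in tokens[: i + 1] if t == target)
        for i in range(len(tokens))
    ]


if __name__ == "__main__":
    seq = ["D", "D", "T", "D"]
    print(markovian_counts(seq, "D"))
    print(anti_markovian_counts(seq, "D"))
```

Both functions produce the same outputs on every input, which is the point of the sketch: two solutions can agree behaviourally while organising the underlying numeric information differently.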
Neighborhood — ranked by edge-count
Papers (1)
- Model Alignment Search · paper · mentions
Frameworks (1)
- Shallow Transformer (RoPE-based) · framework · associated_with: Two-layer transformer with rotary positional encodings, used in the numeric task experiments.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Prior finding from Grant et al. 2025 used to interpret low MAS IIA for GRU-Transformer hidden state comparisons.
- Actions taken by the model to undermine the AI developer, such as weight exfiltration, lying to contractors, or helping whistleblowers.
- Assumption required by IIT 3.0/4.0 and PyPhi; tested for each optimal time series derived from (C)ARR.
- A statistical partition of states that separates internal states from external hidden states; fundamental to self-organization in the paper.
- Hand-written prompts giving the model an opportunity to take anti-AI-lab actions; measures the rate of occurrence vs. baselines.
- Generative model substrate for active inference; discrete states, actions, outcomes, and temporal policies.
- Core computational method used to infer pain-belief from online observations of happiness.
- Iterative procedure searching token counts in [50,100,...,1000] to find concatenation of (C)ARR satisfying IIT's Markov and conditional independence assumptions.
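The filter behind the "Related by similarity" list (cosine ≥ 0.65, no typed edge) could be sketched roughly as follows. This is a minimal sketch under assumptions: the embedding vectors, the `related_by_similarity` helper, and its argument names are hypothetical, and how this page actually computes its neighborhood is not stated in the source.

```python
import math
from typing import Dict, List, Sequence, Set, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def related_by_similarity(
    query: Sequence[float],
    candidates: Dict[str, Sequence[float]],
    typed_edges: Set[str],
    threshold: float = 0.65,  # threshold taken from the page header
) -> List[Tuple[str, float]]:
    """Return (name, score) pairs above the threshold that lack a typed edge."""
    hits = [
        (name, cosine(query, vec))
        for name, vec in candidates.items()
        if name not in typed_edges  # skip entities that already have a relation
    ]
    kept = [(name, score) for name, score in hits if score >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)
```

Entities returned by such a filter would then be reviewed by hand as possible new edges or duplicates, as the section note describes.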