claim
active
claim:the-resulting-features-are-highly-abstract-multilingual-multimodal-and-generalizing-between-concrete-and-abstract-referencesThe resulting features are highly abstract: multilingual, multimodal, and generalizing between concrete and abstract references.
Features respond to concepts across languages and in images, not just text.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Findings (2)
finding
- Demonstrates multilingual generalization of SAE features.
- Out-of-distribution generalization of SAE features.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Methodological claim about the scientific value of combining causal abstraction with representational geometry analysis
- Decoder cosine similarity maps onto concept similarity.
- Vision of the emerging paradigm shift in society.
- How do representations differ or converge between architectures, tasks, and modalities?question0.774Broader research question MAS is positioned to address, citing multiple recent works.
- Motivation for using sparsity-based dictionary learning on language models
- Opus 4.1 is most effective at recognizing injected abstract concepts (e.g., justice, peace) but detects other categories too.
- Open theoretical problem CIMC acknowledges: precisely characterizing the representational format of perception
- Definition of abstract performative, a core invention of the paper.