claim
active
claim:feature-universality-across-independently-trained-models-suggests-features-have-some-existence-beyond-individual-models

Feature universality across independently trained models suggests features have some existence beyond individual models

Authors take agnostic position on ontological status but universality evidence pushes toward features being real

Source paper

extracted_from
Towards Safe and Honest AI Agents with Neural Self-Other Overlap
(2024) · Marc Carauleanu · Michael Vaiana · Judd Rosenblatt · Cameron Berg +1

Neighborhood — ranked by edge-count

Findings (4)

finding

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.