finding
active
finding:four-features-a-0-20-a-0-0-a-0-30-a-0-494-form-an-fsa-like-system-implementing-html-tag-generation

Four features (A/0/20, A/0/0, A/0/30, A/0/494) form an FSA-like system implementing HTML tag generation

Concrete example of features connecting into FSA-like system implementing complex behavior

Source paper

extracted_from
Towards Safe and Honest AI Agents with Neural Self-Other Overlap
(2024) · Marc Carauleanu · Michael Vaiana · Judd Rosenblatt · Cameron Berg +1

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Collections of features that interact via the token stream — one feature increases probability of tokens that activate the next feature — forming FSA-like systems

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.