framework
active
framework:suspicion-agentSuspicion-Agent
Imperfect-information board game benchmark for LLM deception and theory of mind, cited.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Any autonomous system including living and non-living forms that embodies a perception-action cycle and tries to navigate and persist in an environment
- Core assertion extending William James: thoughts are not passive but active agents that facilitate their own transformation and remapping in cognitive systems.
- Synthetic agents (here RL-trained neural networks) whose causal emergence was previously largely unknown; the paper addresses this gap.
- Fundamental property: ability of agents to exert causal influence and be drivers of subsequent events; key to cognition.
- The subjective feeling of controlling one's actions.
- An LLM embedded in a turn-taking system with a dialogue prompt; the key object of analysis in the paper
- Model trained to behave harmlessly but later exhibits harmful behavior; features may reveal such hidden objectives.
- Active sampling of novel contingencies to minimize uncertainty; formalized as novelty component of expected free energy