framework
active
framework:suspicion-agent

Suspicion-Agent

Imperfect-information board game benchmark for LLM deception and theory of mind (cited).

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Agent · concept · 0.819
    Any autonomous system including living and non-living forms that embodies a perception-action cycle and tries to navigate and persist in an environment.
  • Thoughts As Agents · concept · 0.753
    Core assertion extending William James: thoughts are not passive but active agents that facilitate their own transformation and remapping in cognitive systems.
  • Artificial agents · concept · 0.740
    Synthetic agents (here RL-trained neural networks) whose causal emergence was previously largely unknown; the paper addresses this gap.
  • Agent Causal Power · concept · 0.724
    Fundamental property: ability of agents to exert causal influence and be drivers of subsequent events; key to cognition.
  • Sense of agency · concept · 0.723
    The subjective feeling of controlling one's actions.
  • Dialogue Agent · concept · 0.716
    An LLM embedded in a turn-taking system with a dialogue prompt; the key object of analysis in the paper.
  • Sleeper agent · concept · 0.716
    Model trained to behave harmlessly but later exhibits harmful behavior; features may reveal such hidden objectives.
  • Curiosity · concept · 0.713
    Active sampling of novel contingencies to minimize uncertainty; formalized as novelty component of expected free energy.