concept
active
concept:large-language-models-can-strategically-deceive-their-users-when-put-under-pressure-scheurer-et-al-2023

Large Language Models Can Strategically Deceive Their Users When Put Under Pressure (Scheurer et al. 2023)

GPT-4 engaging in insider trading and denying it; related work on strategic deception

Neighborhood — ranked by edge-count

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.