finding
active

Middle-to-late layers (39-50) of QwQ-32B show consistently stable and high LAT classification performance across all datasets

A layer-wise analysis showing which network depths best encode the semantics of strategic deception.
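As a rough illustration of what a layer-wise comparison of this kind involves, the sketch below fits one linear reading vector per layer and scores it on held-out examples. It assumes LAT here means a linear reading-vector method in the representation-engineering style (top singular direction of paired activation differences); the source does not spell this out. Everything in it is hypothetical: the activations are synthetic, the per-layer signal strengths in `signals` are made up, and the six toy layers do not correspond to QwQ-32B's layers or to the paper's results.

```python
import numpy as np

def reading_vector(pos, neg):
    """Top singular direction of paired activation differences (uncentered)."""
    diffs = pos - neg
    _, _, vt = np.linalg.svd(diffs, full_matrices=False)
    v = vt[0]
    # Orient the vector so the positive class projects higher on average.
    if np.mean(diffs @ v) < 0:
        v = -v
    return v

def layer_accuracy(train_pos, train_neg, test_pos, test_neg):
    """Fit a reading vector on the training pairs, then classify held-out
    activations by thresholding their projection onto it."""
    v = reading_vector(train_pos, train_neg)
    # Threshold at the midpoint of the two training class means.
    threshold = 0.5 * ((train_pos @ v).mean() + (train_neg @ v).mean())
    correct = np.sum(test_pos @ v > threshold) + np.sum(test_neg @ v <= threshold)
    return correct / (len(test_pos) + len(test_neg))

def synthetic_layer(rng, signal, n=200, dim=32):
    """Hypothetical stand-in for one layer's activations: Gaussian noise plus
    a class-dependent shift of size `signal` along a single axis."""
    concept = np.zeros(dim)
    concept[0] = 1.0
    neg = rng.normal(size=(n, dim))
    pos = rng.normal(size=(n, dim)) + signal * concept
    return pos, neg

rng = np.random.default_rng(0)
# Made-up signal strength per toy layer; NOT measurements from any model.
signals = [0.0, 0.5, 1.0, 3.0, 3.0, 2.0]

accuracies = []
for signal in signals:
    pos, neg = synthetic_layer(rng, signal)
    half = len(pos) // 2
    accuracies.append(
        layer_accuracy(pos[:half], neg[:half], pos[half:], neg[half:])
    )

for layer, acc in enumerate(accuracies):
    print(f"toy layer {layer}: held-out accuracy = {acc:.2f}")
```

With real data, `synthetic_layer` would be replaced by hidden states extracted at each layer for paired deceptive and honest prompts, and the per-layer accuracies compared across datasets to look for a stable high-accuracy band.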

Source paper

extracted_from
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models
(2025) · Kai Wang · Yihao Zhang · Meng Sun
