method
active
method:ols-linear-regression-fit-to-alpha-trendsOLS Linear Regression Fit to Alpha Trends
OLS regression fitted to mu(alpha) trends to assess near-linearity of steering with alpha coefficient
Neighborhood — ranked by edge-count
Claims (1)
claim
- Theoretical alignment claim backed by OLS R2 analysis showing 96.15% of trends have R2>=0.75
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Demonstrates alignment with Linear Representation Hypothesis: target trait steers approximately linearly with alpha
- Control comparison showing near-linearity is specific to the targeted manipulation direction
- Primary statistical model with random intercept by conversation, REML estimation, for pooled conversation-turn observations
- Causal confirmation that coupling between self-report and internal state is genuine; steering toward positive pole increases report
- Second model system studied; used to show why flat autoregressive LLMs struggle with long-range coherence.
- Fits a non-decreasing function and computes R² = 1 - SSres/SStot to quantify introspective fidelity without assuming linearity
- Justification for the linear combination
- Large language models without hierarchical structure, challenged by long sequences