concept
active
concept:bias-amplificationBias Amplification
Problem cited as a limitation of current LLMs; PRH predicts larger models should amplify bias less
Neighborhood — ranked by edge-count
Claims (1)
claim
- Scaling may reduce hallucination and certain kinds of bias as models converge toward an accurate model of realityassociated_withImplication of PRH: larger models should amplify bias less and hallucinate less if they better model reality
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Assumptions or preferences (e.g., parsimony) that determine how a learning system generalizes beyond training data
- Implication of PRH for AI fairness and bias
- Features related to gender, racial, ethnic biases, slurs, and hate speech.
- Architectural modification subtracting a learned bias from autoencoder inputs before encoding; initialized to geometric median of dataset; improves autoencoder performance
- Deep networks are biased toward finding simple fits to data, and this bias increases with model size, driving convergence
- The idea of using machines/systems to magnify human intellectual capability, early AI concept tied to Alexander's Notes.
- Metric for intervention effectiveness: 0 = ineffective, 1 = full flip of model output from false to true or vice versa