concept
active
concept:simplicity-biasSimplicity Bias
The tendency of deep networks to implicitly favor simpler solutions that fit the data, driving convergence
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Representational ConvergencesupportsThe central empirical phenomenon: different neural networks trained on different data/objectives develop increasingly similar representations
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Deep networks are biased toward finding simple fits to data, and this bias increases with model size, driving convergence
- Features related to gender, racial, ethnic biases, slurs, and hate speech.
- Assumptions or preferences (e.g., parsimony) that determine how a learning system generalizes beyond training data
- Researcher preferences and goals of mimicking human reasoning shape model development, potentially causing convergence toward human-like representations
- Problem cited as a limitation of current LLMs; PRH predicts larger models should amplify bias less
- The chapter's foundational question.
- Architectural modification subtracting a learned bias from autoencoder inputs before encoding; initialized to geometric median of dataset; improves autoencoder performance
- Adapted control task metric measuring difference between odds-ratio on original task and arbitrary-label control task