artifact
active
concept-probe Python library (github.com/mneuronico/concept-probe)
Open-source Python library released with the paper. It supports probe training, multi-probe scoring, activation steering, and logit extraction.
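To make probe training, scoring, and steering concrete, here is a minimal NumPy sketch on synthetic activations. It does not use the concept-probe library or its API. The function names (`train_probe`, `score`, `steer`) are hypothetical, and the difference-of-means probe is an assumed stand-in technique, not necessarily the method the library implements.

```python
import numpy as np

def train_probe(pos_acts, neg_acts):
    """Hypothetical helper: difference-of-means probe, normalized to unit length."""
    direction = pos_acts.mean(axis=0) - neg_acts.mean(axis=0)
    return direction / np.linalg.norm(direction)

def score(acts, probe):
    """Project each activation vector onto the probe direction."""
    return acts @ probe

def steer(acts, probe, alpha):
    """Shift activations by alpha along the probe direction."""
    return acts + alpha * probe

# Synthetic stand-in for hidden states: the concept lives along axis 0.
rng = np.random.default_rng(0)
dim = 16
concept = np.zeros(dim)
concept[0] = 1.0
pos = rng.normal(size=(200, dim)) + 2.0 * concept  # concept present
neg = rng.normal(size=(200, dim)) - 2.0 * concept  # concept absent

probe = train_probe(pos, neg)
base_scores = score(neg, probe)
steered_scores = score(steer(neg, probe, alpha=3.0), probe)
```

Because the probe has unit length, steering by `alpha` raises every probe score by exactly `alpha`. In a real model the activations would come from a chosen layer rather than a random generator, and the effect of steering would be measured on the model's outputs.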
Neighborhood — ranked by edge-count
Papers (1)
paper
Frameworks (1)
framework
- Quantitative Introspection Framework (relation: implements): the paper's central contribution, which treats LLM numeric self-report as a quantitative signal validated against probe-defined internal states, with causal confirmation via steering.