/var/log · marcus chiu

Mechanistic Interpretability (Mech Interp - Mechinterp - MI) Research

Created on Dec 15, 2023 · Last Modified on Sep 02, 2025

Mechanistic Interpretability (Mech Interp - Mechinterp - MI) Research
  • is a subfield of research within explainable artificial intelligence
  • aims to understand the internal workings of ML models (especially neural networks) by:
    • analyzing the mechanisms present in their computations
    • understanding how individual components contribute to the model's overall behavior
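
One simple way to probe how individual components contribute to overall behavior is zero ablation: switch a component off and measure how much the output changes. The sketch below is illustrative only and is not taken from this note. The 2-3-1 ReLU network, its hand-set weights (chosen so that it computes XOR), and the helper names `forward` and `ablation_effects` are all hypothetical.

```python
from itertools import product

# Hypothetical hand-set weights (an assumption for illustration):
# a 2-3-1 ReLU network that computes XOR on binary inputs.
W_HIDDEN = [(1.0, 1.0), (1.0, 1.0), (1.0, -1.0)]  # one (w_x1, w_x2) pair per hidden unit
B_HIDDEN = [0.0, -1.0, 0.0]                       # hidden-unit biases
W_OUT = [1.0, -2.0, 0.0]                          # output weight per hidden unit


def forward(x, ablate=None):
    """Run the network; if `ablate` is a hidden-unit index, zero that unit's activation."""
    hidden = []
    for i, ((w1, w2), b) in enumerate(zip(W_HIDDEN, B_HIDDEN)):
        act = max(0.0, w1 * x[0] + w2 * x[1] + b)  # ReLU
        hidden.append(0.0 if i == ablate else act)
    return sum(w * h for w, h in zip(W_OUT, hidden))


def ablation_effects():
    """Mean absolute change in output over all binary inputs, per ablated hidden unit."""
    inputs = list(product([0.0, 1.0], repeat=2))
    effects = []
    for unit in range(len(W_OUT)):
        diffs = [abs(forward(x) - forward(x, ablate=unit)) for x in inputs]
        effects.append(sum(diffs) / len(diffs))
    return effects


if __name__ == "__main__":
    for unit, effect in enumerate(ablation_effects()):
        print(f"hidden unit {unit}: mean |change in output| = {effect:.2f}")
```

In this toy setup, ablating unit 0 or unit 1 changes the output, while unit 2 has an output weight of zero and so has no effect. Real models need more careful variants (for example, mean ablation or activation patching), since zeroing an activation can push a network far outside the range of inputs it normally sees.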

Types

  • LLM Interpretability