Explaining Model Behavior with Global Causal Analysis

Marcel Robeer*, Floris Bex, Ad Feelders, Henry Prakken

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

We present GLOBAL CAUSAL ANALYSIS (GCA) for text classification. GCA is a technique for global model-agnostic explainability drawing from well-established observational causal structure learning algorithms. GCA generates an explanatory graph from high-level human-interpretable features, revealing how these features affect each other and the black-box output. We show how these high-level features do not always have to be human-annotated, but can also be computationally inferred. Moreover, we discuss how the explanatory graph can be used for global model analysis in natural language processing (NLP): the graph shows the effect of different types of features on model behavior, whether these effects are causal effects or mere (spurious) correlations, and if and how different features interact. We then propose a three-step method for (semi-)automatically evaluating the quality, fidelity and stability of the GCA explanatory graph without requiring a ground truth. Finally, we provide a detailed GCA of a state-of-the-art NLP model, showing how setting a global one-versus-rest contrast can improve explanatory relevance, and demonstrating the utility of our three-step evaluation method.
Original languageEnglish
Title of host publicationExplainable Artificial Intelligence
Subtitle of host publicationFirst World Conference, xAI 2023, Lisbon, Portugal, July 26–28, 2023, Proceedings, Part I
EditorsLuca Longo
Place of PublicationCham
PublisherSpringer
Pages299–323
Number of pages25
Edition1
ISBN (Electronic)978-3-031-44064-9
ISBN (Print)978-3-031-44063-2
DOIs
Publication statusPublished - 30 Oct 2023

Publication series

NameCommunications in Computer and Information Science
PublisherSpringer
Volume1901
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Keywords

  • Causal explanation
  • Explainable Machine Learning (XML)
  • Model-agnostic explanation
  • Natural Language Processing (NLP)

Fingerprint

Dive into the research topics of 'Explaining Model Behavior with Global Causal Analysis'. Together they form a unique fingerprint.

Cite this