Skip to content

Conversation

@hbaniecki
Copy link
Collaborator

@hbaniecki hbaniecki commented Nov 26, 2025

Introducing CTEImputer

closes #225

Adds the CTEImputer following the compress then explain (CTE) methodology. It replaces missing features of the explanation point by values sampled from the background data, which is first subsampled using a distribution compression algorithm, specifically Compress++ with Kernel Thinning. CTE has shown to provide accurate and stable estimates of explanations while being computationally efficient. It is a new default imputer in TabularExplainer, removing the necessity to set sample_size.

TODO

  • add tests
  • fix Windows CI to include some missing C++ packages

@hbaniecki hbaniecki requested a review from mmschlk November 26, 2025 13:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

Include Compress Then Explain (CTE) in the Explainer / Imputer interface

2 participants