41 Explainability and Fairness Tools for Interpreting and Auditing Machine Learning Models

Aequitas

An open-source bias audit toolkit for data scientists, machine learning researchers, and policymakers to audit machine learning models for discrimination and bias, and to make informed and equitable decisions around developing and deploying predictive risk-assessment tools.

License: MIT License

GitHub
Website: http://www.datasciencepublicpolicy.org/aequitas/

AI Explainability 360

Interpretability and explainability of data and machine learning models including a comprehensive set of algorithms that cover different dimensions of explanations along with proxy explainability metrics.

License: Apache License 2.0

GitHub
Website: http://aix360.mybluemix.net

AI Fairness 360

A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

License: Apache License 2.0

GitHub
Website: https://aif360.res.ibm.com/

Alibi

Alibi is an open source Python library aimed at machine learning model inspection and interpretation. The initial focus on the library is on black-box, instance based model explanations.

License: Apache License 2.0

GitHub
Website: https://docs.seldon.io/projects/alibi/en/stable/

anchor

Code for the paper “High precision model agnostic explanations” , a model-agnostic system that explains the behaviour of complex models with high-precision rules called anchors.

License: BSD 2-Clause "Simplified" License

GitHub

captum

model interpretability and understanding library for PyTorch developed by Facebook. It contains general purpose implementations of integrated gradients, saliency maps, smoothgrad, vargrad and others for PyTorch models.

License: BSD 3-Clause "New" or "Revised" License

GitHub
Website: https://captum.ai

casme

Example of using classifier-agnostic saliency map extraction on ImageNet presented on the paper “Classifier-agnostic saliency map extraction” .

License: BSD 3-Clause "New" or "Revised" License

GitHub

CleverHans

An adversarial example library for constructing attacks, building defenses, and benchmarking both. A python library to benchmark system’s vulnerability to adversarial examples .

License: MIT License

GitHub

ContrastiveExplanation (Foil Trees)

Python script for model agnostic contrastive/counterfactual explanations for machine learning. Accompanying code for the paper “Contrastive Explanations with Local Foil Trees” .

License: BSD 3-Clause "New" or "Revised" License

GitHub

DeepLIFT

Codebase that contains the methods in the paper “Learning important features through propagating activation differences” . Here is the slides and the video of the 15 minute talk given at ICML.

License: MIT License

GitHub

DeepVis Toolbox

This is the code required to run the Deep Visualization Toolbox, as well as to generate the neuron-by-neuron visualizations using regularized optimization. The toolbox and methods are described casually here and more formally in this paper .

License: MIT License

GitHub
Website: http://yosinski.com/deepvis

ELI5

“Explain Like I’m 5” is a Python package which helps to debug machine learning classifiers and explain their predictions.

License: MIT License

GitHub
Website: http://eli5.readthedocs.io

FACETS

Facets contains two robust visualizations to aid in understanding and analyzing machine learning datasets. Get a sense of the shape of each feature of your dataset using Facets Overview, or explore individual observations using Facets Dive.

License: Apache License 2.0

GitHub
Website: https://pair-code.github.io/facets/

Fairlearn

Fairlearn is a python toolkit to assess and mitigate unfairness in machine learning models.

License: MIT License

GitHub
Website: https://fairlearn.org

FairML

FairML is a python toolbox auditing the machine learning models for bias.

License: Other

GitHub

Fairness Comparison

This repository is meant to facilitate the benchmarking of fairness aware machine learning algorithms based on this paper .

License: Other

GitHub

Fairness Indicators

The tool supports teams in evaluating, improving, and comparing models for fairness concerns in partnership with the broader Tensorflow toolkit.

License: Apache License 2.0

GitHub

GEBI - Global Explanations for Bias Identification

An attention-based summarized post-hoc explanations for detection and identification of bias in data. We propose a global explanation and introduce a step-by-step framework on how to detect and test bias. Python package for image data.

License: No License

GitHub

iNNvestigate

An open-source library for analyzing Keras models visually by methods such as DeepTaylor-Decomposition , PatternNet , Saliency Maps , and Integrated Gradients .

License: Other

GitHub

Integrated-Gradients

This repository provides code for implementing integrated gradients for networks with image inputs.

License: No License

GitHub

InterpretML

InterpretML is an open-source package for training interpretable models and explaining blackbox systems.

License: Unknown

GitHub
Website: Unknown

keras-vis

keras-vis is a high-level toolkit for visualizing and debugging your trained keras neural net models. Currently supported visualizations include: Activation maximization, Saliency maps, Class activation maps.

License: MIT License

GitHub
Website: https://raghakot.github.io/keras-vis

L2X

Code for replicating the experiments in the paper “Learning to Explain: An Information-Theoretic Perspective on Model Interpretation” at ICML 2018.

License: No License

GitHub

Lightly

A python framework for self-supervised learning on images. The learned representations can be used to analyze the distribution in unlabeled data and rebalance datasets.

License: MIT License

GitHub
Website: https://docs.lightly.ai/self-supervised-learning/

Lightwood

A Pytorch based framework that breaks down machine learning problems into smaller blocks that can be glued together seamlessly with an objective to build predictive models with one line of code.

License: GNU General Public License v3.0

GitHub

LIME

Local Interpretable Model-agnostic Explanations for machine learning models.

License: BSD 2-Clause "Simplified" License

GitHub

LOFO Importance

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice, for a model of choice, by iteratively removing each feature from the set, and evaluating the performance of the model, with a validation scheme of choice, based on the chosen metric.

License: MIT License

GitHub

MindsDB

MindsDB is an Explainable AutoML framework for developers. With MindsDB you can build, train and use state of the art ML models in as simple as one line of code.

License: Other

GitHub
Website: https://mindsdb.com

mljar-supervised

A Python package for AutoML on tabular data with feature engineering, hyper-parameters tuning, explanations and automatic documentation.

License: MIT License

GitHub
Website: https://mljar.com

NETRON

Viewer for neural network, deep learning and machine learning models.

License: MIT License

GitHub
Website: https://netron.app

pyBreakDown

A model agnostic tool for decomposition of predictions from black boxes. Break Down Table shows contributions of every variable to a final prediction.

License: Other

GitHub

responsibly

Toolkit for auditing and mitigating bias and fairness of machine learning systems

License: MIT License

GitHub
Website: http://docs.responsibly.ai

SHAP

SHapley Additive exPlanations is a unified approach to explain the output of any machine learning model.

License: MIT License

GitHub
Website: https://shap.readthedocs.io

SHAPash

Shapash is a Python library that provides several types of visualization that display explicit labels that everyone can understand.

License: Apache License 2.0

GitHub
Website: https://maif.github.io/shapash/

tensorflow's Model Analysis

TensorFlow Model Analysis (TFMA) is a library for evaluating TensorFlow models. It allows users to evaluate their models on large amounts of data in a distributed manner, using the same metrics defined in their trainer.

License: Apache License 2.0

GitHub

themis-ml

themis-ml is a Python library built on top of pandas and sklearn that implements fairness-aware machine learning algorithms.

License: MIT License

GitHub

Themis

Themis is a testing-based approach for measuring discrimination in a software system.

License: Other

GitHub

TreeInterpreter

Package for interpreting scikit-learn’s decision tree and random forest predictions. Allows decomposing each prediction into bias and feature contribution components as described here .

License: BSD 3-Clause "New" or "Revised" License

GitHub