Accumulated local effects

Accumulated local effects (ALE) is a machine learning interpretability method.[1]

Concepts

ALE uses a conditional feature distribution as an input and generates augmented data, creating more realistic data than a marginal distribution.[2]

It ignores far out-of-distribution (outlier) values.[1] Unlike partial dependence plots and marginal plots, ALE is not defeated in the presence of correlated predictors.[3]

It analyzes differences in predictions instead of averaging them by calculating the average of the differences in model predictions over the augmented data, instead of the average of the predictions themselves.[2]

Example

Given a model that predicts house prices based on its distance from city center and size of the building area, ALE compares the differences of predictions of houses of different sizes. The result separates the impact of the size from otherwise correlated features.[1]

Limitations

Defining evaluation windows is subjective. High correlations between features can defeat the technique.[1][3] ALE requires more and more uniformly distributed observations than PDP so that the conditional distribution can be reliably determined. The technique may produce inadequate results if the data is highly sparse, which is more common with high-dimensional data (curse of dimensionality).[2]

See also

References

  1. Rodriguez, Jesus. "Edge 255: Interpretability Methods: Accumulated Local Effects (ALE)". thesequence.substack.com. Retrieved 2022-12-27.
  2. "Accumulated Local Effects — ADS 2.7.0 documentation". docs.oracle.com. Retrieved 2022-12-31.
  3. Gupta, Aman (2020-10-25). "Accumulated Local Effects (ALE)-Feature Importance Technique". Analytics Vidhya. Retrieved 2022-12-27.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.