← Back to Machine Learning
cs.LG

How to pick the smartest questions when estimating feature importance is expensive?

David Rundel, Fabian Fumagalli, Maximilian Muschalik, Bernd Bischl, Matthias Feurer

June 1, 2026

Computing Shapley values—a gold standard for explaining AI decisions—requires evaluating thousands of feature combinations, which becomes prohibitive when each evaluation is expensive (retraining models, testing hyperparameters). ShaplEIG uses a Gaussian process to learn patterns from initial evaluations, then intelligently picks which combinations to test next by maximizing information gain. A mathematical trick using symmetric polynomials makes this feasible without exponential blowup. Experiments show it cuts sample costs substantially when budget is tight.
Published as ShaplEIG: Bayesian Experimental Design for Shapley Value Estimation arXiv:2606.02247
Read the original paper →