A Nested Genetic Algorithm for Explaining Classification Data Sets with Decision Rules

08/23/2022
by   Paul-Amaury Matt, et al.
0

Our goal in this paper is to automatically extract a set of decision rules (rule set) that best explains a classification data set. First, a large set of decision rules is extracted from a set of decision trees trained on the data set. The rule set should be concise, accurate, have a maximum coverage and minimum number of inconsistencies. This problem can be formalized as a modified version of the weighted budgeted maximum coverage problem, known to be NP-hard. To solve the combinatorial optimization problem efficiently, we introduce a nested genetic algorithm which we then use to derive explanations for ten public data sets.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset