Home > Enterprise >  Effect of a column of data on the final classification result
Effect of a column of data on the final classification result

Time:02-18

I'am ML newbie but I decided to create a program in ML.NET for data classification. However, my dataset has a very large number of columns, so I'll use another example.

There is an iris database with five columns:

  • sepalLength (number)
  • sepalWidth (number)
  • petalWidth (number)
  • petalWidth (number)
  • irisClass (predicted -> Iris-setosa/Iris-versicolor/Iris-virginica)

Is it possible to investigate to what extent a given data column influences the classification of a flower to a given type of iris? For example, the value of the sepalLength column influences the result by 60%, while the value of the petalWidth column influences the result by 5%.

CodePudding user response:

I think what you're looking for is either permutation feature importance or feature contribution calculation

  • Related