Confusion matrix: ValueError: Classification metrics can't handle a mix of multiclass and conti-CodePudding

I'm working on a multiclass classification problem (12 classes), but I can't create the confusion matrix. What I'm trying to do is:

from sklearn.metrics import multilabel_confusion_matrix
pred = model_BiLSTM.predict(X_val)
y_unique = np.unique(y_val)
mcm =confusion_matrix(y_val, pred, labels = y_unique)

But as per the title, the returned error is: "ValueError: Classification metrics can't handle a mix of multiclass and continuous-multioutput targets", and the same happens when using confusion_matrix().

Instead, this is the output of

y_val, pred, y_val.shape, pred.shape

16/16 [==============================] - 0s 6ms/step
(array([11,  8,  3,  3,  9,  9,  9,  3,  3,  1,  3,  6,  6,  6,  2,  9,  1,
         3, 10,  6,  9,  2,  9,  4,  3,  9,  6,  9,  3,  3,  3,  3,  8,  1,
         9,  9,  9,  9,  2,  1,  9,  3,  9,  3,  9,  9,  2, 10,  3,  2,  9,
         3,  8,  6,  8,  9,  9,  6,  6,  9,  8,  7,  9,  2,  9,  9,  2,  3,
         9,  9,  2,  6,  3,  7, 11,  9,  9,  2, 11,  6,  7, 11,  6,  9,  2,
         6,  9,  2,  9,  9,  3,  6,  9,  1, 11,  4,  4,  2,  6,  2,  9,  3,
        10,  3,  2,  9,  9, 10,  6,  3,  9,  9,  6,  8,  6,  9,  4,  6,  5,
         9,  6,  6,  3,  3,  3,  9,  6,  2,  9, 11,  6,  9,  3,  9,  6,  9,
         2,  9,  3,  9,  6,  1,  6,  9,  9,  8,  3,  2,  9,  2,  8,  9,  9,
         3, 10,  2,  4,  9,  9,  2,  8,  3,  6,  9,  9,  9,  6,  2,  2,  9,
         7,  9,  3,  6,  7,  2,  9,  2,  9, 10,  9,  2,  9,  5,  7,  6,  5,
         6,  3,  9,  7,  9,  3, 11,  5,  3,  3,  0,  8,  3,  9,  5,  9, 10,
         9,  3,  3, 11,  2,  1,  8,  6,  2,  9,  3,  6,  2,  8,  9,  2,  8,
         3,  3,  9,  6,  2,  1,  9,  9,  2,  2, 10,  9,  1,  6,  9,  9,  2,
         9,  5,  6,  3,  9,  7,  6,  9,  9,  6,  3,  3,  2,  3,  9,  6,  9,
         9,  9,  9,  9,  9,  9,  2,  9,  9,  9,  9,  3,  1,  6,  6,  6, 10,
         9,  9,  9,  2,  9,  2,  3,  9,  9, 10,  6, 10,  6,  1,  1,  6,  1,
         6,  3,  4,  6,  1, 11,  3,  9,  9,  4,  9,  9,  5,  8,  3,  5,  9,
         6,  9,  9,  9,  3,  2,  5,  1,  9,  6,  1,  3,  9,  5,  3,  9,  6,
         7,  2,  9,  9,  3,  5,  9,  6,  6,  3,  1,  9,  3,  9,  7,  9,  9,
         7,  9, 10,  2,  9,  4,  9,  9,  9,  3,  9,  5,  3,  2,  9,  3,  8,
         2,  9,  3, 11,  3,  3,  9,  2, 10,  9,  3,  9,  1,  9, 10,  5,  1,
         2,  5,  9,  2,  3,  5,  9,  4,  8,  9,  5, 10,  3,  2,  6,  3,  6,
        10, 11,  3,  6,  9,  3,  3,  0,  6,  7,  8,  9,  6,  5,  3,  1,  9,
         2,  9,  9,  5,  2,  5,  3,  6, 11,  2,  9,  3,  6,  2,  9,  9,  3,
         3,  3,  5,  6,  4,  4,  9,  2,  2,  5,  1,  0,  9,  4,  3,  3,  9,
         9,  5,  2,  2,  2,  5,  7,  9,  3,  9,  9,  1,  2,  9,  6,  8,  2,
         3,  4,  2,  3,  3,  2,  6,  5,  9,  5,  2,  2,  2,  9,  9,  6,  2,
         9,  4,  9,  9,  2,  3,  8, 11,  9,  9], dtype=int32),
 array([[5.7949889e-03, 2.5301890e-03, 5.9659913e-05, ..., 2.7534673e-03,
         1.8798949e-03, 4.0977496e-01],
        [2.1629781e-04, 1.0219574e-02, 1.2285617e-03, ..., 4.0498661e-04,
         3.6948815e-04, 8.3618681e-04],
        [8.1547890e-03, 1.1354284e-04, 1.3678521e-04, ..., 3.6535120e-01,
         1.1546685e-03, 3.5349184e-03],
        ...,
        [1.1976730e-03, 6.8558909e-02, 8.7605380e-03, ..., 1.7384565e-01,
         5.4570078e-04, 2.0005915e-02],
        [2.1097453e-02, 7.7744485e-03, 2.5690982e-01, ..., 5.4854238e-01,
         3.9467164e-03, 1.6034273e-02],
        [2.0812787e-03, 1.6885218e-05, 4.7070305e-05, ..., 4.3611538e-01,
         3.6522493e-04, 1.4385413e-02]], dtype=float32),
 (486,),
 (486, 12))

If it helps, this is my model:

def build_BiLSTM_classifier(input_shape, classes):
    input_layer = tfkl.Input(shape=input_shape, name='Input')
    bilstm = tfkl.Bidirectional(tfkl.LSTM(128, return_sequences=True))(input_layer)
    bilstm = tfkl.Bidirectional(tfkl.LSTM(128))(bilstm)
    dropout = tfkl.Dropout(.5, seed=seed)(bilstm)

    classifier = tfkl.Dense(128, activation='tanh')(dropout)
    output_layer = tfkl.Dense(classes, activation='softmax')(classifier)

    model = tfk.Model(inputs=input_layer, outputs=output_layer, name='model')

    model.compile(loss=tfk.losses.SparseCategoricalCrossentropy(), optimizer=tfk.optimizers.Adam(), metrics='accuracy')
    return model

What could I try to do?

CodePudding user response：

Your pred array appears to contain class scores rather than predictions. Try pred.argmax(axis=1) instead.