CodePudding user response:
Assume the data set has two classes, labeled 1 and 0: class 1 has 30 samples and class 0 has 10. The hold-out method and cross-validation are not well suited to this classification problem, because splitting leaves too little data for both training and testing, and the class ratio is imbalanced, so the evaluation would be very unreliable. The bootstrap method, by contrast, samples with replacement. For a data set of m samples (say m = 100), drawing m times gives a training set of m samples that contains roughly 63 distinct originals, and the roughly 37 samples that were never drawn form the test set. The training set therefore stays the same size as the original data set instead of shrinking as it does in a split, so the trained model should generalize better.
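To make the bootstrap split concrete, here is a minimal sketch using only the Python standard library. The function name `bootstrap_split`, the `seed` parameter, and the `labels` list are hypothetical and only illustrate the 30/10 data set described above. This is one way to do it, not a definitive implementation.

```python
import random

def bootstrap_split(labels, seed=0):
    """Draw len(labels) indices with replacement.

    Returns (train_idx, oob_idx): the drawn indices (duplicates allowed)
    and the out-of-bag indices that were never drawn.
    """
    rng = random.Random(seed)
    m = len(labels)
    # Sampling with replacement: the same index may appear several times.
    train_idx = [rng.randrange(m) for _ in range(m)]
    drawn = set(train_idx)
    # Samples that were never drawn become the test set.
    oob_idx = [i for i in range(m) if i not in drawn]
    return train_idx, oob_idx

# Assumed toy data matching the example: 30 samples of class 1, 10 of class 0.
labels = [1] * 30 + [0] * 10
train_idx, oob_idx = bootstrap_split(labels, seed=0)

print("training set size:", len(train_idx))
print("distinct training samples:", len(set(train_idx)))
print("out-of-bag test samples:", len(oob_idx))
# Probability that a given sample is never drawn in m draws: (1 - 1/m)^m.
print("expected out-of-bag fraction:", (1 - 1 / len(labels)) ** len(labels))
```

The training set always has as many entries as the original data set, while the out-of-bag count varies from run to run. With so few class 0 samples, it is worth checking that both classes actually appear in the out-of-bag set before trusting the test result.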