Custom SVM Polynomial kernel sklearn-CodePudding

I got asked as an assignment to develop a custom polynomial (degree = 3,4,5) kernel for SVM and compare its accuracy to the in-built poly kernel of the sklearnkit (should be almost the same) I tried to follow the polynomial kernel definition but my results seems not to be quite similar, this is my code:

def poly_kernel_fn(X, Y):
# Implement a polynomial kernel
#  - args: 2 numpy arrays of shape [n_samples, n_features]
#  - returns: computed kernel matrix of shape [n_samples, n_samples]

K = np.zeros((X.shape[0],Y.shape[0]))
K = (X.dot(Y.T)   1)**4
return K

clfpoly = svm.SVC(kernel='poly', degree=4)
clfpoly.fit(X_train, y_train)
zpoly = clfpoly.predict(X_test)
print("The accuracy with in-built 3D polynomial kernel is: ",accuracy_score(y_test, zpoly)*100,"%")

clf = svm.SVC(kernel=poly_kernel_fn)
clf.fit(X_train, y_train)
z = clf.predict(X_test)
print("The accuracy with custom rbf kernel is: ",accuracy_score(y_test, z)*100,"%")

The accuracy results is the following:

The accuracy with in-built 4D polynomial kernel is: 56.99999999999999 %
The accuracy with kernel is: 59.0 %

If I change the polynomial grade to 3 or 5 it changes even more, so I do not know if I am doing something wrong or simply is not possible to match the in-built accuracy.

Thanks for your help

CodePudding user response：

You have to look at the definition of the poly case at

coef = 0 and gamma = 1/(n_features*.var()) Then you can get the same

from sklearn.datasets import make_classification
from sklearn import svm
import numpy as np

gamma = None
   
def poly_kernel_fn(X, Y):
   
   K = np.zeros((X.shape[0],Y.shape[0]))
   K = (gamma*X.dot(Y.T))**4
   return K
   
   

if __name__=="__main__":
    
    X, Y = make_classification(10, 5)  # random data
    
    clf1 = svm.SVC(kernel='poly', degree=4) # built in function
    
    clf1.fit(X, Y)
    print("built in score  = ", clf1.score(X,Y))
    
    gamma = 1/(5*X.var())
    
    clf2 = svm.SVC(kernel=poly_kernel_fn)
    
    clf2.fit(X, Y)
    print("custom in score = ", clf2.score(X,Y))

In [9]: run main.py
built in score  =  0.8
custom in score =  0.8