The most scalable kernel SVM implementation I know of is LaSVM. It’s written in C hence wrap-able in Python if you know Cython, ctypes or cffi. Alternatively you can use it from the command line. You can use the utilities in sklearn.datasets
to load convert data from a NumPy or CSR format into svmlight formatted files that LaSVM can use as training / test set.