For fun, I decided to tackle the MNIST digit dataset. My first ideas involved KMeans clustering for feature extraction and an SVM with an RBF kernel for classification. Given the nature of the dataset - almost binary images of digits (very few shades of gray) - I didn't bother with normalization, not knowing at the time that this would be a huge problem. After several failures trying to improve classification with hyperparameter optimization, I figured something was wrong. The SVM was going nowhere - it degenerated to a constant, single-class function - while simple linear classifiers were doing much better. I decided to remove KMeans for the moment and focus on the SVM. Only after normalizing the data (subtracting the mean and dividing by the standard deviation of each feature) did I get meaningful results.
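For reference, here is a minimal sketch of the normalization step that fixed things, using sklearn's StandardScaler and SVC. The data here is random stand-in "pixel" data, not actual MNIST (which you could load via sklearn's dataset utilities):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Stand-in data: random "pixel" features in [0, 255], 10 fake classes.
rng = np.random.default_rng(0)
X = rng.integers(0, 256, size=(200, 64)).astype(float)
y = rng.integers(0, 10, size=200)

# Per-feature standardization: subtract the mean, divide by the std.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

# RBF-kernel SVM trained on the standardized features.
clf = SVC(kernel="rbf")
clf.fit(X_scaled, y)
```

The key point is that the scaler is fit per feature, so every pixel position ends up with zero mean and unit variance regardless of how often it is "on" in the training set.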
This was a bit puzzling to me. As I understand it, an SVM with an RBF kernel should, even in the worst case of generalization, act like a convoluted k-nearest-neighbours algorithm. The data also had a very well defined range, mostly just pixel on - pixel off. How can normalization have such a drastic effect?
TLDR; why does the MNIST digit data have to be normalized to get meaningful classification with an SVM?