Question on linear SVMs and curse of dimensionality

Hello, for a project I'm trying to analyze a binary SVM classifier on a set of images. Each image is represented by a vector of 330 values in [0,1] that sum to 1.

(I won't explain what those features represent: it would be useless, since they lack a clear meaning. In fact, my goal is to explain what the classifier learns from the set.)

My training set comprises 1500 samples (each with a binary label), and the test set has over 5000 samples.

As you can see, the dimensionality of the problem is rather high: the data matrix has 1500 rows and 330 columns. Still, I am able to train a linear SVM on 4/5 of the training set and achieve over 95% accuracy on the remaining 1/5.

I am using LIBLINEAR, with L2 regularization and L2 loss function.
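
For concreteness, my setup is roughly equivalent to the sketch below (scikit-learn's LinearSVC wraps LIBLINEAR; the random data and the C value are placeholders, not my actual features or tuned parameter):

    import numpy as np
    from sklearn.model_selection import train_test_split
    from sklearn.svm import LinearSVC

    # Placeholder data with the same shape as described: 1500 samples,
    # 330 non-negative features per sample, each row summing to 1.
    rng = np.random.default_rng(0)
    X = rng.random((1500, 330))
    X /= X.sum(axis=1, keepdims=True)
    y = rng.integers(0, 2, size=1500)

    # Hold out 1/5 of the training set for evaluation, as described above.
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

    # LIBLINEAR-style linear SVM: L2 regularization, L2 (squared hinge) loss.
    clf = LinearSVC(penalty="l2", loss="squared_hinge", C=1.0)
    clf.fit(X_tr, y_tr)
    print("held-out accuracy:", clf.score(X_val, y_val))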

I also made sure not to overfit in any way: inside the 4/5 split I perform feature transformation (z-score normalization) and SVM calibration with an additional 5-fold CV, as sketched below.
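
The normalization and the calibration both happen inside the cross-validation, so no statistics leak from the held-out fold. A minimal sketch, assuming "calibration" means tuning C (the grid is a placeholder; X_tr and y_tr are the 4/5 split from above):

    from sklearn.model_selection import GridSearchCV
    from sklearn.pipeline import Pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import LinearSVC

    # The z-scoring step lives inside the pipeline, so each CV fold is
    # scaled with statistics computed on its own training part only.
    pipe = Pipeline([
        ("scale", StandardScaler()),
        ("svm", LinearSVC(penalty="l2", loss="squared_hinge")),
    ])

    # Placeholder C grid for the 5-fold calibration step.
    search = GridSearchCV(pipe, {"svm__C": [0.01, 0.1, 1, 10, 100]}, cv=5)
    search.fit(X_tr, y_tr)
    print(search.best_params_, search.best_score_)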

What's more interesting is that the same SVM still achieves around 88% accuracy over the entire test set, which is almost four times as big as the full training set. The results are very robust to the choice of cross-validation split.

Why does it perform so well despite the curse of dimensionality? Is this a general (unexpected?) characteristic of SVMs? Also, do I need some sort of dimensionality reduction? (Bear in mind that it is impossible to impose any meaningful probabilistic structure on the features, hence no LDA, QDA, etc.)

My ansatz is that the classes are genuinely well separated, in both the training and the test set. Still, the marginal feature distributions almost completely overlap between classes (although I know this can happen easily, even in R^2), and no feature is particularly dominant over the others, judging by the SVM hyperplane coefficients (see the sketch below). However, I would like to rule out a dimensionality effect, to be sure that I have obtained a good classifier.
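
This is roughly how I inspected the hyperplane coefficients (a sketch continuing from the fitted search above; what counts as "dominant" is informal):

    import numpy as np

    # Inspect the learned weight vector: if the |w_i| mass is spread over
    # many features rather than concentrated on a few, no single feature
    # dominates the decision boundary.
    w = np.abs(search.best_estimator_.named_steps["svm"].coef_.ravel())
    top = np.sort(w)[::-1]
    print("top-5 |w|:", top[:5])
    print("share of |w| mass in top 5:", top[:5].sum() / w.sum())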

Thank you!

submitted by Er4zor
