Homework advice - Clustering w/o K-Means?

Hello all, I'm working on a data mining project that ends with me clustering bag-of-words type data.

The majority of the project so far has been pre-processing (the data is an awesome web-crawled data set of tweets from Middle Eastern countries during the Arab Spring!). I have a dictionary of word counts, so I can assign some sort of weight to each word.
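
To make the weighting idea concrete, here is a minimal sketch assuming each tweet is already stored as a dict of word counts and that plain TF-IDF is an acceptable weight. The function name `tfidf_weights` and the toy tweets are hypothetical, made up for illustration, and are not the real data set.

```python
import math
from collections import Counter

def tfidf_weights(docs):
    """docs: list of dicts mapping word -> raw count within one tweet.
    Returns a list of dicts mapping word -> TF-IDF weight."""
    n_docs = len(docs)
    # Document frequency: in how many tweets does each word appear?
    doc_freq = Counter()
    for counts in docs:
        doc_freq.update(counts.keys())
    weighted = []
    for counts in docs:
        total = sum(counts.values())
        # Term frequency times log inverse document frequency.
        weighted.append({
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return weighted

# Hypothetical toy data, only to show the shape of the input and output.
toy_tweets = [
    {"tahrir": 2, "square": 1},
    {"tahrir": 1, "protest": 1},
    {"square": 1, "protest": 1, "tahrir": 1},
]
weights = tfidf_weights(toy_tweets)
```

With this scheme a word that shows up in every tweet (here "tahrir") gets weight zero, which is the point: it carries no information for separating clusters.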

I'm getting to the point now where I need to actually cluster the data. The vectors are very sparse, since each feature is a word. (Maybe I should try something else for this? A kernel method to map it onto some subspace?) After all the work I've done preprocessing rough, incomplete Arabic/French/English mixtures of tweets, I feel like I've got to find SOME algorithm that's more complicated than the k-means the professor spoon-fed us.
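
As a point of comparison with k-means, here is a sketch of one alternative that works directly on sparse vectors: average-link agglomerative clustering with cosine similarity. It assumes each tweet is a dict of word weights; the names `cosine` and `agglomerate`, the `threshold` parameter, and the toy vectors are all hypothetical. It is cubic in the number of tweets, so treat it as an illustration on a small sample, not something to run on the full crawl.

```python
import math

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    if len(a) > len(b):
        a, b = b, a  # iterate over the shorter vector
    dot = sum(w * b.get(word, 0.0) for word, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def agglomerate(vectors, threshold):
    """Average-link agglomerative clustering.
    Merges the two most similar clusters until no pair has an average
    cosine similarity of at least `threshold`. Returns lists of indices."""
    clusters = [[i] for i in range(len(vectors))]
    # Precompute all pairwise similarities once.
    sim = [[cosine(u, v) for v in vectors] for u in vectors]

    def avg_link(c1, c2):
        return sum(sim[i][j] for i in c1 for j in c2) / (len(c1) * len(c2))

    while len(clusters) > 1:
        best, best_pair = -2.0, None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                s = avg_link(clusters[x], clusters[y])
                if s > best:
                    best, best_pair = s, (x, y)
        if best < threshold:
            break  # remaining clusters are too dissimilar to merge
        x, y = best_pair
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters

# Hypothetical toy vectors: two topics with no shared words.
toy_vectors = [
    {"tahrir": 1.0, "square": 1.0},
    {"tahrir": 1.0, "square": 2.0},
    {"football": 1.0, "match": 1.0},
    {"football": 2.0, "match": 1.0},
]
groups = agglomerate(toy_vectors, threshold=0.5)
```

Unlike k-means, this needs a similarity threshold and not a fixed number of clusters, and it never builds dense centroids, so the sparsity of the word features is not a problem in itself.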

Any thoughts? If anyone knows of an algorithm that's particularly good on sparse data, I will upvote you and your family.

submitted by groundshop