Text classification - looking for approach suggestions

I'm trying classify short texts, using nltk and scikit-learn, but I am not sure yet how exactly approach it, and I am looking for advice. A particular text may belong to more then one class, or it may not belong to any. The dataset I have is about 100k items, with relatively small amount of items per category (thousands in few cases, hundreds in many, far less in most). For a given cIass I can easily generate samples of items that should be there, but I am not sure what about counter examples (if I need them). So far I am experimenting with naive Bayes classification, where I train classificator using known sample items and random selection of known items that don't belong to this class, doing this separately for each class. As a result classification works well for things that are good match, but generates lot of false positives. Is there a better way of doing this?

submitted by fiedzia
[link][4 comments]

Text classification - looking for approach suggestions

Trending Articles

SEEDUWA SAKURA LIVE IN GONAPALA 2018

Lessons learned from suicide of student Joseph Evans

GTA 5 PPSSPP Zip File Download For Android Mediafire 382 MB

Practice Sheet of Right form of verbs for HSC Students

Telangana Ration Card Online Status Ahara Bhadratha Card Online Status

Bureau of Internal Revenue: Regional Offices (Directory)

Man jailed by Grimsby court for 'degrading' attack on a teenage...

hide – REPSYCLE ~hide 60th Anniversary Special Box~ [CD FLAC + Blu-ray ISO]...

Black Angus Grilled Artichokes

Moondru Mudichu 20-07-2016 – Polimer tv Serial

Demi Lovato – Tell Me You Love Me (Remixes) – 2018 – iTunes Plus AAC M4A – EP

Kerala Government Public Holidays 2016

Chal chalo chalo lyrics and translation | S/O Sathyamurthy (2015)

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

Shivaji University Result 2017 BA B.Com B.Sc 1st, 2nd & 3rd Year परिणाम यंहा...

How to convert JSON to ABAP Internal Table data

£700k teaching scam claim emerges during sex probe into supply teacher

Outlook でメールを保存または送信時に...

NY-PHIL Mafia’s “Peter Pan” Tuccio Got A Beat Down For Being Disrespectful To...

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...