Some questions on Text Analysis

Hi, I hope this is the right subreddit.

For a small project I'd like to sift through a large amount of articles and tag them according to category (Interview, News article etc.) and occurrence of a preset of notable items ( Names, Brands ).

Later on I might want to add some language processing ( figure out if the article is FROM, WITH or ABOUT an item ) or sentiment analysis ( is this article positive or negative in tone ).

I've googled around, and I'm torn between GATE, NTLK and Rapidminer. I also have a couple of questions:

I couldn't find an Open Source library or Suite written in C or C++. Why is that? I'd think that it is quite resource hogging and using Java (which seems to be preferred ) or any other interpreter language would unneccesarilly bog down performance.
I'm not sure if any of the three tools above are really suited for the job. Especially GATE and Rapidminer seem like a bit of overkill. Your thoughts on that? *What are some good books/tutorials that will help me with this project? I've already bookmarked this link which at the time of posting is on the top of this subreddit. It seems to be quite useful for my goal. Other than that I am lost as for example the NTLK examples use tags like NN, NP-BSJ which I guess are shorthand for some grammatical definition (Nominative Noun maybe?) but which don't really help understanding. Any recommendations?

If you got this far, thanks for reading.

submitted by DeusexConstantia
[link] [4 comments]

Some questions on Text Analysis

Trending Articles

Scuffham Amps - S-GEAR 2.6.0 VST, AAX, STANDALONE x86 x64 (R2R NO iLok2, +NO...

Practice Sheet of Right form of verbs for HSC Students

VHSE First (1st) Allotment 2025 - vhscap.kerala.gov.in

UNIVERSE LEAGUE – UNIVERSE LEAGUE – WAR (We Are Ready) – EP [iTunes Plus M4A]

City Hunter Teledrama – Episode 18 – 07th May 2016

Comment on Proposed Criteria for Identifying Predatory Conferences by Luke...

Bureau of Internal Revenue: Regional Offices (Directory)

Kendrick Lamar – Not Like Us (2024) [24Bit-88.2kHz] [PMEDIA] ⭐️

Inception 2010 Hindi Dual Audio 650MB BRRip 720p ESubs HEVC

East Hull MD admits sexual assaults after another victim comes forward

Download: Ziba Zako ft Rich Bizzy & General Kanene – Chikwati (Prod by: Bicko...

R. v. Sargeant, 2023 ONSC 6406 (CanLII)

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Who’s been sentenced at Northampton Magistrates’ Court

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

Family cries out as traditional ruler allegedly abducts brother, extorts N2.5m

Long-Running Conflict In Springfield (MA) Gangland Sphere Has Manzi Family &...

Wondershare Filmora X v10.1.20.16 x64

Man arrested after fracas in flat

Man charged in ongoing Sexual Assault Investigation Derek Nyilas, 46, Faces...