What alternatives exist for encoding text for a neural network or SVM? Bag of words (unigrams, bigrams, trigrams, etc) seems to be the most common approach, but it seems that it is better at encoding the broad "topic" rather than "meaning" of text.
Are there approaches that preserve more or all of the text structure?
[link][15 comments]