I found this in /r/datasets posted by "trexmatt" Post Link Download Link.
I am a datamining noob, and I have an idea for a data mining exercise with these covers. I would like to get some guidance on how to go about it.
I would like to extract just the text in each cover and get as output just the cover quotes. I would like to know what kind of training sets are available for this and what are some good imaging library in python for this.
[link] [2 comments]