I am looking to try and make an android app to pull out data from a receipt as an exercise. After some basic research I think tesseract may be a good choice, but it looks like some pre-processing is required and opinions vary. Does anyone have suggestions on what sort of pre-processing I should look into (grayscale, binary conversion, gaussian blur, etc.) or if there are any other libraries that may help me out?
[link][3 comments]