Submitted by Rodny_ t3_yyenpp in MachineLearning
robertknight2 t1_iwveveb wrote
Reply to comment by flapflip9 in [P]Modern open-source OCR capabilities and which model to choose by Rodny_
To add to this, Tesseract's recognition of text within identified lines uses a modern approach based on LSTM neural networks. The text detection step that comes before it, however, uses classical/heuristic (i.e. non-ML) approaches. These work well on clean-ish document images, but can struggle with photos of documents taken under uneven lighting, and with spotting text in a general photo (e.g. number plates in a city scene).
I maintain a JavaScript build of Tesseract with an online demo that you can try with different images: https://robertknight.github.io/tesseract-wasm/
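
To make the uneven-lighting point concrete, here is a toy sketch in plain Python. It is not Tesseract's code and does not reproduce its detection pipeline; the synthetic "page", the function names, and the window/offset values are all assumptions made for illustration. It only shows why a single fixed cut-off for "ink vs. paper" can break down when brightness drifts across an image, while a rule that compares each pixel to its neighbourhood copes better.

```python
# Toy illustration only: NOT Tesseract's implementation.
# All names and parameters here are hypothetical.

def make_page(width=40, height=8):
    """Synthetic grayscale page whose background brightens left to right.
    Every 4th column is 'ink', 60 levels darker than the local background."""
    page, ink = [], []
    for _ in range(height):
        row, ink_row = [], []
        for x in range(width):
            background = 80 + 4 * x          # uneven lighting: 80 .. 236
            is_ink = (x % 4 == 0)
            row.append(background - 60 if is_ink else background)
            ink_row.append(is_ink)
        page.append(row)
        ink.append(ink_row)
    return page, ink

def global_threshold(page):
    """Mark a pixel as ink if it is darker than the mean of the whole page."""
    pixels = [v for row in page for v in row]
    cutoff = sum(pixels) / len(pixels)
    return [[v < cutoff for v in row] for row in page]

def local_threshold(page, radius=2, offset=10):
    """Mark a pixel as ink if it is clearly darker than the mean of a small
    horizontal window around it (the window is clipped at the row edges)."""
    result = []
    for row in page:
        out = []
        for x, v in enumerate(row):
            window = row[max(0, x - radius): x + radius + 1]
            out.append(v < sum(window) / len(window) - offset)
        result.append(out)
    return result

def count_errors(predicted, truth):
    """Number of pixels where the prediction disagrees with the true ink mask."""
    return sum(p != t
               for p_row, t_row in zip(predicted, truth)
               for p, t in zip(p_row, t_row))

page, ink = make_page()
global_errors = count_errors(global_threshold(page), ink)
local_errors = count_errors(local_threshold(page), ink)
print("global threshold errors:", global_errors)
print("local threshold errors:", local_errors)
```

On this synthetic page the global rule labels dim paper on the dark side as ink and misses ink on the bright side, while the local rule separates them. Real photos are much messier than this, so treat it as intuition rather than a fix.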