This is where IronOCR truly shines against other OCR libraries such as Tesseract, and we will find alternative OCR projects shy away from discussing. Now we will try a much lower quality scan of the same page, at a low DPI, which has lots of distortion and digital noise and damage to the original paper. You will also note that Iron OCR can automatically read multi-page documents, such as TIFFs and even extract text from PDF documents automatically. OCR is not a perfect science when it comes to real world documents, yet IronTesseract is about as good as it gets. We can use this even on a medium quality scan with 100% accuracy.Īs you can see, reading the text (and optionally barcodes) from a scanned image such as a TIFF was rather easy. This all may seem daunting, but in the example below you will see the default settings which we would recommend you start with, which will work with almost any image you input to Iron OCR.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |