This thread discusses the gbDocumentCapture app I posted in the Source Code forum.
It allows you to capture the text from all pages in a document. It does so by capturing an image of the document found in the middle of the Desktop, then uses the free Tesseract library to extract the text from the image. gbDocument Capture walks through the document, capturing and merging the text from each page.
I like how this turned out. I'll put some more effort into giving it a much better interface along with some other capabilities that come to mind.
Folks who want to convert a document, such as Kindle book, into a text file will find this useful - particularly for books with DRM protection.
It allows you to capture the text from all pages in a document. It does so by capturing an image of the document found in the middle of the Desktop, then uses the free Tesseract library to extract the text from the image. gbDocument Capture walks through the document, capturing and merging the text from each page.
I like how this turned out. I'll put some more effort into giving it a much better interface along with some other capabilities that come to mind.
Folks who want to convert a document, such as Kindle book, into a text file will find this useful - particularly for books with DRM protection.
Comment