Dear @linux and @academicchatter folks:
Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?
P.S: I’m on a linux machine. Would like something terminal friendly, if possible!
Dear @linux and @academicchatter folks:
Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?
P.S: I’m on a linux machine. Would like something terminal friendly, if possible!
@ajayiyer@mastodon.social OCRmyPDF is exactly what you are looking for