Extract text from various document formats (PDF, DOCX, images, etc.)
Extract text from various different types of files
textract
$ textract document.pdf
$ textract document.docx
$ textract image.png