Open-source OCR engine for extracting text from images
OCR (Optical Character Recognition) engine
tesseract
$ tesseract image.png output
$ tesseract image.png output -l fra
$ tesseract image.png output pdf