OCR with Tesseract
Table of Contents
Introduction
Installing Tesseract
Installation instructions for tesseract
are available here
Linux:
OS X:
sudo port install imagemagick
sudo port install tesseract
sudo port install tesseract-eng
Use Tesseract
convert PDF to 8-bit TIFF
convert -density 300 -fill white -draw 'rectangle 10,10 20,20' -background white -alpha Off TechnoCommercialLease.pdf -depth 8 file.tiff
tesseract file.tiff output