Contents:
Pysseract.GetHOCRText
Make an HTML-formatted string containing hOCR markup. The ‘pagenum’ argument is 0-based, but the page number appears as 1-based in the results.
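
The 0-based versus 1-based offset is easy to get wrong when locating a page in the returned markup. The sketch below illustrates it using only the standard library. It does not call Pysseract: SAMPLE_HOCR is a hand-written fragment that imitates hOCR page markup, and expected_page_id and page_ids are hypothetical helpers introduced here. It assumes the page element carries an id of the form page_N, where N is the 1-based page number; check this against the actual output before relying on it.

```python
from html.parser import HTMLParser

# Hand-written fragment imitating hOCR page markup (an assumption for
# illustration; this is not real output from GetHOCRText).
SAMPLE_HOCR = (
    "<div class='ocr_page' id='page_1' title='bbox 0 0 640 480; ppageno 0'>"
    "<span class='ocrx_word' id='word_1_1' title='bbox 10 10 90 40'>Hello</span>"
    "</div>"
)


class PageIdCollector(HTMLParser):
    """Collect the id attribute of every element with class 'ocr_page'."""

    def __init__(self):
        super().__init__()
        self.page_ids = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if attr_map.get("class") == "ocr_page":
            self.page_ids.append(attr_map.get("id"))


def expected_page_id(pagenum):
    """Hypothetical helper: map a 0-based pagenum to the assumed 1-based id."""
    return f"page_{pagenum + 1}"


def page_ids(hocr):
    """Return the ids of all ocr_page elements found in an hOCR string."""
    collector = PageIdCollector()
    collector.feed(hocr)
    collector.close()
    return collector.page_ids


if __name__ == "__main__":
    # pagenum 0 is expected to show up as page_1 in the markup.
    print(expected_page_id(0) in page_ids(SAMPLE_HOCR))
```

In real use, the string passed to page_ids would be the value returned by GetHOCRText rather than the sample fragment.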