Closed

Doubled text in PDF files: the ParseHOCR function should select "ocr_page" nodes.

description

Hi,
I think line 43 of /hOcr2Pdf/Elements/parser.cs should read:
HtmlNodeCollection nodes = body.SelectNodes("//div[@class='ocr_page']");
Otherwise the OCRed text is doubled in the PDF file.
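
To illustrate why the narrower query might matter, here is a minimal sketch using HtmlAgilityPack. It assumes, without having confirmed it against the original line 43, that the doubling comes from a broader query such as "//div" also matching the nested "ocr_carea" blocks, so the same text is reached once through the page and once through the block. The class name, the helper CountMatches and the sample hOCR fragment are hypothetical and are not part of hOcr2Pdf.

```csharp
using System;
using HtmlAgilityPack;

public static class HocrPageSelectionSketch
{
    // Hypothetical hOCR fragment: one page div wrapping one content-area div.
    public const string SampleHocr =
        "<html><body>" +
        "<div class='ocr_page' id='page_1'>" +
        "<div class='ocr_carea' id='block_1_1'>" +
        "<span class='ocr_line' id='line_1_1'>Hello world</span>" +
        "</div></div></body></html>";

    // Counts the nodes matched by an XPath query.
    // SelectNodes returns null (not an empty collection) when nothing matches.
    public static int CountMatches(string html, string xpath)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);
        HtmlNodeCollection nodes = doc.DocumentNode.SelectNodes(xpath);
        return nodes == null ? 0 : nodes.Count;
    }

    public static void Main()
    {
        // Assumed broad query: matches both the page div and the nested block div.
        Console.WriteLine(CountMatches(SampleHocr, "//div"));                     // prints 2
        // Proposed query: matches only the page div.
        Console.WriteLine(CountMatches(SampleHocr, "//div[@class='ocr_page']"));  // prints 1
    }
}
```

If each matched node is then walked for its text, the broad query would visit "Hello world" twice in this sample, while the "ocr_page" query visits it once.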

Regards
Closed Feb 17, 2015 at 4:11 AM by pwizzle
Resolved with proposed changes