Garbled text when extracting text from a PDF
Time:11-23
Problem description: over the past two days, while parsing a PDF document, I needed to extract a piece of text from it. The existing code is a method written by a predecessor at the company; the code is as follows:
The four parameters x1, y1, x2, y2 frame a rectangle, and the text to extract is whatever falls inside that rectangle, marked in the document below:
But the text actually extracted is as follows:
I searched Baidu, and most of the results there attribute this kind of problem to character encoding. In my case, part of the extracted text is correct and part is wrong, so it does not look like the encoding problem described online. Could someone more experienced please point me in the right direction?
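To make the setup concrete, here is a minimal sketch of what region-based extraction with x1, y1, x2, y2 amounts to. It is not the predecessor's method and does not use any real PDF library: the `TextChunk` type, the `extract_in_rect` function, and the sample data are all hypothetical, and it assumes each piece of text already comes with a position in PDF coordinates (origin at the bottom-left of the page).

```python
from dataclasses import dataclass


@dataclass
class TextChunk:
    """Hypothetical positioned text fragment, as a PDF parser might report it."""
    text: str
    x: float  # horizontal position of the fragment
    y: float  # vertical position (PDF convention: y grows upward)


def extract_in_rect(chunks, x1, y1, x2, y2):
    """Return the text of all chunks whose position lies inside the rectangle."""
    # Accept the corners in any order.
    left, right = sorted((x1, x2))
    bottom, top = sorted((y1, y2))
    inside = [c for c in chunks
              if left <= c.x <= right and bottom <= c.y <= top]
    # Reading order: top line first (largest y), then left to right.
    inside.sort(key=lambda c: (-c.y, c.x))
    return "".join(c.text for c in inside)


# Made-up sample data for illustration only.
chunks = [
    TextChunk("Hello", 100, 700),
    TextChunk(" world", 130, 700),
    TextChunk("footer", 100, 50),
]

print(extract_in_rect(chunks, 90, 690, 200, 710))
```

A filter like this only selects which fragments are returned; it does not change their characters. So if the rectangle is right but some characters still come out wrong, the cause is more likely in how the fragments were decoded than in the four parameters.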