Help wanted - Python - pdfplumber reads text with repeated characters

Time:11-21

Looking for help: when I use pdfplumber to read text from a PDF, the characters come out repeated.


For example, text shown in the PDF as (lu xun) is read as [lu LuXunXun].

Does anyone know how to deal with this?

CodePudding user response:

You can write a program that walks through the parsed text: wherever the same character appears twice in a row, replace the pair with a single character; otherwise move on to the next character.
This solves the repeated-character problem, but it may also mistakenly delete legitimate doubled characters (reduplicated words). In the case I ran into, the repetition was four characters long, so the cleanup was unlikely to affect normal text.
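
A minimal sketch of the approach described above, in case it helps. The function name `collapse_repeats` and the parameter `n` are my own and not part of pdfplumber; it assumes the text has already been extracted to a plain string and that each character is repeated exactly `n` times in a row.

```python
def collapse_repeats(text: str, n: int = 2) -> str:
    """Collapse each run of n identical adjacent characters into one.

    Hypothetical helper, not a pdfplumber API. n is the repetition
    factor seen in the extracted text (2 for doubled characters,
    4 for the four-character case mentioned above).
    """
    if n < 2:
        return text  # nothing to collapse
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        out.append(ch)
        # If the next n characters are all the same, skip the whole run;
        # otherwise just move on to the next character.
        if text[i:i + n] == ch * n:
            i += n
        else:
            i += 1
    return "".join(out)


print(collapse_repeats("aabbcc"))        # abc
print(collapse_repeats("hello"))         # helo (a legitimate double is lost)
print(collapse_repeats("aaaabbbb", 4))   # ab
print(collapse_repeats("hello", 4))      # hello (untouched)
```

As the `hello` case shows, `n=2` also removes genuine doubled letters, which is the false-positive risk mentioned above. A larger `n` is much less likely to match normal text.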