Select tables from scanned PDF and process them individually (OCR)

Time:02-10

How can I select only a predefined part of an image for OCR extraction? I know how to extract text with Python and Tesseract, and I should be able to process a single table, but I don't know how to process multiple tables that sit next to each other.

This is the template: [image]

And I want to process these tables: [image]

The template will be the same each time, and I will be processing scanned copies. How can I do this with Python or C#?

CodePudding user response:
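Since the template is the same each time, one possible approach is to measure each table's position once and then crop those fixed regions from every scan before running OCR on each crop separately. The following is only a minimal sketch under several assumptions: the region names and fractional coordinates in `TABLE_REGIONS` are hypothetical placeholders that would have to be measured on the real template, the page is already available as an image file (a scanned PDF would first need to be rendered to images, for example with pdf2image), and Pillow and pytesseract are installed. The regions are stored as fractions of the page size so that they still apply if the scan resolution changes.

```python
# Hypothetical table regions as fractions of the page: (left, top, right, bottom).
# These numbers are placeholders; measure the real ones once on the template.
TABLE_REGIONS = {
    "table_left": (0.05, 0.30, 0.48, 0.80),
    "table_right": (0.52, 0.30, 0.95, 0.80),
}


def to_pixel_box(region, width, height):
    """Convert a fractional (left, top, right, bottom) region to pixel coordinates."""
    left, top, right, bottom = region
    return (
        round(left * width),
        round(top * height),
        round(right * width),
        round(bottom * height),
    )


def ocr_tables(image_path, regions=TABLE_REGIONS):
    """Crop each predefined region from the scanned page and OCR it on its own."""
    # Imported here so the coordinate helper above works without the OCR stack.
    from PIL import Image
    import pytesseract

    page = Image.open(image_path)
    results = {}
    for name, region in regions.items():
        box = to_pixel_box(region, page.width, page.height)
        crop = page.crop(box)
        # --psm 6 asks Tesseract to treat the crop as one uniform block of text.
        results[name] = pytesseract.image_to_string(crop, config="--psm 6")
    return results
```

Calling `ocr_tables("scan.png")` (a hypothetical file name) would return a dictionary with one text string per table. Fixed coordinates only work if every scan is aligned the same way, so skewed or shifted scans would probably need to be deskewed or aligned to the template first.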
