Select tables from scanned PDF and process them individually (OCR)


Time：02-10

How can I select only a predefined part of an image for OCR extraction? I know how to extract text with Python Tesseract, and I should be able to process one table, but I don't know how to process multiple tables next to each other.

This is the template:

And I want to process these tables:

The template will be the same each time. I will be processing scanned copies. How can I do it with Python or C#?

CodePudding user response：
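
Since the template is the same each time, one possible approach is to measure each table's position once, crop that region out of every scanned page, and run OCR on each crop separately. Below is a minimal sketch in Python, assuming Pillow and pytesseract are installed and each PDF page has already been rendered to an image (for example with pdf2image). The region names and coordinates in TABLE_REGIONS are hypothetical placeholders, not values taken from the template above; they are stored as fractions of the page size so the same boxes work at different scan resolutions.

```python
from typing import Dict, Tuple

Region = Tuple[float, float, float, float]

# Hypothetical table positions as fractions of the page: (left, top, right, bottom).
# These numbers are placeholders; measure them once on your own template.
TABLE_REGIONS: Dict[str, Region] = {
    "table_left": (0.05, 0.30, 0.48, 0.80),
    "table_right": (0.52, 0.30, 0.95, 0.80),
}


def to_pixel_box(region: Region, width: int, height: int) -> Tuple[int, int, int, int]:
    """Convert a fractional region into a (left, upper, right, lower) pixel box."""
    left, top, right, bottom = region
    return (
        round(left * width),
        round(top * height),
        round(right * width),
        round(bottom * height),
    )


def ocr_tables(image_path: str, regions: Dict[str, Region] = TABLE_REGIONS) -> Dict[str, str]:
    """Crop each region out of the page image and OCR it on its own."""
    # Imported here so the box arithmetic above works even without OCR installed.
    from PIL import Image
    import pytesseract

    page = Image.open(image_path)
    results = {}
    for name, region in regions.items():
        box = to_pixel_box(region, page.width, page.height)
        crop = page.crop(box)
        # --psm 6 asks Tesseract to treat the crop as a single uniform block of text.
        results[name] = pytesseract.image_to_string(crop, config="--psm 6")
    return results
```

Calling ocr_tables("page1.png") (a hypothetical file name) would return one text string per table, which can then be parsed independently. This only works if every scan lines up with the template; if pages come out shifted or rotated, they would need to be deskewed or aligned before cropping.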
