Although there is the tabula-rosa package for extracting tables from PDFs, I found it easier to use online PDF to Excel utilities, in this case I used https://pdftables.com/ to convert the PDF files to Excel.
Source:
about 1 year ago
AI-based cloud services: utilize machine learning to extract structured data from PDFs. Examples include pdftables and docparser, but these are not open-source friendly.
– Source: dev.to
/
over 1 year ago
I believe such an API already exists: https://pdftables.com/ (no affiliation). Went to a presentation at a Golang meetup in Amsterdam by the guys behind this company. Seemed to know their stuff. But I have no real world experience using it.
– Source: Hacker News
/
almost 3 years ago
I tried pdftables.com and like it as a solution; other solutions I should investigate?
Source:
almost 3 years ago
I remember using this tool a while back: https://pdftables.com/ – it costs money for the scale of conversion you’re talking about.
Source:
over 3 years ago