I have several folders that contain over 10 pdf files each.
Each pdf contains various text and 1 or 2 tables. Each table has the same 5 column names.
Each pdf is password protected and the tables are located in images (i.e. it seems the pdfs have been scanned)
For the pdfs in each folder:
- extract data from the table (or tables if there are 2)
- create a csv file with the same name as the pdf (eg [login to view URL] -> [login to view URL])
- load the data from the pdf tables into a table in the csv
If there are 10 pdf files in a folder then the script will create 10 csv files in the same folder.
A single script is needed, and it must be tested and working fully before being submitted.
The script must meet PEP 8 and be fully commented as per PEP257.
Include the word silver in the first sentence of your proposal to show you have read this fully and understand it.
In your proposal please indicate
- how soon you can complete this project
- proposed fixed cost
- why you are the right person for this job
I will provide samples and an example csv to shortlisted candidates before we finalise the job, so that you can confirm price and time are correct.
Thank you. I look forward to hearing from you soon.
15 freelancers are bidding on average $124 for this job
Hi Give me a Chance.I'll do this job.I'm passionate and hardworking freelancer.I have 8 years [login to view URL] job will be performed efficiently and accurately before the deadline. Kind regards Niroshi.w
Silver, im familiar with padas and xml in im used to writing scripts and feel this shouldnt take too long. Are the paswords known or not? Full disclosure i dont know that standard, but i can learn it.
Hi Silver, I recently worked upon similar optical character recognition project in my current job. I have over 7 years of experience in Data engineering and can complete your project within 7 working days.