Closed

Python Script to Extract Data from PDFs

I have several folders that contain over 10 pdf files each.

Each pdf contains various text and 1 or 2 tables. Each table has the same 5 column names.

Each pdf is password protected and the tables are located in images (i.e. it seems the pdfs have been scanned)

For the pdfs in each folder:

- extract data from the table (or tables if there are 2)

- create a csv file with the same name as the pdf (eg [login to view URL] -> [login to view URL])

- load the data from the pdf tables into a table in the csv

If there are 10 pdf files in a folder then the script will create 10 csv files in the same folder.

A single script is needed, and it must be tested and working fully before being submitted.

The script must meet PEP 8 and be fully commented as per PEP257.

Include the word silver in the first sentence of your proposal to show you have read this fully and understand it.

In your proposal please indicate

- how soon you can complete this project

- proposed fixed cost

- why you are the right person for this job

I will provide samples and an example csv to shortlisted candidates before we finalise the job, so that you can confirm price and time are correct.

Thank you. I look forward to hearing from you soon.

Skills: Python, Software Architecture, PHP, Web Scraping, Javascript

See more: simple script extract data urls, perl script extract data website, script extract data web, html script extract data email, python script extract data web page, script extract data html perl mysql, perl script extract data file access, script extract data daily, python script crape data, script extract data php excel, python script extract data, python script extract data website, script extract data word document, python script extract web data, script extract data pdf word, python script extract data emails, python script extract data from file

About the Employer:
( 0 reviews ) Moquegua, Peru

Project ID: #22017640

15 freelancers are bidding on average $124 for this job

naravila

⭐⭐⭐Greetings of the day⭐⭐⭐ ☛I have gone through your project details carefully and I think that this project is very fit for my skill sets. ☛I have enough experience in similar projects, so I have a clear way to comple More

$50 USD in 10 days
(62 Reviews)
6.6
RRajeshR

silver Hi, I'm interested to work on your project. Before starting the estimation I would like to look at the sample pdf files. I've done similar project to convert the image files to text pdf using JAVA. As you menti More

$30 USD in 7 days
(8 Reviews)
4.9
sandking19915

Hello sir. High-quality & Fast-delivery is promised! As a highly skilled full stack developer and I can help you perfectly. I am very confident with my skills and I'd like to help your business by doing my best. My cli More

$1000 USD in 7 days
(4 Reviews)
3.2
mymamun

Hi there I am a python developer. I can write a script to extract data from PDF in just 20 minutes. Can you talk to me regarding this? Thanks Anamul

$20 USD in 7 days
(5 Reviews)
2.2
deepakagg500

SIlver Please share with me the sample pdf. Within 2 hours, i will tell you i can do it or not. I will try my best to give you the desired output. DON'T PAY ME A SINGLE PENNY IF YOU NOT LIKE MY WORK. hoping for your re More

$15 USD in 2 days
(2 Reviews)
1.6
Rachitkum55

i am expert in python i have almost about 2 year experience as Python Developer i did lots projects on python dill completed dead lines of projects on time i really serious about my responsibilities and work towards my More

$20 USD in 7 days
(1 Review)
0.6
Niroshi1991

Hi Give me a Chance.I'll do this job.I'm passionate and hardworking freelancer.I have 8 years [login to view URL] job will be performed efficiently and accurately before the deadline. Kind regards Niroshi.w

$15 USD in 2 days
(0 Reviews)
0.0
rjcpph

I have been doing write ups for bands and artists for 10+ years. As a result I am very quick and detail oriented.

$277 USD in 3 days
(0 Reviews)
0.0
suneeli1995

I have done one project on extracting data from pdfs using python and R scripts. If it's scanned I used Google vision for extract data from scanned pdf's. Relevant Skills and Experience I know R and Python. I have one More

$25 USD in 7 days
(0 Reviews)
0.0
AdamsBond

⭐⭐⭐⭐⭐Greetings Dear Client! ⭐⭐⭐⭐⭐ I read your project description carefully and I am confident to finish your project. My main skills is Python, C# ,C/C++, Java and Algorithm and if you assign to me this project, you c More

$100 USD in 2 days
(0 Reviews)
0.0
saranahmed192

Hi, this is Ahmed. I have read the description carefully and i am able to say that I can complete your task perfectly and according to your desire. Give me a chance to work for you and i assure you work as per your ne More

$25 USD in 3 days
(0 Reviews)
0.0
micahchurcha

Silver, im familiar with padas and xml in im used to writing scripts and feel this shouldnt take too long. Are the paswords known or not? Full disclosure i dont know that standard, but i can learn it.

$35 USD in 7 days
(0 Reviews)
0.0
Sreedevi5

silver The script will be implemented in python. The script contains 2 parts 1. Read text & table from password-protected scanned pdf files using Apache Tika server. 2. Convert the table contents to csv file using Pan More

$200 USD in 7 days
(0 Reviews)
0.0
aswary

Hi Silver, I recently worked upon similar optical character recognition project in my current job. I have over 7 years of experience in Data engineering and can complete your project within 7 working days.

$24 USD in 7 days
(0 Reviews)
0.0
giasuddin90

In my organization i write different type of automation script and process data in different file from pdf,xlsx, text [login to view URL] check my profile [login to view URL] https://gist.github. More

$30 USD in 7 days
(0 Reviews)
0.0