C# Spreadsheets and PDF Scraping
$250-750 USD
Paid on delivery
This project is to write code in C# to scrape a variety of spreadsheets and pdf files from a list of sources across the internet.
You will be provided with a code framework that provides methods to pull the data and save it. You will need to define the data you are scraping as C# objects (with entity framework "code first" attributes) and implementing the scraping logic for each file.
A sample of the pattern we wish for you to follow is included. There are simple wrappers provided so you can give a URL or local file and get the relevant Excel\PDF\Html library loaded with the data for you so you only need to write the scrape logic and objects not the infrastructure.
The libraries being used are:
ClosedXml\OpenXml (Excel 2007+)
NPOI (Excel 2003 and earlier)
iTextSharp (PDF)
HtmlAgilityPack (Html - probably not required for this project)
The spreadsheets and pdf files we want to scrape can be found by looking at the below links:
NOTE 1 - In many instances there are identical files for different time periods, so you can reuse the same scrape (so it isn't as many as it might look like at first)
NOTE 2 - Sometimes there are zip files that contain multiple spreadsheets\pdf files which also need to be scraped.
NOTE 3 - Sometimes there are PDF and Excel files representing the same data. In these cases you only need to scrape one of them (probably the spreadsheet as it is easier to do)
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
Project ID: #12489172
About the project
Awarded to:
Hi! Mr. Goodier = ) I have gone through all the links that are under project description. Basically, I counted the number of files on each link, there are around 450 files, considering there are some "master files More
23 freelancers are bidding on average $530 for this job
hi there i can work on this scrapper logic however i have some questions. Please initiate discussion. Thankyou
Hi, I am expert in making scrappers like this. I have plenty of experience using itextsharp, npoi, openxml. Please send me the code framework and other details. Thanks Barun
I am really interested to do this job and get started right away. Can we discuss the project details? Payment after you're completely satisfied nothing advance.
Dear Sir, I'm writing in response to your task post. As a highly competent software specialist with more than nine years of experience , I would bring a high quality and service focused mindset to this job. Based More
Hello, My name is Mohd Rafi, I have 13 years of experience as an Architect/Tech Lead/Developer in .Net Technologies. I have carefully gone through your job post and it looks like a perfect fit for my skills set. More
Hello, We have accomplished 90% of the project which is similar of your requirement. All we need 10% customization as per your requirement set and specifications. I want to discuss in personal chat in order to explore More
If the infrastructure is written and you only need the scrapping algorithm , I may finish this in 5 to 7 days ,but I have to see your Code Framework first before committing to this . Also , I noticed that your sou More
Dear Hiring Manager, Thank you for this wonderful opportunity. Today Your job posting has caught my attention because I’m keenly considering your job post “C# Spreadsheets and PDF Scraping”. I have 6+ years experienc More
I have good knowledge on PDF scraping and also I've worked on iTextsharp. Good knowledge in spreadsheet scraping. I have good technical knowledge.
Hello, Its a pleasure to let you know that I've Completed and Delivered similar project before. All I need to work upon customization part, if we can proceed towards more discussion. I have gone through your project More
I am well experienced in analysing the issues and providing the solution. Has 12+ years of experience in C++, C# dlls and application development. If you hire me, you will get satisfied result in time. I am here to ma More