Sir, I am very interested in web scraping and thus have a great experience of 2 years and handled many projects and worked on them thoroughly. I am very much aware of the latest techniques which are been used in order to the scrapping of any particular website to extract a large amount of data from websites.
We will use lxml, which is an extensive library for parsing XML and HTML documents very quickly; it can even handle messed up tags. We will also be using the Requests module instead of the already built-in urllib2 module due to improvements in speed and readability.
I will do web crawling and web scrape on any website using state of the art technologies,
Technologies used:
Python
BeautifulSoup
Selenium
Processes:
Web Crawling
Web Scraping
Data Cleaning
Data Storage
Output Formats:
Database (SQL)
CSV
MS Excel
Sql
less
0.0