Find Jobs
Hire Freelancers

Email Extractor (Python, AI, Web scrap)

₹37500-75000 INR

Closed
Posted 8 months ago

₹37500-75000 INR

Paid on delivery
I am looking for a skilled Python developer with experience in AI and web scraping to create an email extractor tool for research and data analysis purposes. The ideal candidate will have a strong understanding of data extraction techniques and be able to develop a system that can efficiently extract emails from websites. Requirements: - Proficiency in Python programming language - Experience in web scraping and data extraction - Knowledge of AI techniques for data analysis - Ability to separate emails based on specific industry criteria - Familiarity with handling large volumes of data (40,000+ emails) Responsibilities: - Develop a Python-based email extractor tool - Implement web scraping techniques to extract emails from websites - Modify the system to separate emails based on industry criteria - Ensure the system can handle large volumes of data efficiently If you have the necessary skills and experience, please submit your proposal. This is a great opportunity to work on a project that involves AI, web scraping, and data analysis. Project Details:- E-mail ID extraction success rate should be 100% where email IDs are present on the website. If a website has more than one Email ID then the software should collect all email IDs from the website with a maximum count of two email IDs. The data collection should be excluding the pages like Blog, News, Article and product pages. Along with collecting email IDs, the software should be able to collect Facebook business page URLs from the same website. It will also collect facebook urls even if no email IDs are found on the website If no email IDs or Facebook business page URLs are found then the software should report Not Found. Extraction software should have start, stop and pause option. Live view of data extraction should be displayed on the software home screen with auto scrolling Export option should be always available whenever we wish to stop and collect the data. If the internet connection is interrupted then it should auto pause the collection and make the data available for download whatever it has extracted and it should allow us to resume the collection from the same row once internet reconnected. There should be a progress bar to show the progress on how far the data’s have been collected from the full count of data. If we have uploaded a sheet of 5000 data and the collection count is at 2456 then it will show 2456 out of 5000 with % of success rate and failure rate. Approximately a sheet of 10000-12000 websites to be extracted in one hour time with an internet bandwidth speed of 100 Mbps. The time taken for a full sheet collection may get an exception of 25% extra time depending upon the data. A data sheet of 40000-50000 website should take approximate 4 hours of time with an exception of 25% extra time.
Project ID: 37182028

About the project

16 proposals
Remote project
Active 7 mos ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
16 freelancers are bidding on average ₹60,046 INR for this job
User Avatar
Hello, I'm George, a web crawler expert with over 5 years of experience in developing Python-based extractor tools. I eagerly propose my skills for the job as I strongly believe it is an opportunity to put my knowledge into practice. Aside from Python and web scraping, I have also developed systems using AI for data analysis. I have previously handled datasets with more than 40,000+ emails, sorting them through industry criteria and ensuring high extraction efficiency. Therefore, I'm confident I could deliver top-notch quality results for the job. I have done something similar in the past as I have created a custom web scraping tool which can efficiently extract emails and Facebook business page URLs from websites with a high success rate. Questions concerning the job are: 1. What's the preferred format for the output data? 2. What criteria will be used to separate emails? 3. Is there a specific timeline for the project completion? 4. Are there any specific website parameters that need to be taken into account? 5. Are there any impenetrable websites that need to be bypassed? Thanks! I'm excited to further discuss this project and see how I can help.
₹74,983 INR in 3 days
5.0 (157 reviews)
8.1
8.1
User Avatar
hi i am a python developer and machine learning engineer. i can do the task for you with all the requirements mentioned .contact me.
₹60,000 INR in 2 days
4.9 (28 reviews)
7.4
7.4
User Avatar
Hello there! My name is Abhi Chaudhari and I'm a results-driven Python developer with over 4 years of dynamic experience. I'm excited to hear that you're looking for a skilled Python developer with experience in AI and web scraping who can create an email extractor tool for research and data analysis purposes. With my extensive background in software development, I feel confident that I can deliver an exceptional product that meets all your requirements. I have completed lots of similar project before for other clients. Let’s have a quick chat on this project to discuss the deadlines. Looking Forward to working with you. Thanks
₹37,500 INR in 4 days
4.9 (187 reviews)
7.0
7.0
User Avatar
Hello bhabatoshpanda30! I hope you're well. I'm a senior Scraper developer with experience in developing scraper using scrappy and headless browsers. I can deal with bypassing IP throttling limit, ban and captcha solve, storing the result in JSON, CSV and excel files. I've delivered more than 100 projects over time with 5* rating. Here are some of my skills necessary for this task. ➢ Python: Deep understanding of Python and libraries like Scrappy, Proxy, Beautifulsoup, lxml, Captcha ➢ Tools: Headless Browser, Selenium, Playwright ➢ Databases: MySQL, Postgres, Oracle, MongoDB ➢ Source Code Management: Git, GitLab, Bit-bucket, SVN ➢ Cloud Providers: AWS, GCP and Azure ➢ Containerisation: Docker, Kubernetes Best, Sonu
₹47,000 INR in 20 days
4.9 (58 reviews)
6.2
6.2
User Avatar
Hello, I have 10 years of experience in Python I will help you to Extract emails using python Regards, VishnuLal
₹40,000 INR in 3 days
4.9 (42 reviews)
5.2
5.2
User Avatar
Hi There, I have the skills and experience to develop a data extraction tool that meets the requirements and can handle large volumes of data efficiently. Please let me know if you would like to discuss this project further. I am interested in your project. Please feel free to send me a message with any queries. Thanks in advance for giving me the opportunity to work with you. Best Regards, Sirajum Munir.
₹55,000 INR in 1 day
5.0 (8 reviews)
4.3
4.3
User Avatar
Greeting! I have been work on similar project in which I have written a python script to scrap email id from large volume of websites. Let's talk on this more.
₹56,250 INR in 7 days
5.0 (14 reviews)
4.2
4.2
User Avatar
Here is a detailed plan to develop the robust email extractor tool using Python: Phase 1 - Design and Architecture - Review requirements and sample data - Design system architecture and workflow - Choose appropriate libraries like Scrapy, Selenium, BeautifulSoup - Set up environment and repository Phase 2 - Implement Core Extraction Logic - Use Scrapy and Selenium to crawl supplied sites - Identify and extract emails with regex - Separate emails by industry using NLP/ML techniques - Store data in MongoDB database Phase 3 - Build Management Console - Create GUI with Tkinter for managing extraction - Implement start, stop, pause functionality - Display live extraction stats and progress bar - Add export functionality to download results Phase 4 - Testing and Optimization - Conduct extensive tests with sample datasets - Optimize performance for large volumes - Fine tune email identification accuracy - Validate 100% extraction rates where emails exist Phase 5 - Deployment and Maintenance - Containerize with Docker for easy deployment - Create documentation for usage and maintenance - Provide ongoing support to resolve issues I have few years of experience in Python, web scraping, NLP and building scalable data pipelines. I can deliver a robust, high-performance email extraction tool that meets your requirements. Please let me know if you would like to discuss further.
₹75,000 INR in 7 days
4.9 (2 reviews)
3.0
3.0
User Avatar
I can deliver the web scraping tool including UI too. I understand you are looking for a skilled Python developer with experience in AI and web scraping to create an email extractor tool for research and data analysis purposes. As a full stack web developer with 7 years of experience in the field of Computer Science and extensive knowledge of modern Web techniques, I am confident that my skillset is the best fit for this project. My proficiency in Python programming language provides me with the necessary skills to develop an email extractor tool that can efficiently extract emails from websites while my experience in web scraping and data extraction gives me the knowledge needed to develop a system that can separate emails based on industry criteria. My ability to handle large volumes of data (40,000+ emails) efficiently makes me an ideal candidate for this project.
₹75,000 INR in 7 days
5.0 (2 reviews)
3.0
3.0
User Avatar
Hello, I'm excited about your email extractor project. I'm a skilled Python developer with: Python Proficiency: Extensive experience in Python. Web Scraping Mastery: Success in web scraping for data extraction. AI for Data Analysis: Ability to create sorting algorithms for emails based on industry criteria. Big Data Handling: Proficiency in managing large datasets. Responsibilities: Tool Development: I'll create a Python-based tool to meet your specifications. Web Scraping: Advanced techniques for a 100% email extraction success rate. Industry Criteria Sorting: AI-driven email categorization. Data Management: Efficient handling of large data volumes, with pause, resume, and export features. Live Data Feed: Real-time progress display with auto-scrolling. Internet Resilience: The tool handles interruptions, auto-pauses, allows data retrieval, and resumes. Progress Tracking: Progress bar showing collection status, success, and failure rates. Expected Output: Processing 10,000-12,000 websites per hour with a possible 25% time buffer for complexity. For 40,000-50,000 websites, expect roughly 4 hours, plus a 25% buffer. I'm committed to delivering a top-notch, dependable, and efficient solution. Feel free to reach out for further discussion or questions.
₹70,000 INR in 7 days
4.8 (2 reviews)
2.3
2.3
User Avatar
Hello, my name is Hammad and I am a full stack developer and website designer with 4+ years of experience in the industry. I understand exactly what you are looking for in an email extractor tool - a system that can efficiently extract emails from websites. With my expertise in artificial intelligence, java programming language and web scraping, I believe I am the perfect fit for this project. I have the necessary skills and experience to create an email extractor tool that can successfully extract emails from websites with 100% success rate. Additionally, my software has a start, stop and pause option so that it can effectively manage large volumes of data while still providing live view of data extraction on the home screen with auto-scrolling functionality. Additionally, export option is always available whenever required so you don't have to worry about losing any data during collection. Additionally, I am available for freelance work so if you're interested please don't hesitate to contact me regarding this project. Thank you for considering me for this job!
₹56,250 INR in 1 day
5.0 (1 review)
1.8
1.8

About the client

Flag of INDIA
Dhenkanal, India
0.0
0
Member since Feb 22, 2023

Client Verification

Other jobs from this client

E-mail Extractor
₹37500-75000 INR
Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.