Find Jobs
Hire Freelancers

A VB program to monitor Google for copies of web pages

$100-500 USD

Closed
Posted over 18 years ago

$100-500 USD

Paid on delivery
What I need is a program that, given a series of web pages, uses the Google API to monitor the web for possible copies of the text. The specs are not yet done on this project, so: 1) I am open to suggestions, proposing good ideas here might be a factor in selecting your bid 2) I realize that after the final specs are done, you might want to change your initial bid. I am ready to accept this What I am thinking about is something like the following: get the text of the web page extract a phrase (N words, where N is like 5-6 words) search it in Google if a site exists, it might be a copy of our text. Another approach that will probably be better to search parial copies: get the text of the web page extract a phrase (N words, where N is like 3-4 words) search it in Google if there are more than 100 pages, it was a too common phrase, try with another phrase remember the results repeat the process 5 times, if a page appears more than 3 times in the 5 results, it might be a copy of your text. There are a LOT of enhancements that can be applied to this, of course, and I would like you to be creative on this too. For example, options that might be included are: - automatically "spider" the web site to be checked (i.e. download all its pages) - white-listing of specific pages/domains: the user sees a page and decides that it's ok. He then marks the page as being whitelisted, and it will not be displayed any more in the possi - show the side-by-side the texts of the original and of the possible copy, highlighting the matching parts of text - set up an automatical check of pages every N days - in case of large lists of pages, where you might hit the 1.000 searches limit from Google, process N pages at a day ## Deliverables 1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables): a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment. b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request. 3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement). ## Platform VB6, Google API
Project ID: 3113031

About the project

2 proposals
Remote project
Active 18 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
2 freelancers are bidding on average $310 USD for this job
User Avatar
See private message.
$510 USD in 30 days
4.7 (4 reviews)
4.2
4.2
User Avatar
See private message.
$110.50 USD in 30 days
5.0 (6 reviews)
1.4
1.4

About the client

Flag of ITALY
Rome, Italy
5.0
234
Payment method verified
Member since May 29, 2001

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.