Java programmer for a New Software

In Progress Posted Sep 10, 2013 Paid on delivery

OVERVIEW

An automated webpage scanning application needs to gather information from a specific list of websites (about 10k) and store it in a Java object for further processing. In this project, each programmer is assigned a group of 100 sites and implements a scanner for each of them, according to the site's HTML structure.

The required information is usually organized in a highly structured manner, so gathering it can be implemented as a simple iteration over the entries.

The programmer is given a class library which the implemented scanners must comply with. Moreover, the provided library already contains a high-level API that abstracts and automates the scanning process. If the site is well-structured, the implementor simply needs to specify, in a jQuery-like fashion, where the required information is located. Nevertheless, the programmer is allowed to correct the scanners in order to obtain full payment.

SOFTWARE REQUIREMENTS

The application is written in Java 7, so JDK 7 is required to compile the scanners.

The application depends on two other libraries: jsoup 1.7.2 (to parse the HTML pages) and Apache Commons Lang 3.0.1 (general purpose). In most cases, the implementor will not need to use either of them directly.

To ease the structural detection of the HTML pages before implementing the related scanner, the use of Firefox with the Firebug plugin is highly recommended.

KNOWLEDGE REQUIREMENTS

A medium or good knowledge of Java is required, in order to produce good-quality code.

Since jQuery-like selectors are used to navigate through the HTML of the scanned page, the programmer must know how to write them; in any case, the jsoup library API docs contain a list of the supported selectors.
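As a rough illustration of what such selectors look like, here is a minimal sketch that uses jsoup directly rather than the application library (whose API is not shown in this posting). The HTML snippet, the class name SelectorSketch, and the selectors "ul#results > li.entry", "a.name" and "span.phone" are all made up for the example; a real scanner would use whatever structure the target page has.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.util.ArrayList;
import java.util.List;

public class SelectorSketch {

    // Hypothetical markup: one <li class="entry"> per record.
    public static final String SAMPLE_HTML =
          "<ul id=\"results\">"
        + "<li class=\"entry\"><a class=\"name\" href=\"/a\">Alpha</a>"
        + "<span class=\"phone\">011 111</span></li>"
        + "<li class=\"entry\"><a class=\"name\" href=\"/b\">Beta</a>"
        + "<span class=\"phone\">022 222</span></li>"
        + "</ul>";

    // Returns one {name, link, phone} triple per entry found in the page.
    public static List<String[]> extractEntries(String html) {
        Document doc = Jsoup.parse(html);
        List<String[]> rows = new ArrayList<String[]>();
        // The outer selector picks the repeated entry element;
        // the inner selectors are evaluated relative to each entry.
        for (Element entry : doc.select("ul#results > li.entry")) {
            String name = entry.select("a.name").text();
            String link = entry.select("a.name").attr("href");
            String phone = entry.select("span.phone").text();
            rows.add(new String[] { name, link, phone });
        }
        return rows;
    }

    public static void main(String[] args) {
        for (String[] row : extractEntries(SAMPLE_HTML)) {
            System.out.println(row[0] + " | " + row[1] + " | " + row[2]);
        }
    }
}
```

The code avoids lambdas and other Java 8 features so that it compiles with JDK 7, and it needs the jsoup jar on the classpath.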

The websites to be scanned are in Italian, but no particular knowledge of this language is needed.

The programmer will be provided with the API docs for the application library and some example scanners.

TERMINOLOGY

By “site” we mean a domain (e.g. [url removed, login to view]). If a site contains more than one webpage with the information we look for (e.g. [url removed, login to view], [url removed, login to view]), we call these “sub-sites”. If a sub-site with a long list of entries is divided into several numbered pages, such pages belong to the same sub-site and all of them must be scanned.
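The rule that all numbered pages of a sub-site must be scanned can be sketched as a simple loop. This is only an illustration under stated assumptions: the PageScanner interface, the scanSubSite method and the "stop at the first empty page" condition are hypothetical and are not taken from the application library; a real site might instead expose a "next" link or a total page count.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class PaginationSketch {

    /** Hypothetical hook: returns the entries found on one numbered page. */
    public interface PageScanner {
        List<String> scanPage(String subSiteUrl, int pageNumber);
    }

    /**
     * Collects entries from page 1 onward, stopping at the first empty page
     * (assumed end-of-list signal) or after maxPages pages as a safety cap.
     */
    public static List<String> scanSubSite(String subSiteUrl, PageScanner scanner, int maxPages) {
        List<String> all = new ArrayList<String>();
        for (int page = 1; page <= maxPages; page++) {
            List<String> entries = scanner.scanPage(subSiteUrl, page);
            if (entries.isEmpty()) {
                break;
            }
            all.addAll(entries);
        }
        return all;
    }

    /** In-memory stand-in for a sub-site split into two numbered pages. */
    public static PageScanner fakeScanner() {
        return new PageScanner() {
            @Override
            public List<String> scanPage(String subSiteUrl, int pageNumber) {
                if (pageNumber == 1) {
                    return Arrays.asList("entry-1", "entry-2");
                }
                if (pageNumber == 2) {
                    return Arrays.asList("entry-3");
                }
                return Collections.<String>emptyList();
            }
        };
    }

    public static void main(String[] args) {
        // The URL is a placeholder; the fake scanner ignores it.
        System.out.println(scanSubSite("http://example.invalid/list", fakeScanner(), 50));
    }
}
```

The maxPages cap is a defensive choice so that a site which never returns an empty page cannot keep the loop running indefinitely.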

Hence, the programmer has to implement a scanner for a given list of sub-sites (coming from 100 sites, as said above), keeping in mind that the HTML structure of sub-sites within the same site is often the same.

FULFILLMENT POLICY

As already said, the produced code must comply with the application library we give. The programmer must provide the source code of the implemented scanners and, optionally, the compiled class files.

We reserve the right to verify that the produced scanners actually work, and to pay only the amount corresponding to the working ones. However, the programmer is allowed to fix the non-working scanners in order to obtain full payment.

HTML Java jQuery / Prototype

Project ID: #4914999

About the project

13 proposals Remote project Active Sep 18, 2013

Awarded to:

derekmga

Hi! I'm John! I fully understand what your project means, because I worked and I'm currently working with JSOUP in a project that needs data from another site, data taken similarly with your project description. I ...

$30 USD in 3 days
(0 Reviews)
0.0

13 freelancers are bidding on average $110 for this job

tarun308

I would be glad to work on this project. I've worked on many java projects including projects Web Scraping. - Please contact me to discuss this further. Thanks.

$263 USD in 20 days
(65 Reviews)
6.0

chinbochen

Hello, I'm a Java, JEE developer with more than 10 years experience, I see your requirement. I'm glad to work this out for you. Thanks.

$24 USD in 8 days
(10 Reviews)
4.4

javoblevw

Hello, I'm a Senior Java developer and I have a strong professional skills in Java on different projects. I could help you with this project.

$25 USD in 7 days
(12 Reviews)
2.9

jguru1

Hello, I have the knowledge required to work on the project. I have extensive experience in Java and have also used jQuery and PHP in the past; I have not yet had the chance to use Prototype, however, but ...

$166 USD in 10 days
(1 Review)
2.8

choks1991

I am ready to do it..i can do it because i have good exp & command over java . I can do it very fastly Thank you

$30 USD in 3 days
(4 Reviews)
2.7

invictos

Sir i have very good skills in JAVA and can provide you this work. Looking forward to work with you.

$100 USD in 3 days
(1 Review)
0.0

harshprataptomar

I have experience on working with jquery and java 7. I like the idea of scanning webpages and using the data provided by them, I did something similar to it in the past, which was to scan HTML pages and test them for A ...

$100 USD in 3 days
(0 Reviews)
0.0