Write program to parse HTML and generate XML

Completed Posted Jul 14, 2013 Paid on delivery
Completed Paid on delivery

I am looking for someone to write a script that would retrieve the contents of a public web page and convert the data that it contains into parseable XML. The web page is located here: [login to view URL]

Your script can be in any language, but needs to conform to these standards:

- This is a work for hire, and we get the source code and copyright.

- Must run on either Windows 7 or Linux. I can get you specific info about the box I'll be using

- Must require minimal installation of frameworks or languages - none of which will cost us anything to install and run.

- Must output well-formed XML that can tolerate non xml-safe characters. The specific schema is up to you, as long as it is reasonable and clean.

- Must capture all of the data elements provided on this page, including image URL and arrestee info as well as offense info.

- When you bid, tell us what platform and language you would prefer to use.

We fully understand that the HTML you are looking at is crappy and subject to change in the future. Ideally your code is easy to understand so we can tweak it, or we may call you back if it needs updating. This should be a quick one for the right programmer.

Clarifications:
- Should run on windows OR linux, but doesn't have to be both
- Output can be to a file or STDOUT.
- Needs to be able to run via Windows scheduled task, command line, or Linux cron job. Not looking for a GUI or web interface for this. This will be run by another program, so it should not require any human interaction in order to work.

More clarifications:
- Java and Perl are both fine languages to use. But we're open to any others as well, as long as there is not much for us to install
- I don't have a sample XML file for you, but here is some example of the fields I'm looking for from the web page:

TABLE tt-arrest
FIELD BookingNumber AS INTEGER
FIELD ImageURL AS CHARACTER
FIELD LastName AS CHARACTER
FIELD FirstMiddle AS CHARACTER
FIELD Age AS INTEGER
FIELD DateTimeConfined AS DATETIME

TABLE tt-offense
FIELD BookingNumber AS INTEGER
FIELD OffenseNum AS INTEGER
FIELD StatuteDesc AS CHARACTER
FIELD BondType AS CHARACTER
FIELD BondAmount AS INTEGER

.NET Java Perl Shell Script Software Architecture

Project ID: #4725751

About the project

26 proposals Remote project Active Jul 15, 2013

Awarded to:

Peterpay

i can do this for you check PMB

$40 USD in 0 days
(25 Reviews)
5.1

26 freelancers are bidding on average $34 for this job

gangabass

I'm expert in HTML scraping/parsing. See my PM for details.

$33 USD in 1 day
(677 Reviews)
7.9
PigtailXL

Experience and seriousness here with many IT tools, including customizable scrapping scripts for differents tasks. Please, take a look at your PM for details. Thanks.

$35 USD in 1 day
(119 Reviews)
7.2
rsen75

i'm 14+ years experienced, ready to start work

$50 USD in 3 days
(96 Reviews)
7.1
dobreiiita

Hi, I am interested in this project, Thank you

$35 USD in 3 days
(457 Reviews)
7.5
DucNA

Let me help you!

$30 USD in 2 days
(272 Reviews)
6.6
shenchilang

Experienced java developer.

$35 USD in 3 days
(85 Reviews)
6.5
poornachand

I am ready to do this

$35 USD in 3 days
(100 Reviews)
6.8
waverick

Scraping program will be programmed in Java

$35 USD in 3 days
(43 Reviews)
6.4
techvolcano

We can do this.

$30 USD in 3 days
(167 Reviews)
6.2
omaralieissa

Hi, i can do this using C#, and sure this will work on Windows, or if you want it to work on both windows and linux i can code the program in Java, thanks.

$35 USD in 1 day
(44 Reviews)
5.9
thanhhungqb

Dear sir, please see pmb for details, thanks.

$30 USD in 2 days
(55 Reviews)
5.0
pirlitu

Can have this ready for you in 2-3 days. I will be using C#, so the app will run on Windows. Can be ran via scheduled task w.o problems. Regards.

$40 USD in 3 days
(19 Reviews)
4.9
umesh1787

Have a kind look at PM to see my detail proposal & accept my bid to start work immediately.

$33 USD in 3 days
(10 Reviews)
5.2
santosoftvw

Dear Customer, I am experienced Java Programmer. I can convert HTML data into XML data by using Java HTTP API, Java XML API. I can do this job. Thank you.

$25 USD in 3 days
(21 Reviews)
4.4
proauthor

Hi, Ready to start your work. Eagerly awaiting for your positive reply. Please check your inbox for further details. Thanks, Shaik.

$30 USD in 2 days
(22 Reviews)
4.5
Kolesov

Hello, I will write for you windows service or console application using C#.

$54 USD in 3 days
(13 Reviews)
4.6
raul27868

Hello, I can do this work for you and I'm ready to start. Please see pmb for more details. Regards Raul

$35 USD in 3 days
(7 Reviews)
4.4
Fitzgeraldz

Freelancer Professional, I have 8 years experience in programming with C, C++, C#, Java, J2SE, J2ME, J2EE, Matlab, ASM, python, AutoLISP, Java Script, aspx, php, html5, css3, ajax, jquery, MVC , NET, WCF, Win32, MFC, S More

$38 USD in 3 days
(16 Reviews)
4.9
mz1

I am capable to do your request.

$25 USD in 3 days
(9 Reviews)
4.3
suruiqiang

hello, i can do it.

$30 USD in 3 days
(3 Reviews)
1.9