Write program to parse HTML and generate XML
$10-30 USD
Paid on delivery
I am looking for someone to write a script that retrieves the contents of a public web page and converts the data it contains into parseable XML. The web page is located here: [login to view URL]
Your script can be in any language, but needs to conform to these standards:
- This is a work for hire, and we get the source code and copyright.
- Must run on either Windows 7 or Linux. I can get you specific info about the box I'll be using.
- Must require minimal installation of frameworks or languages, and anything we do have to install must be free to install and run.
- Must output well-formed XML that can tolerate non-XML-safe characters. The specific schema is up to you, as long as it is reasonable and clean.
- Must capture all of the data elements provided on this page, including image URL and arrestee info as well as offense info.
- When you bid, tell us what platform and language you would prefer to use.
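One way to read the "non-XML-safe characters" requirement is sketched below in Python, using only the standard library. It assumes two things that the posting does not spell out: that markup characters such as & and < should be escaped, and that control characters which XML 1.0 forbids outright should simply be dropped. The helper names xml_safe and make_element are hypothetical, and the sample text is made up.

```python
import re
import xml.etree.ElementTree as ET

# Characters that XML 1.0 forbids even when escaped: most C0 controls,
# surrogates, and the non-characters U+FFFE and U+FFFF.
_ILLEGAL_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def xml_safe(text):
    """Drop characters that cannot appear in an XML 1.0 document."""
    return _ILLEGAL_XML_CHARS.sub("", text)


def make_element(tag, text):
    """Build an element; ElementTree escapes &, < and > when serializing."""
    element = ET.Element(tag)
    element.text = xml_safe(text)
    return element


if __name__ == "__main__":
    # Made-up sample text containing markup characters and a vertical tab.
    raw = "THEFT < $500 & RESISTING\x0b ARREST"
    print(ET.tostring(make_element("StatuteDesc", raw), encoding="unicode"))
```

Dropping the forbidden characters is only one possible policy; replacing them with a visible marker would work just as well if silent loss of data is a concern.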
We fully understand that the HTML you are looking at is crappy and subject to change in the future. Ideally your code is easy to understand so we can tweak it, or we may call you back if it needs updating. This should be a quick one for the right programmer.
Clarifications:
- Should run on Windows OR Linux, but doesn't have to be both.
- Output can be to a file or STDOUT.
- Needs to be able to run via Windows scheduled task, command line, or Linux cron job. Not looking for a GUI or web interface for this. This will be run by another program, so it should not require any human interaction in order to work.
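The clarifications above (file or STDOUT output, no human interaction, run from a scheduler) could be met by a command-line entry point along these lines. This is a sketch, not a finished script: the -o/--output flag, the function names, and the placeholder XML are all assumptions, and the fetch-and-parse step is deliberately left out.

```python
import argparse
import sys


def write_output(xml_text, path=None):
    """Write the XML to `path`, or to STDOUT when no path is given."""
    if path:
        with open(path, "w", encoding="utf-8") as handle:
            handle.write(xml_text)
    else:
        sys.stdout.write(xml_text)


def main(argv=None):
    """Run once without prompting; return 0 on success, 1 on failure."""
    parser = argparse.ArgumentParser(description="Convert the arrest page to XML.")
    parser.add_argument("-o", "--output", help="output file (default: STDOUT)")
    args = parser.parse_args(argv)
    try:
        # Placeholder document: a real script would fetch and parse the page here.
        xml_text = '<?xml version="1.0" encoding="UTF-8"?>\n<arrests/>\n'
        write_output(xml_text, args.output)
    except Exception as error:
        # Report on STDERR and return nonzero so the calling program can detect failure.
        print(f"error: {error}", file=sys.stderr)
        return 1
    return 0
```

Ending the script with sys.exit(main()) under the usual `if __name__ == "__main__":` guard hands the exit code to the Windows scheduled task, cron job, or calling program.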
More clarifications:
- Java and Perl are both fine languages to use. But we're open to any others as well, as long as there is not much for us to install.
- I don't have a sample XML file for you, but here are some examples of the fields I'm looking for from the web page:
TABLE tt-arrest
FIELD BookingNumber AS INTEGER
FIELD ImageURL AS CHARACTER
FIELD LastName AS CHARACTER
FIELD FirstMiddle AS CHARACTER
FIELD Age AS INTEGER
FIELD DateTimeConfined AS DATETIME
TABLE tt-offense
FIELD BookingNumber AS INTEGER
FIELD OffenseNum AS INTEGER
FIELD StatuteDesc AS CHARACTER
FIELD BondType AS CHARACTER
FIELD BondAmount AS INTEGER
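Since the schema is left to the bidder, here is one possible mapping of the two tables above onto XML, sketched in Python with the standard library. Nesting each offense under its arrest through the shared BookingNumber is an assumption, as are the element names arrests, arrest, offenses, and offense, and the choice to write DateTimeConfined as an ISO 8601 string. The records in the demo are made up.

```python
import xml.etree.ElementTree as ET

ARREST_FIELDS = ["BookingNumber", "ImageURL", "LastName",
                 "FirstMiddle", "Age", "DateTimeConfined"]
OFFENSE_FIELDS = ["OffenseNum", "StatuteDesc", "BondType", "BondAmount"]


def build_document(arrests, offenses):
    """Nest each offense under the arrest that shares its BookingNumber."""
    root = ET.Element("arrests")
    offenses_by_booking = {}
    for arrest in arrests:
        node = ET.SubElement(root, "arrest")
        for field in ARREST_FIELDS:
            ET.SubElement(node, field).text = str(arrest[field])
        offenses_by_booking[arrest["BookingNumber"]] = ET.SubElement(node, "offenses")
    for offense in offenses:
        parent = offenses_by_booking[offense["BookingNumber"]]
        node = ET.SubElement(parent, "offense")
        for field in OFFENSE_FIELDS:
            ET.SubElement(node, field).text = str(offense[field])
    return root


if __name__ == "__main__":
    # Made-up records, for illustration only.
    arrests = [{"BookingNumber": 1001, "ImageURL": "http://example.com/1001.jpg",
                "LastName": "DOE", "FirstMiddle": "JOHN Q", "Age": 34,
                "DateTimeConfined": "2013-07-01T14:30:00"}]
    offenses = [{"BookingNumber": 1001, "OffenseNum": 1,
                 "StatuteDesc": "EXAMPLE OFFENSE", "BondType": "CASH",
                 "BondAmount": 500}]
    print(ET.tostring(build_document(arrests, offenses), encoding="unicode"))
```

Keeping the field names identical to the list above means a change to the page only touches the parsing code, not the consumers of the XML.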
Project ID: #4725751
About the project
26 freelancers are bidding on average $34 for this job
Experience and seriousness here with many IT tools, including customizable scraping scripts for different tasks. Please take a look at your PM for details. Thanks.
Hi, I can do this using C#, and this will certainly work on Windows. If you want it to work on both Windows and Linux, I can code the program in Java. Thanks.
Can have this ready for you in 2-3 days. I will be using C#, so the app will run on Windows. Can be run via scheduled task without problems. Regards.
Have a kind look at your PM to see my detailed proposal and accept my bid to start work immediately.
Dear Customer, I am an experienced Java programmer. I can convert HTML data into XML data by using the Java HTTP API and the Java XML API. I can do this job. Thank you.
Hi, ready to start your work. Eagerly awaiting your positive reply. Please check your inbox for further details. Thanks, Shaik.
Hello, I can do this work for you and I'm ready to start. Please see pmb for more details. Regards Raul
Freelancer Professional, I have 8 years experience in programming with C, C++, C#, Java, J2SE, J2ME, J2EE, Matlab, ASM, python, AutoLISP, Java Script, aspx, php, html5, css3, ajax, jquery, MVC, NET, WCF, Win32, MFC, S