Find Jobs
Hire Freelancers

Docker+Script to write

$8-15 USD / hour

Cancelled
Posted over 8 years ago

$8-15 USD / hour

Create an api that we can POST a file to and get HTML code out of. Possible inputs: 1) Text file: Just spit out contents 2) Image file: OCR with tesseract 3) PDF file -- a) Scanned: Break out into individual pages and save as images; OCR with tesseract -- b) Embedded text: Return HTML with pdftk 4) Other file: Attempt to read with LibreOffice running Headless 5) url + jQuery selector path: Read the html code of the url, and return html and images (excluding some selectors). See attached website.json. Possible implementations A) Extend [login to view URL] with worker containers written in Golang. B) Write the code in node.js, use the same Docker container system with a RabbitMQ server (Jeff/Alex can help with the Docker setup) References: * open-ocr: [login to view URL] * bash script that runs 1-4 above: [login to view URL] * attached [login to view URL] for #5 above Final deliverables will be released as an Open Source project on Github (Apache license).
Project ID: 8599580

About the project

Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

About the client

Flag of UNITED STATES
Oakland, United States
5.0
2
Member since Aug 4, 2015

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.