Story swapper algorithm and data conversion tool (in python)
$30-250 USD
Completed
Posted over 13 years ago
$30-250 USD
Paid on delivery
I have files containing hundreds of stories and meta data that need to be processed by matching algorithm(s) into a new CSV file.
Work processing:
I need a python program that can take an XML data file and process it to create a CSV output file. This CSV file will be manually uploaded into Clickatell. I also need a means to log which phone number received which story each time the algorithm is run.
Algorithm (the fun part!):
The input XML contains 3000 stories and meta data that will be used to match each storyteller to a different person's story, based on the similarity between the story they told and another story in the set. Each unique phone number will receive one story, so not all stories will match each time it is run. We can discuss the matching criteria, but the metadata will be sufficient and quantitative so that this will not involve a huge amount of *nautural language processing* - but some will be essential. The output file enables a mass "story swap" via clickatell SMS.
I need in python so that I can maintain the code after you write it. Story meta data categories (questionnaire responses) will change, but the tree structure will not.
Program doesn't need to be fancy, but it does need to accept a few user inputs, such as choosing from one of several matching techniques based on the meta data filters, and producing the right CSV output file with an option to edit the SMS template that is applied to all output rows before saving the new CSV file.
------------------------------------------
I also need to log which story (by ID#) has already been sent out to each user, so that no user ever receives the same story twice. This log system should also be CSV friendly, so that it can be imported into Mysql or stored in MYSQL directly. CLickatell does not work with mysql, so the CSV step is required.
PLEASE LOOK AT THE FILES ATTACHED TO THIS PROPOSAL - SAMPLE PROVIDED
*** I had to repost this assignment because my first programmer bailed on me. The last winning bid was $100 and I will not entertain any bids higher than that. I want timeliness, good collaboration, and quality code.