My recent project was about clustering of documents using k-means using scikit python library so I am ready to apply it directly to your needs.
Relevant Skills and Experience
Machine learning, clustering, bag-of-words, python programming, pycharm editor, jupyter notebook for easy visualization.
Proposed Milestones
$60 SGD - Initial setup, data analysis, basic data pipeline with LSA & clustering on small data sample.
$51 SGD - Aplication to the whole dataset.
Additional Services Offered
$1 SGD - Instead of tf-idf and bag-of-words we can use FastText semantic embedding vectors.