On my job i have a dateset of more than 10K job titles. I would like to cluster them into a smaller dataset.
I have tried the approaches for clustering available in OpenRefine, as well as a modifed clustering using Jaccard's similarity.
I managed to cluster the titles into a smaller dataset, however, I was wondering if any fellow wants to recommend me a better approach.
[link][1 comment]