Unsupervised-Thai-Document-Clustering-with-Sanook-news
PublicAn unsupervised model to clustering Thai news. Using TD-IDF, SimCSE-WangchanBERTa with weighted by number of named entities as a vector representation, and using k-means as an clustering model.