Topic Subject Creation Using Unsupervised Learning for Topic Modeling


  •  Rashid Mehdiyev    
  •  Jean Nava    
  •  Karan Sodhi    
  •  Saurav Acharya    
  •  Annie Ibrahim Rana    

Abstract

We address the problem of topic mining and labelling in the domain of retail customer communications to summarize the subject of customers inquiries. The performance of two popular topic mining algorithms - Non-Negative Matrix Factorization (NMF) and Latent Dirichlet Allocation (LDA) – were compared, and a novel method to assign topic subject labels to the customer inquiries in an automated way was proposed. Experiments using a retailer’s call center data verify the efficacy and efficiency of the proposed topic labelling algorithm. Furthermore, the evaluation of results from both the algorithms seems to indicate the preference of using Non-Negative Matrix Factorization applied to short text data.



This work is licensed under a Creative Commons Attribution 4.0 License.