Deep Feature-Based Text Clustering and Its Explanation


The text mining community has devoted a significant amount of time and energy to the research of text clustering as it is an essential step in the text data analysis process. The majority of the text clustering algorithms that are currently in use are based on a model called the bag-of-words, which suffers from problems such as high-dimensionality and sparsity and ignores text structural and sequence information. The models that are based on Deep Learning, such as convolutional neural networks and recurrent neural networks, regard texts as sequences; however, these models do not have supervised signals and do not produce results that can be explained. In this paper, we propose a deep feature-based text clustering (DFTC) framework that integrates pretrained text encoders into text clustering tasks. DFTC stands for deep features, deep features-based text clustering, and deep feature-based text clustering. The dependence on supervision is broken with the help of this model, which is predicated on sequence representations. The results of the experiments show that our model performs better than traditional text clustering algorithms and the most advanced pretrained language model available today, known as BERT, on almost all of the datasets that were taken into consideration. In addition, understanding the principles underlying the Deep Learning approach is significantly aided by the explanation of the clustering results. The explanation module that is included in our proposed framework for clustering is designed to provide users with assistance in comprehending the significance of the clustering results as well as their overall quality.

Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here

PROJECT TITLE : Secure and Efficient Data Transmission for Cluster-Based Wireless Sensor Networks - 2014 ABSTRACT: Secure data transmission is a critical issue for wireless sensor networks (WSNs). Clustering is an effective
PROJECT TITLE :T-Drive Enhancing Driving Directions with Taxi Drivers’ Intelligence - 2013ABSTRACT:This paper presents a smart driving direction system leveraging the intelligence of experienced drivers. In this system, GPS-equipped
PROJECT TITLE :A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data - 2013ABSTRACT:Feature selection involves identifying a subset of the most useful features that produces compatible results as

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry