Minority Sub-region Estimation-based Oversampling for Imbalance Learning


One of the challenges that has arisen in recent years is a problem known as class imbalance, which is characterized by a skew distribution that favors the majority. Several different oversampling strategies have been proposed as a solution to this issue. Some of these oversampling strategies combine the oversampling procedure with the clustering algorithm, which ensures that newly generated synthetic samples will be grouped together. Nevertheless, due to a property of the clustering algorithm itself, dispersed samples that share the same minority sub-region but come from different geographic locations tend to be divided into distinct groups. As a result, the procedure for oversampling that follows is carried out the majority of the time in incomplete minority subregions where synthetic samples do not adequately cover the entire minority region. In addition, to the best of our knowledge, none of the currently available algorithms is designed to directly estimate minority sub-regions with regard to the class imbalance problem. First of all, a brand new grouping algorithm that has been given the name Direction Distribution-based Minority Sub-region Estimation (DDMSE) has been proposed. The new algorithm makes use of the intuitive observation that minorities with the same sub-region almost always distribute in the same direction when compared to other majorities. This allows the new algorithm to estimate minority sub-regions while deftly ignoring negative impacts brought on by the distance factor, similar to how clustering algorithms do. Last but not least, in those minority sub-regions, brand new synthetic samples are produced. The experimental results on real-world datasets show that the performance of this oversampling method is comparable to that of other cutting-edge oversampling methods.

Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here

PROJECT TITLE :Minority Oversampling in Kernel Adaptive Subspaces for Class Imbalanced Datasets - 2018ABSTRACT:The class imbalance drawback in machine learning occurs when sure classes are underrepresented relative to the others,
PROJECT TITLE :Network Traffic Classification Using Correlation Information - 2013ABSTRACT:Traffic classification has wide applications in network management, from security monitoring to quality of service measurements. Recent
PROJECT TITLE :The Generalization Ability of Online Algorithms for Dependent Data - 2013ABSTRACT:We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data.

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry