A Machine Learning Framework for Domain Generation Algorithm (DGA)-Based Malware Detection


To alter Communication, attackers typically employ a command and control (C2) server. Threat actors frequently use a domain generation algorithm (DGA) to carry out an attack, which allows malware to communicate with C2 by producing a variety of network locations. Blacklisting and other traditional malware control approaches are ineffective against DGA attacks. To mitigate the threat, we present a Machine Learning system for recognizing and identifying DGA domains in this research. Over the course of a year, we collect real-time threat data from real-life traffic. A Deep Learning approach is also proposed to classify a large number of DGA domains. A two-level model and a prediction model are included in the proposed Machine Learning framework. We first characterize DGA domains as distinct from normal domains in the two-level model, and then utilize the clustering method to discover the algorithms that generate those DGA domains. Based on the hidden Markov model, a time-series model is built to forecast incoming domain features in the prediction model (HMM). We also built a deep neural network (DNN) model to improve the proposed Machine Learning framework by handling the massive dataset we accumulated over time. The correctness of the suggested framework and the DNN model is demonstrated by our comprehensive experimental data. To be more specific, the framework achieves a classification accuracy of 95.89 percent and a DNN model accuracy of 97.79 percent, second-level clustering accuracy of 92.45 percent, and HMM prediction accuracy of 95.21 percent.

Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here

PROJECT TITLE : MAGNETIC: Multi-Agent Machine Learning-Based Approach for Energy Efficient Dynamic Consolidation in Data Centers ABSTRACT: Two of the most significant challenges for effective resource management in large-scale
PROJECT TITLE : Millimeter-Wave Mobile Sensing and Environment Mapping Models, Algorithms and Validation ABSTRACT: One relevant research paradigm, particularly at mm-wave and sub-THz bands, is to integrate efficient connectivity,
PROJECT TITLE : Proposing Causal Sequence of Death by Neural Machine Translation in Public Health Informatics ABSTRACT: Over 2.7 million people pass away every year in the United States alone, contributing to the annual global
PROJECT TITLE : MM-UrbanFAC Urban Functional Area Classification Model Based on Multimodal Machine Learning ABSTRACT: The majority of the classification methods that are currently used for urban functional areas are only based
PROJECT TITLE : Performance Improvement of a Parsimonious Learning Machine Using Metaheuristic Approaches ABSTRACT: When dealing with data stream mining, autonomous learning algorithms operate in an online fashion. This is desirable

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry