Towards Classification of Malware on the Basis Their Characteristics and Importance Mining of Features

dc.contributor.advisorDubey, Shubham
dc.contributor.authorRaza, Muhammad Hassan
dc.contributor.authorDubey, Shubhankar
dc.contributor.departmentDE--Informatikai Karhu_HU
dc.date.accessioned2021-04-30T08:53:14Z
dc.date.available2021-04-30T08:53:14Z
dc.date.created2020-04-22
dc.description.abstractThere are several websites, applications and resources that a user visits every day. Some of the resources have malicious threats and harmful entities. It becomes to be careful and identify such resources in advance to save our system maintain our privacy. This malicious software is called Malware, which can be of any type as Virus, Trojan horse, Worms, spam, adware etc. This study is developing four classification models to identify such threats. The algorithms used are Support vector machine, Decision tree algorithm, KNN classification algorithms and Naïve Bayesian classification. Derived models are tested for their accuracy using precision, recall, F-1 score and ROC curves. The models are trained and tested with the recorded data in virtual machine of LINUX. The data consisting 100000 dataset of 35 attributes. The ratio of Malware and Benign is 1:1. Study found decision tree algorithm and KNN classification are the first and two most accurate models of classification respectively. During the preprocessing the attributes were removed up to 17. Study also finds that the static priority, system time, free cache area and reserved area in virtual machine are the factors significantly affecting the classification. Static priority is the main factor which is having the most significant importance and importance values is 0.52. The study will be helpful for security experts and wide area users of internet to identify whether a resource contains any malicious threats or not.hu_HU
dc.description.courseComputer Sciencehu_HU
dc.description.degreeBSc/BAhu_HU
dc.format.extent20hu_HU
dc.identifier.urihttp://hdl.handle.net/2437/307946
dc.language.isoenhu_HU
dc.subjectMalwarehu_HU
dc.subjectdecision treehu_HU
dc.subjectKNN classificationhu_HU
dc.subjectSVM modelhu_HU
dc.subjectNaïve Bayesian modelhu_HU
dc.subjectstatic priorityhu_HU
dc.subjectclassificationhu_HU
dc.subjectmachine learninghu_HU
dc.subjectanalysishu_HU
dc.subject.dspaceDEENK Témalista::Informatikahu_HU
dc.titleTowards Classification of Malware on the Basis Their Characteristics and Importance Mining of Featureshu_HU
Fájlok
Gyűjtemények