Malware detection using n-gram with TF-IDF weighting

In this era of technology, computers and networks are exposed to malwares. Malwares are also known as malicious software. Malwares are created to disrupt, destroy or to gain authorization in access in a computer system. There are different types of software and methods that have been implemented tha...

Full description

Bibliographic Details
Main Author: Natasha, Zainal
Format: Undergraduates Project Papers
Language:English
Published: 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/26839/
http://umpir.ump.edu.my/id/eprint/26839/
http://umpir.ump.edu.my/id/eprint/26839/1/Malware%20detection%20using%20n-gram%20with%20TF-IDF.pdf
Description
Summary:In this era of technology, computers and networks are exposed to malwares. Malwares are also known as malicious software. Malwares are created to disrupt, destroy or to gain authorization in access in a computer system. There are different types of software and methods that have been implemented that are used to detect different types of malware. Powerful malware that was implemented may not get easily detected. Different kinds of anti-virus and methods were used, nevertheless the problem is that this may not fully detect the malware as malwares now a days are hard to detect. The objectives of this research is to identify the attributes of malware, to develop a conceptual model of malware detection using n-gram and TF-IDF and to evaluate the model of malware detection. The scope for this research are dataset, method and evaluation testing and measurements. The methodology are literature review based on previous research, identifying the attributes of malware, developing the conceptual model and lastly, evaluating the conceptual model. The model is implemented by using Python programming language. By using this method, the expected result of this system is based on the n-gram and TF-IDF, thus malware could be detected.