I. Introduction
Human life is rapidly getting affected by the internet technology, but these enriched internet applications exposed numerous cybersecurity threats. In host malware detection, we use system calls generated by the malicious or benign executables. So, there is a tool named NITRSCT [1], which traces the sequential system calls and helps in the dataset creation. But, in the case of malicious websites, we need to monitor scripts. JavaScript language is uncomplicated and leads the three core technologies of the World Wide Web, along with HyperText Markup Language and Cascading Style Sheets. So, it is extensively accessible on the internet [2]. Attackers inject malicious JavaScript code into web-pages such as Trojan Viruses, obtaining user’s personal information and mining data [3], [4]. According to a report published by Tencent Anti-Virus Lab in 2018, VBS accounts for 50.65% of all non-portable executable virus and TOP2, a JavaScript virus accounts for 23.21%. Also, Microsoft security report indicates that count of JavaScript malware is largest in the first half of 2013 [5]. The JavaScript available on the webpage is adjustable and mercurial. Attackers encrypt or obfuscate the script to conceal its malicious behaviour from the malware detector. Some tools can deobfuscate the JavaScript code, but partially. The final steps for getting the real payload needs manual intervention, which is cost-intensive and complicated. The openness and span of the web applications spread the malicious code and fulfills the objective of the attackers. Thus, it is of utmost importance to detect the malicious JavaScript available on the web-pages. Traditional antivirus uses signature-based detection technique, which is imprecise since it is nonresistant to evolving and obfuscated JavaScript. Seeing the deluge of the malicious script and failure of malware detector in detecting them, our objective is to propose a JavaScript malware detection model based on defining feature selection from its analysis using malware-jail sandbox1. This model will improve the detection accuracy with least training as well as detection time. Our proposed model has the innovations listed below: