I. Introduction
To be able to detect effectively malware programs through Supervised Machine Learning approaches, we needed some really big labeled datasets to analyze. We started from the DREBIN dataset used in the DREBIN paper [1], a 123K labeled Android application dataset, containing 5,560 malwares, resulting in a rate of 95.4% benign programs.