I. Introduction
Many data sets are continually streaming into today’s computing and networking systems from weblogs, financial transactions, health records, surveillance logs, business, telecommunications and bio-sciences. Furthermore, logging has become a widely accepted and significant habit [1]. It is the process of recording occurrences on a computer system. The data is saved in what is known as a log file. This subject has lately become a focus of studies and is referred to as "big data", a phrase that indicates the massive and spread nature of the data collections. According to Gartner [2], big data is defined as high-volume, high-velocity and high-variety data sets that necessitate cost-effective innovative data analytics for decision-making and inferring relevant insights.