Print Email Facebook Twitter A distributed and scalable real-time log analysis Title A distributed and scalable real-time log analysis Author Proost, Rick (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Finavaro Aniche, M. (mentor) van Deursen, A. (graduation committee) Katsifodimos, A. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science Date 2020-10-20 Abstract Monitoring software behaviour is being done in various ways. Log messages are being output by almost any kind of running software system. Therefore, learning how software behaves from doing analysis over log data can lead to new insights about the system. However, the number of log messages in a computer system grow fast, and analysing the log data by hand is a time-consuming job. The objective of this study is to propose and implement a scalable architecture for doing real-time log analysis. Log data is structured so that analysis can take place, and the solution is horizontally scalable in every module so that the approach can scale with an ever-growing software solution. The focus of the study is on scalability, and ease-of-use of the implementation of the proposed approach. The proposed solution can scale horizontally and the test set up showed that reporting features for anomalies remained instantaneous when processing 1.2 million log lines per minute. The usability of the proposed approach is tested in a case study at Weave, where bugs were found by running the proposed solution in a controlled environment. Subject Scalable Log Data AnalysisDistributed SystemsReal-time Log Data AnalysisSoftware Monitoring To reference this document use: http://resolver.tudelft.nl/uuid:d01caad2-e537-4a1d-b0ca-c83db77cf1fe Part of collection Student theses Document type master thesis Rights © 2020 Rick Proost Files PDF thesis.pdf 1.47 MB Close viewer /islandora/object/uuid:d01caad2-e537-4a1d-b0ca-c83db77cf1fe/datastream/OBJ/view