Bérczes, TamásGaskel, Gregory2023-04-282023-04-282023-04-28https://hdl.handle.net/2437/351195In this thesis, we apply classification models to anomaly-based intrusion detection using system call traces. Several feature extraction techniques commonly used for vectorizing system call traces are examined, including the Boolean Model, Simple Vector Space Model, Traditional N-Gram Vector Space Model, and the N-Gram TF-IDF Model. We then propose a novel variable-length feature extraction framework based on the N-Gram TF-IDF Model, whereby n-gram terms of various length are included in the feature set. We then evaluate the performance of each feature extraction approach with the Australian Defense Force Academy Linux Dataset (ADFA-LD) using three classification models: linear discriminant analysis, random forest classification, and logistic regression. By computing performance metrics, including accuracy, precision, recall, F-measure, false positive rate, and area under the curve (AUC), we obtain insight into the trade-off between model complexity and performance.38encybersecuritysystem call tracesintrusion detectionclassification modelsAnomaly-Based Intrusion Detection Using Classification Models: An Analysis of System Call TracesDEENK Témalista::MatematikaHozzáférhető a 2022 decemberi felsőoktatási törvénymódosítás értelmében.