Xanthus: Push-button Orchestration of Host Provenance Data Collection

Xanthus Workflow

Abstract

Host-based anomaly detectors generate alarms by inspecting audit logs for suspicious behavior. Unfortunately, evaluating these anomaly detectors is hard. There are few high-quality, publicly available audit logs, and there are no pre-existing frameworks that enable push-button creation of realistic system traces. To make trace generation easier, we created Xanthus, an automated tool that orchestrates virtual machines to generate realistic audit logs. Using Xanthus' simple management interface, administrators select a base VM image, configure a particular tracing framework to use within that VM, and define post-launch scripts that collect and save trace data. Once data collection is finished, Xanthus creates a self-describing archive, which contains the VM, its configuration parameters, and the collected trace data. We demonstrate that Xanthus hides many of the tedious (yet subtle) orchestration tasks that humans often get wrong; Xanthus avoids mistakes that lead to non-replicable experiments.
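To make the idea of a self-describing archive concrete, the following is a minimal sketch in Python. It is not Xanthus' actual interface or archive format: the function names, the configuration keys (`base_image`, `tracer`, `post_launch`), and the `manifest.json` layout are all assumptions made for illustration. The sketch also omits the VM image itself and only bundles the configuration and trace data.

```python
import hashlib
import io
import json
import tarfile


def build_archive(config, traces, out_path):
    """Write a gzip-compressed tar with the traces and a manifest describing them.

    `config` is a JSON-serializable dict (hypothetical keys, not Xanthus' schema).
    `traces` maps a file name to the raw bytes of that trace.
    """
    if "manifest.json" in traces:
        raise ValueError("trace name collides with the manifest")

    # The manifest records the experiment configuration plus a checksum per
    # trace, so the archive can be checked without any external notes.
    manifest = {
        "config": config,
        "traces": {
            name: hashlib.sha256(data).hexdigest()
            for name, data in traces.items()
        },
    }
    members = dict(traces)
    members["manifest.json"] = json.dumps(
        manifest, indent=2, sort_keys=True
    ).encode("utf-8")

    with tarfile.open(out_path, "w:gz") as tar:
        for name, data in sorted(members.items()):
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return manifest


def read_manifest(archive_path):
    """Return the manifest stored inside an archive built by build_archive."""
    with tarfile.open(archive_path, "r:gz") as tar:
        return json.load(tar.extractfile("manifest.json"))
```

A caller might pass a configuration such as `{"base_image": "example-image", "tracer": "example-tracer", "post_launch": ["collect.sh"]}` together with the collected log bytes. Storing the configuration next to the data is the point of the design: anyone who receives the archive can see how the trace was produced and check that the files are intact.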

Publication
In Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems
Xueyuan Michael Han-Vanbastelaer
Assistant Professor

My research interests include systems security and privacy, data provenance, and graph analysis.
