Evaluation of key-value databases for the storage of large process mining event logs

Contact

Name

Alessandro Berti

Software Engineer

Phone

work
+49 241 80 21912

Email

E-Mail
 

BSc Thesis Project

Title: Evaluation of key-value databases for the storage of large process mining event logs

Author: Niklas Schulte

Supervisor: Alessandro Berti

1st examiner: Prof.Dr. ir. Wil M.P. van der Aalst

2nd examiner: Prof. Dr.-Ing Ulrik Schroeder

Summary

Key-value databases are a fast and flexible storage solution. As the amount of process mining data increases constantly and new fields of process mining arise, appropriate storage solutions are necessary. Therefore, we investigated in key-value databases as a storage solution for large process mining event logs in this thesis. We have elaborated a method of storing event data in a key-value fashion and implemented two event stores in Python on top of these. An evaluation is performed with three real-life event logs on different aspects such as importing and exporting data as well as the ability to provide data samples in various orders of access.