Process Mining in Celonis (MSc)
Language: the language of the course is English; therefore, all meetings and written reports will be in English.
Introduction
Process Mining is a growing branch of Data Science that analyzes event data recorded in information systems from a process perspective. Investments in Process Mining from public and private companies are steadily increasing, and are expected to more than double in the next five years. Hence, a good knowledge of Process Mining is an important skill for Data Scientists.
Through its Academic Alliance, Celonis, the market leader, gives students the opportunity to learn and use the Celonis platform on real-life problems and to analyze event data extracted from real-life systems. In particular, students accepted into this lab will propose extensions to the features of the Celonis platform, using different technologies and languages (Python, PyCelonis, JavaScript, Angular).
Successful students will be prioritized for internships and thesis projects offered by the Celonis Lab in Aachen.
The course will use Python as the core implementation language. Students are expected to follow software engineering principles throughout the lab.
Introductory Sessions
All the above topics will be introduced in brief. Participation is mandatory throughout the course. In the introductory sessions, topics will be assigned to the students, and the deadlines for submitting the report and implementation will be discussed.
Groups will be formed to work on the assignments.
Student work structure
Students will be required to understand and implement the assignment requirements in Python and provide proper visualizations. A proper software development life cycle (SDLC) will be followed during this phase to track the development. The details of the methodology will be communicated in the introductory session.
Each student should individually produce a written report on the implementation, its advantages, and its issues.
Grading
The grading will take into account the written report and the Python code implemented. Moreover, a final oral examination is required.
Location: the course will be held primarily on Zoom, with the possibility of contacting the reference persons at PADS (RWTH) and the Celonis Lab in Aachen.
Assumptions: some assignments will require the students to sign Non-Disclosure Agreements (NDA) to access the code/data of the project. Alternative assignments will be given to students refusing to sign the NDA.
Prerequisites
- Software engineering knowledge (design, development, and testing)
- Prior programming experience, not necessarily in Java or Python
- Interest in learning and coding in Python
Optionals
Resources:
- PM4Py: installation tutorial
- PM4Py: documentation
- Python Tutorial (The Python Foundation)
- Interactive Tutorial covering the basics of Python
- Introduction to Git
- Introduction to Sprint Planning / SCRUM
- Introduction to Unit Testing
- (Advanced) Python design patterns
- (Advanced) Deploying a Flask application on Docker
- Celonis PQL: a Query Language for Process Mining
- Access to the Celonis Academy for free courses (how to upload data, PQL, ...)
Registration
Registration takes place through the central registration process in January 2023.
To increase your chance of being selected for this lab, please state your qualifications, experience, and overall grades in your enrolled study program in as much detail as possible. Please state clearly why you are a suitable candidate for this lab.
You will be informed about the first meeting in the weeks after the registration closes.
Contact
Eduardo Goulart Rocha