5148 Data Processing 2: Scalable Data Processing, Legal & Ethical Foundations of Data Science
Dr. Sabrina Kirrane, Astrid Krickl, MSc.
Contact details
  • Type
  • Weekly hours
  • Language of instruction
01/31/22 to 05/03/22
Registration via LPIS
Notes to the course
Subject(s) Bachelor Programs
Day Date Time Room
Tuesday 05/10/22 09:00 AM - 12:00 PM D5.0.001
Tuesday 05/17/22 09:00 AM - 12:30 PM TC.-1.61
Tuesday 05/24/22 09:00 AM - 12:30 PM TC.-1.61
Tuesday 06/07/22 09:00 AM - 12:30 PM D4.0.022
Tuesday 06/14/22 09:00 AM - 12:30 PM LC.-1.038
Tuesday 06/21/22 09:00 AM - 12:30 PM TC.-1.61
Tuesday 06/28/22 09:00 AM - 12:30 PM TC.2.01


This fast-paced class is intended for students interested in scalable handling of big data, understanding legal fundamentals and ethical frameworks in dealing with data in an international context. The course focuses on gaining fundamental knowledge in dealing with large amounts of data and learning about efficient and scalable processing methods. Throughout the course there will be an emphasis on important aspects regarding legal and ethical principals related to data processing and data science.

Learning outcomes

Students in the course will learn about the scalable handling of big data, understanding legal fundamentals and ethical frameworks in dealing with data in an international context.

This includes:

  • Basic knowledge about different scalable data processing frameworks and paradigms, including:
    • The Hadoop ecosystem
    • Batch processing with Apache Spark
    • Stream processing with Apache Kafka
  • Ethical frameworks
  • Intellectual Property Rights / handling of different licensing schemes
  • Legal protection of personal data and typical privacy issues
  • Algorithmic bias

Attendance requirements

According to the examination regulation full attendance is intended for a PI. Absence in one unit is tolerated if a proper reason is given.

If a student cannot attend a particular class, the student should send an email to the course instructor before the class starts, providing a legitimate justification for their absence.

All students are expected to attend the 1:1 project feedback sessions with the instructors.

Teaching/learning method(s)

The course will focus on in-class code walkthroughs of high-quality, well-commented code that students can later reference.
The course puts a particular emphasis on in-class discussion and project work.


Homework: 15% 

Project Proposal: 15%

Project: 70% (the project will mainly consist of adaptations and discussion of the practical examples presented in class)

Prerequisites for participation and waiting lists

Successful conclusion of the course 1 of SBWL Data Science.

Please be aware that for all courses in this SBWL registration is only possibly for students who successfully have completed the entry course (Einstieg in die SBWL: Data Science).

Note that for courses within the SBWL "Data Science" we can only accept students enrolled in one of WU's bachelor programmes who qualify for starting an SBWL; particularly, we cannot accept students from other courses and programmes enrolled at WU as 'Mitbeleger' only.

Availability of lecturer(s)

During the lecture and based on individual appointments. To request an appointment send an email to the lecturers with the subject “[Data Processing 2]”.

Last edited: 2022-03-28