0664 Advanced Data Analysis with R
Dr. Marcus Wurzer
Contact details
Weekly hours
Language of instruction
09/21/21 to 09/30/21
Registration via LPIS
Notes to the course
Day Date Time Room
Monday 10/04/21 01:15 PM - 02:15 PM D2.0.025 Workstation-Raum
Monday 10/11/21 01:45 PM - 03:45 PM D2.0.025 Workstation-Raum
Monday 10/18/21 01:45 PM - 03:45 PM D2.0.025 Workstation-Raum
Monday 11/08/21 01:45 PM - 03:45 PM D2.0.025 Workstation-Raum
Monday 11/15/21 01:15 PM - 03:15 PM D2.0.025 Workstation-Raum
Monday 11/22/21 01:15 PM - 03:45 PM Online-Einheit
Monday 11/29/21 01:15 PM - 03:45 PM Online-Einheit
Monday 12/06/21 01:15 PM - 03:45 PM Online-Einheit
Monday 12/13/21 01:15 PM - 03:15 PM Online-Einheit
Monday 12/20/21 01:15 PM - 03:15 PM Online-Einheit
Monday 01/10/22 01:15 PM - 03:15 PM D2.0.025 Workstation-Raum
Monday 01/17/22 01:15 PM - 03:15 PM D2.0.025 Workstation-Raum
Wednesday 01/19/22 11:45 AM - 01:45 PM D2.0.025 Workstation-Raum
Monday 01/24/22 12:45 PM - 02:45 PM D2.0.025 Workstation-Raum


R is a high-level language and an environment for data analysis and data visualization. While R can be used as a calculator and all important basic statistical methods are included as well, the main benefit is its open-source philosophy which makes R highly extensible and renders possible the availability of new, cutting edge applications in many different fields. The popularity of R increased constantly during the last years and by now, it is arguably the most popular software for data analysis in the statistical community. The course starts with an introduction to R, covers some elementary statistical techniques and then continues with more advanced methods. In particular, the course will focus on the following:

  • Linear Models (Standard Linear Regression and Analysis of Variance)
  • Generalized Linear Models (Binary, Multinomial and Proportional-Odds Logistic Regression, Poisson and Negative-Binomial Regression)
  • Decision Trees
  • Naive Bayes Classifiers
  • k-NN Classifiers
  • Cluster Analysis (hierarchical and non-hierarchical)
  • Correspondence Analysis

Additional methods may be covered as well, depending on student's interest, e.g.,

  • Principal Components Analysis
  • Social Network Analysis
  • Mixed-Effects Models
  • ...

Learning outcomes

On completion of the course students are able to:

  • manipulate and visualize data in R
  • understand the theory and functionality of the methods employed in the course
  • apply the adequate statistical methods to a given problem and perform the statistical calculations using R
  • interpret the results of such analyses
  • communicate and discuss the results of the statistical analysis of data

Attendance requirements

  • Attendence is compulsory. Students have to be attend classes for at least 80% of the total time, i.e., 18 of 22.5 hours. If you miss a class, please inform me in advance!
  • Notice of special regulation for Covid-19: If a student is required to quarantine, or is otherwise prevented from attending class, due to a certified case of Covid-19 infection or a federally mandated Covid-19 lockdown, and this affects either attendance or the completion of an exam or other required course assignment, the course instructor is empowered to provide an alternative means for said student to meet the attendance/assessment requirement as necessary. The same means will be required of any student in the same situation in the same course.

Teaching/learning method(s)

Lectures, Practicals


  • development of a project concept (10 %)
  • written report on the analysis of a dataset using advanced statistical methods (50 %)
  • oral presentation of analysis results (40 %)

Availability of lecturer(s)
Last edited: 2021-09-29