R is a high-level language and environment for data analysis and data visualization. Although R can be used as a calculator and includes all the important basic statistical methods, its main benefit is its open-source philosophy, which makes R highly extensible and gives users access to new, cutting-edge applications in many different fields. R's popularity has grown steadily in recent years, and it is now arguably the most popular software for data analysis in the statistical community. The course starts with an introduction to R, covers some elementary statistical techniques, and then continues with more advanced methods. In particular, the course will focus on the following:
- Linear Models (Standard Linear Regression and Analysis of Variance)
- Generalized Linear Models (Binary, Multinomial and Proportional-Odds Logistic Regression, Poisson and Negative-Binomial Regression)
- Decision Trees
- Naive Bayes Classifiers
- k-NN Classifiers
- Cluster Analysis (Hierarchical and Non-Hierarchical)
- Correspondence Analysis
Additional methods may be covered as well, depending on students' interests, e.g.,
- Principal Components Analysis
- Social Network Analysis
- Mixed-Effects Models
- ...