SFB1036 Course

Data Analysis for Biologists

Simon Anders

SFB 1036

Course: Best Practices of Data Analysis for Biologists

These pages contain material for the data analysis course taught by me in fall 2018 at ZMBH

Lecture 1: Exploring the NHANES data

This lecture explains how to use R/tidverse and ggplot to explore a data set.

HTML fileRmd file

(The HTML file is for reading in your web browser; the Rmd file allows you to open the lecture as notbook in RStudio and rerun all computations.)

I have also recorded the lecture, and uploaded it here. Not too great quality, though, but maybe useful if you missed the lectrue or a part of it.

Lecture 2: Inference the pedastrian way

In this lecture, we carefully approach the topic of inference, and study the standard error of the mean

HTML fileRmd file

Lecture 3: Test statistics, p values and all that

This lecture discussed Fisher test and binomial test to illustrate the idea behind p values

HTML fileRmd file

For this lecture, I also have a recording

Lecture 4: t tests, Bayes theorem

This lecture discussed the t test and Bayes theorem

HTML fileRmd file

Lecture recording

Lecture 5: Analysing plate-reader data

This lecture shows an end-to-end analysis of a drug perturbation experiment on primary lymphoma cells.

HTML fileRmd file

Lecture recording

Lecture 6: RNA-Seq data analysis: Preprocessing

This lecture shows how to get from RNA sequencing data to a count table

HTML fileRmd file

Lecture 6 recording

Lecture 7/8: RNA-Seq data analysis: Downstream analysis

HTML fileRmd file

Lecture 7 recording and Lecture 8 recording

Lecture 9: RNA-Seq data analysis: Gene set enrichment etc

HTML fileRmd file

recording

Lecture 10: One-way ANOVA

HTML fileRmd file

recording