Welcome! This primer provides a concise introduction to conducting applied analyses of microbiome data in R. While this primer does not require extensive knowledge of programming in R, the user is expected to install R and all packages required for this primer.
Please install the required software and download the example data before coming to the workshop.
This primer provides a concise introduction to conducting the statistical analyses and visualize microbiome data in R based on metabarcoding and high throughput sequencing (HTS). This primer does not cover “shotgun” metagenomic analysis, which is very different in nature. The reader is expected to have a very basic understanding of ecological diversity theory and some experience with R. The techniques presented here assume the raw sequences have been converted to amplicon sequence variants (ASVs) or operational Taxonomic Units (OTUs) and classified (i.e., assigned a taxonomy) using tools such as QIIME, mothur, or dada2 (Schloss et al. 2009; Caporaso et al. 2010; Callahan et al. 2016).
R is an open source (free) statistical programming and graphing language that includes tools for analysis of statistical, ecological diversity and community data, among many other things. R provides a cohesive environment to analyze data using modular “toolboxes” called R packages. R runs on all major operating systems including Microsoft Windows, Linux (e.g., Ubuntu), and Apple’s OS X. The general type of analyses done in this workshop could be done in python, Perl, or using command line tools. We like R for the following reasons:
This workshop will not start with the raw reads, since the first steps in a metabarcoding workflow are typically done using command line tools such as QIIME or mothur (dada2 is an exception) in the cloud. Data that can be analysed using techniques presented here is typically the result of the following steps (Comeau, Douglas, and Langille 2017):
Here we focus on the statistical analysis and visualizations following OTU calling that include:
We hope you enjoy this primer. Please provide us feedback on any errors you might find or suggestions for improvement.
Please cite this primer if you find it useful for your research as: ZSL Foster and NJ Grünwald. 2018. Analysis of Microbiome Community Data in R. DOI: XXX.
Niklaus J. Grünwald orcid.org/0000-0003-1656-7602 and Zach S. L. Foster orcid.org/0000-0002-5075-0948
© 2018, Corvallis, Oregon, USA