The purpose of this repository is to provide public access to the R code and data files used in the following publication:
Baker JM, Hinkle KJ, McDonald RA, Brown CA, Falkowski NR, Huffnagle GB, Dickson RP. Whole lung tissue is the preferred sampling method for amplicon-based characterization of murine lung microbiota. Microbiome 9, 99 (2021). https://doi.org/10.1186/s40168-021-01055-4
- Specimen collection and processing
- Goal: Selection of appropriate numbers and types of negative and positive controls - sample all potential source communities
- Method: Thoughtful experimental design and sterile specimen collection
- Quantification
- Goal: Confirm sufficient quantities of DNA (bacterial and total) have beeen isolated from low-biomass samples
- Method: droplet digital PCR or qPCR for bacterial DNA quantification, Qubit or Nanodrop for total DNA quantification
- Sequencing, Identification, and Quantification of Bacterial Reads
- Goal: Amplify, sequence, identify, and quantify bacterial reads in low-biomass specimens
- Method: PCR amplification, 16S rRNA gene amplicon sequencing, mothur, calculation of read counts per specimen
- Diversity of Identified Bacterial Taxa
- Goal: Determine bacterial signal is distinct from controls and reproducible among biological replicates
- Method: Calculation of alpha (e.g. rarified richness, Shannon diversity index) and beta (e.g. Bray-Curtis dissimilarity index) diversity metrics
- Relative Abundance of Identified Bacterial Taxa
- Goal: Determine the biological plausibility of bacterial taxa identified in low-biomass samples
- Method: Principal component analysis and ranked relative abundance of bacterial taxa
- Rmarkdown with code (defaults to final dataset version)
- Taxonomy file (contains taxonomy assignments for each OTU from mothur)
- Shared file (contains OTU counts from mothur)
- 16S rRNA gene ddPCR quantification data
- Read count data from MiSeq run
- Knitted reports (raw html and pdf) for untrimmed, trimmed, and final datasets for comparison
- PDF versions of each figure generated for the final publication, & additional figures that did not make the final publication
- License information
- Hidden files (.RData and .Rhistory from local sessions)
- .gitignore file (ignores cache and other hidden files when pushing directory contents to GitHub)
- Link to publication: https://microbiomejournal.biomedcentral.com/articles/10.1186/s40168-021-01055-4
- Link to preprint: https://www.biorxiv.org/content/10.1101/2020.09.08.283259v1
- Link to SRA submission: https://www.ncbi.nlm.nih.gov/sra/PRJNA644805
- Links to Rmarkdown htmls:
- Trimmed & Quality-Checked Dataset (final version):
- Untrimmed dataset:
- Trimmed dataset:
Packages and dependencies required to run code in the repository can be found in session info in the knitted Rmarkdown pdf and html.