1. Trimming, adapter removal, and size filtering (BBDuk, http://jgi.doe.gov/data-and-tools/bbtools/bb-tools-user-guide/bbduk-guide/)
1.1. Use bbduk.sh to remove the adapter sequence from the 3' end of the reads and to trim homopolymers.
1.2. Split the reads into a short read fraction and a long read fraction.
1.2.1. Short reads: reads that run into the adapter sequence AND are shorter than 31 bp. These are processed for miRNA alignment.
1.2.2. Long reads: these are kept separate for alignment to the reference genome.
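
The splitting rule above can be illustrated with a minimal Python sketch. It is not how BBDuk works internally (BBDuk matches k-mers, while this sketch uses exact string matching) and it omits homopolymer trimming. The adapter sequence, the `min_overlap` value, and the function names are assumptions made for illustration only; substitute the adapter of the library kit actually used.

```python
# Illustration only: exact-match 3' adapter trimming, not BBDuk's k-mer algorithm.
ADAPTER = "TGGAATTCTCGGGTGCCAAGG"  # assumed 3' adapter; replace with your kit's sequence
SHORT_MAX_LEN = 31                 # short fraction: adapter found AND length < 31 bp


def trim_adapter(seq, adapter=ADAPTER, min_overlap=8):
    """Return (trimmed_sequence, adapter_found) for a single read."""
    # Case 1: the complete adapter occurs somewhere in the read.
    pos = seq.find(adapter)
    if pos != -1:
        return seq[:pos], True
    # Case 2: the read ends with a prefix of the adapter (partial adapter).
    max_overlap = min(len(adapter) - 1, len(seq))
    for overlap in range(max_overlap, min_overlap - 1, -1):
        if seq.endswith(adapter[:overlap]):
            return seq[:-overlap], True
    return seq, False


def split_read(seq):
    """Assign a read to the 'short' or 'long' fraction after trimming."""
    trimmed, adapter_found = trim_adapter(seq)
    if adapter_found and len(trimmed) < SHORT_MAX_LEN:
        return "short", trimmed
    # Assumption: every read that does not meet both conditions is treated as long.
    return "long", trimmed


if __name__ == "__main__":
    insert = "TGAGGTAGTAGGTTGTATAGTT"  # 22 nt example insert
    print(split_read(insert + ADAPTER + "AAAA"))
    print(split_read("ACGT" * 15))
```

A read is only placed in the short fraction when both conditions hold, which mirrors the "AND" in the rule above. Reads in which no adapter is found are passed on unchanged to the long fraction.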


2. Mapping* (BWA, http://bio-bwa.sourceforge.net/; BBMap, https://sourceforge.net/projects/bbmap/)
2.1. Short read fraction
2.1.1. Filter out reads <= 14 bp -> output: "filtered reads: too short" statistic
2.1.2. Map the remaining 15 – 31 bp reads to the ribosomal RNA sequences using BWA -> output: "mapped reads" and "unmapped reads", mapping statistics
2.1.3. Map the unmapped reads from the BWA alignment against the miRBase annotation (http://www.mirbase.org; mature miRNAs, species-specific) -> output: "mapped reads", read count statistics, and miRNA-count table
2.2. Long read fraction
2.2.1. Map the long reads against the reference genome using STAR (https://github.com/alexdobin/STAR) -> output: "mapped reads", read count statistics, gene_biotype statistics (including rRNA sequences), and gene-read table
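
The bookkeeping for the short read fraction (length filter, removal of rRNA hits, miRNA counting) can be sketched in Python as follows. This is a minimal sketch that assumes the alignments have already been produced by the mapper and reduced to a plain mapping of read ID to reference name. The function names, the read IDs, and the reference names in the example are hypothetical and are not part of BWA, BBMap, or miRBase.

```python
from collections import Counter

MIN_LEN = 15  # reads of 14 bp or fewer are reported as "too short"


def filter_too_short(reads, min_len=MIN_LEN):
    """Split {read_id: sequence} into the kept reads and a 'too short' count."""
    kept = {rid: seq for rid, seq in reads.items() if len(seq) >= min_len}
    return kept, len(reads) - len(kept)


def unmapped_ids(read_ids, hits):
    """Return the read IDs without a hit; `hits` maps read_id -> reference name."""
    return [rid for rid in read_ids if rid not in hits]


def mirna_count_table(hits):
    """Count reads per miRNA from a {read_id: miRNA name} mapping."""
    return Counter(hits.values())


if __name__ == "__main__":
    # Hypothetical example data standing in for real trimmed reads and alignments.
    reads = {"r1": "A" * 10, "r2": "C" * 22, "r3": "G" * 20, "r4": "T" * 25}
    rrna_hits = {"r3": "rRNA_example"}
    mirna_hits = {"r2": "miRNA_example", "r4": "miRNA_example"}

    kept, too_short = filter_too_short(reads)
    remaining = unmapped_ids(kept, rrna_hits)
    table = mirna_count_table({rid: mirna_hits[rid] for rid in remaining if rid in mirna_hits})

    print("filtered reads: too short =", too_short)
    print("rRNA mapped =", len(kept) - len(remaining), "unmapped =", len(remaining))
    for name, count in table.items():
        print(name, count, sep="\t")
```

Only reads that fail to map to rRNA are carried forward to the miRNA step, so rRNA-derived fragments do not inflate the miRNA counts.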

* Alternative programs can also be used for read mapping. A comparison of alignment programs for small RNA-Seq data can be found in Tam et al., 2015.