From b2feda451ccfbeaed02dce9088d6dd228cf15861 Mon Sep 17 00:00:00 2001 From: Bonface Date: Tue, 13 Feb 2024 23:52:26 -0600 Subject: Update dataset RTF Files. --- general/datasets/Pgenratbrn_rna_0520/processing.rtf | 3 +++ 1 file changed, 3 insertions(+) create mode 100644 general/datasets/Pgenratbrn_rna_0520/processing.rtf (limited to 'general/datasets/Pgenratbrn_rna_0520/processing.rtf') diff --git a/general/datasets/Pgenratbrn_rna_0520/processing.rtf b/general/datasets/Pgenratbrn_rna_0520/processing.rtf new file mode 100644 index 0000000..4c9da95 --- /dev/null +++ b/general/datasets/Pgenratbrn_rna_0520/processing.rtf @@ -0,0 +1,3 @@ +

Prior to alignment, reads were demultiplexed and read fragments were trimmed for adaptors and for quality using Cutadapt (version 1.9.1;(Martin, 2011)). Reads were eliminated if the trimmed length of either read fragment was <20 nucleotides. Next, bowtie2(v 2.2.6) (4) was used to align reads to ribosomal RNA, and the unmapped reads were retained. These will be referred to as “cleaned” reads. The RNA-Seq Expectation Maximization (RSEM v1.2.31) algorithm (Li and Dewey 2011) was used to estimate the number of aligned reads for individual Ensembl transcripts (version 96). The number of reads aligned to each Ensembl gene was calculated as the sum of the number of reads aligned to individual Ensembl transcripts annotated to that gene.

+ +

Prior to normalization and transformation, a detection above background filter was applied where we required a gene to have at least 2/3 of the samples have 1 count or more to be considered expressed above background. All genes that did not pass this detection above background criteria were removed. Count values were normalized using RUVg (PMID: 25150836) and transformed using the regularized log function (PMID: 25516281).

-- cgit v1.2.3