From d029d5d7f8ead1f1de8d318045004a4a6f68f5fb Mon Sep 17 00:00:00 2001 From: Bonface Date: Fri, 9 Feb 2024 09:41:28 -0600 Subject: Update dataset RTF Files. --- general/datasets/EYE_M2_1105_R/processing.rtf | 11 +++++++++++ 1 file changed, 11 insertions(+) create mode 100644 general/datasets/EYE_M2_1105_R/processing.rtf (limited to 'general/datasets/EYE_M2_1105_R/processing.rtf') diff --git a/general/datasets/EYE_M2_1105_R/processing.rtf b/general/datasets/EYE_M2_1105_R/processing.rtf new file mode 100644 index 0000000..ff25e0d --- /dev/null +++ b/general/datasets/EYE_M2_1105_R/processing.rtf @@ -0,0 +1,11 @@ +
Probe (cell) level data from the CEL file: These CEL values produced by GCOS are 75% quantiles from a set of 91 pixel values per cell. + +-- cgit v1.2.3+
+- Step 1: We added an offset of 1.0 unit to each cell signal to ensure that all values could be logged without generating negative values. We then computed the log base 2 of each cell.
+- Step 2: We performed a quantile normalization of the log base 2 values for the total set of arrays using the same initial steps used by the RMA transform.
+- Step 3: We computed the Z scores for each cell value.
+- Step 4: We multiplied all Z scores by 2.
+- Step 5: We added 8 to the value of all Z scores. The consequence of this simple set of transformations is to produce a set of Z scores that have a mean of 8, a variance of 4, and a standard deviation of 2. The advantage of this modified Z score is that a two-fold difference in expression level corresponds approximately to a 1 unit difference.
+- Step 7: Finally, when appropriate, we computed the arithmetic mean of the values for the set of microarrays for each strain. Technical replicates were averaged before computing the mean for independent biological samples.
+