summaryrefslogtreecommitdiff
path: root/topics
diff options
context:
space:
mode:
authorjgart2021-11-17 01:13:38 -0500
committerjgart2021-11-17 01:13:38 -0500
commit5300d6034fad3a57e45795a485f70b0aeef1f665 (patch)
tree0e790e99e49ad98fee0e50eaee35b7baa52af48d /topics
parent50797e6b3edadf2ba2c93c635ca67fc589b7622b (diff)
downloadgn-gemtext-5300d6034fad3a57e45795a485f70b0aeef1f665.tar.gz
add qc checks list
Diffstat (limited to 'topics')
-rw-r--r--topics/quality-control/qc-checks.gmi29
1 files changed, 29 insertions, 0 deletions
diff --git a/topics/quality-control/qc-checks.gmi b/topics/quality-control/qc-checks.gmi
new file mode 100644
index 0000000..55b4847
--- /dev/null
+++ b/topics/quality-control/qc-checks.gmi
@@ -0,0 +1,29 @@
+# Quality Control Checks
+
+1. Gene Symbols to ProbeSetId (Affymetrix format):
+
+AFFX-BkGr-GC03_st -> TCO500002136.mm.2
+
+2. Inbred Strain names should prefer long form:
+
+B6 -> C57BL/6
+D2 -> DBA/2
+
+3. Probeset IDs that don't have any values should be pruned:
+
+For example an Affymetrix data set might have ~28,000 entries and the data set that
+is allowed into the GeneNetwork will be 22,000 entries.
+
+4. The standard error between male and female mice has to be computed.
+
+5. SE values have to be computed to 8 decimal places.
+
+6. The average between male and female mice has to be computed.
+
+7. AVG values have to be computed to only 3 decimal places.
+
+8. Datasets/studies having the same ProbeSetID should be grouped together.
+
+9. There should be no trailing spaces in data cells.
+
+10. Entries should have the same capitalization style.