From 5300d6034fad3a57e45795a485f70b0aeef1f665 Mon Sep 17 00:00:00 2001 From: jgart Date: Wed, 17 Nov 2021 01:13:38 -0500 Subject: add qc checks list --- topics/quality-control/qc-checks.gmi | 29 +++++++++++++++++++++++++++++ 1 file changed, 29 insertions(+) create mode 100644 topics/quality-control/qc-checks.gmi (limited to 'topics') diff --git a/topics/quality-control/qc-checks.gmi b/topics/quality-control/qc-checks.gmi new file mode 100644 index 0000000..55b4847 --- /dev/null +++ b/topics/quality-control/qc-checks.gmi @@ -0,0 +1,29 @@ +# Quality Control Checks + +1. Gene Symbols to ProbeSetId (Affymetrix format): + +AFFX-BkGr-GC03_st -> TCO500002136.mm.2 + +2. Inbred Strain names should prefer long form: + +B6 -> C57BL/6 +D2 -> DBA/2 + +3. Probeset IDs that don't have any values should be pruned: + +For example an Affymetrix data set might have ~28,000 entries and the data set that +is allowed into the GeneNetwork will be 22,000 entries. + +4. The standard error between male and female mice has to be computed. + +5. SE values have to be computed to 8 decimal places. + +6. The average between male and female mice has to be computed. + +7. AVG values have to be computed to only 3 decimal places. + +8. Datasets/studies having the same ProbeSetID should be grouped together. + +9. There should be no trailing spaces in data cells. + +10. Entries should have the same capitalization style. -- cgit v1.2.3