summaryrefslogtreecommitdiff
path: root/issues/gemma
diff options
context:
space:
mode:
authorPjotr Prins2022-03-12 14:35:28 +0100
committerPjotr Prins2022-03-12 14:35:28 +0100
commit403bb5b1e546944ad561e27562337aef154b5836 (patch)
treefcd9a015601143d9e8ddf061663d2bbe547e130f /issues/gemma
parent820ebba098569509dc62ea9a5394eb5e3e14fbdd (diff)
downloadgn-gemtext-403bb5b1e546944ad561e27562337aef154b5836.tar.gz
issue: gemma: CHR1 uses 10% of RAM
Diffstat (limited to 'issues/gemma')
-rw-r--r--issues/gemma/HS-Rat-crashes-gemma.gmi30
1 files changed, 29 insertions, 1 deletions
diff --git a/issues/gemma/HS-Rat-crashes-gemma.gmi b/issues/gemma/HS-Rat-crashes-gemma.gmi
index be0af66..6d8f5c8 100644
--- a/issues/gemma/HS-Rat-crashes-gemma.gmi
+++ b/issues/gemma/HS-Rat-crashes-gemma.gmi
@@ -61,10 +61,38 @@ The geno file is massive:
3.4M Mar 12 11:56 HSNIH-Palmer_true_snps.txt
```
-Probably best to test on a different machine! Let's move to tux02. Running luna (a half year old version of GN2) gives `**** FAILED: number of columns in the kinship file does not match the number of individuals for row = 79`. So, that does not help! I think this is a known issue that got fixed later. Next up, try and run gemma by hand for chromosome 1 after installing gemma tools with a recent GNU Guix:
+Probably best to test on a different machine! Let's move to tux02. Running luna (a half year old version of GN2) gives `**** FAILED: number of columns in the kinship file does not match the number of individuals for row = 79`. So, that does not help! I think this is a known issue that got fixed later. Next up, try and run gemma by hand for (largest) chromosome 1 after installing gemma tools with a recent GNU Guix:
```
+tux02:~/tmp/gemma-cpu-lockup$ /usr/bin/time -v gemma -loco 1 -gk -g HSNIH-Palmer_true_geno.txt -p PHENO_2+FcfQiTVSC7FmmbsatUPg.txt -a HSNIH-Palmer_true_snps.txt
+
+GEMMA 0.98.5-pre1 (2021-08-14) by Xiang Zhou, Pjotr Prins and team (C) 2012-2021
+Reading Files ...
+## number of total individuals = 6147
+## number of analyzed individuals = 1306
+## number of covariates = 1
+## number of phenotypes = 1
+## leave one chromosome out (LOCO) = 1
+## number of total SNPs/var = 134918
+## number of SNPS for K = 121094
+## number of SNPS for GWAS = 13824
+## number of analyzed SNPs = 132628
+Calculating Relatedness Matrix ...
+**** INFO: Done.
+ Command being timed: "gemma -loco 1 -gk -g HSNIH-Palmer_true_geno.txt -p PHENO_2+FcfQiTVSC7FmmbsatUPg.txt -a HSNIH-Palmer_true_snps.txt"
+ User time (seconds): 3096.83
+ System time (seconds): 1435.51
+ Percent of CPU this job got: 1781%
+ Elapsed (wall clock) time (h:mm:ss or m:ss): 4:14.36
+ Maximum resident set size (kbytes): 26851444
+```
+
+That is 4 minutes wall clock time using about 26GB of RAM. The machine has 256 GB RAM, so you can see where the problem is. Gemma-wrapper should handle this gracefully, but because of overcommit the machine goes in a tail-spin.
+Next run lmm with
+
+```
+/usr/bin/time -v gemma -loco 1 -lmm 9 -g HSNIH-Palmer_true_geno.txt -p PHENO_2+FcfQiTVSC7FmmbsatUPg.txt -a HSNIH-Palmer_true_snps.txt -k output/result.cXX.txt
```
Now the goal is to try and crash the server before setting overcommit.