aboutsummaryrefslogtreecommitdiff
path: root/doc
diff options
context:
space:
mode:
authorPjotr Prins2018-06-29 12:51:41 +0000
committerPjotr Prins2018-06-29 12:51:41 +0000
commit9f9887eb5befe025e85641f1930e46e5e6a25e56 (patch)
treeaf87047d49fb39b7a7a5c7c1d5dc2b98400c5595 /doc
parent7f61ca0761e93918f64119c50b32e2812eba8291 (diff)
parent159f95233afd36c98335059e35cd6c51e4760d24 (diff)
downloadpangemma-9f9887eb5befe025e85641f1930e46e5e6a25e56.tar.gz
Merge branch 'master' of github.com:genetics-statistics/GEMMA
Diffstat (limited to 'doc')
-rw-r--r--doc/compile_GEMMA_win64.txt48
-rw-r--r--doc/manual.pdfbin269308 -> 319480 bytes
-rw-r--r--doc/manual.tex4
3 files changed, 52 insertions, 0 deletions
diff --git a/doc/compile_GEMMA_win64.txt b/doc/compile_GEMMA_win64.txt
new file mode 100644
index 0000000..84f16ec
--- /dev/null
+++ b/doc/compile_GEMMA_win64.txt
@@ -0,0 +1,48 @@
+// install R 3.4.3
+https://cran.r-project.org/bin/windows/base/
+
+// install Rtools 3.4
+https://cran.r-project.org/bin/windows/Rtools/
+
+// Download openblas (v0.2.19-Win64-int32)
+https://sourceforge.net/projects/openblas/files/v0.2.19/
+
+// Make a place to store the files
+mkdir Github
+cd Github/
+
+// Clone the required dependancies
+git clone https://github.com/eigenteam/eigen-git-mirror.git
+git clone https://github.com/genetics-statistics/GEMMA.git
+
+// Download and install gsl2.4, unzip/tar into Github
+http://gnu.askapache.com/gsl/
+
+// Download and install msys from http://downloads.sourceforge.net/mingw/MSYS-1.0.11.exe
+cd c:/msys/1.0
+// Run mysys
+msys.bat
+
+// Under msys, Compile GSL-2.4 inside the msys
+cd /c/
+cd Github/gsl-2.4
+./configure --prefix=C:/MinGW
+make -j 4
+make install
+
+// Building Gemma on the R tool chain under windows using CMD compile gemma
+cd gemma
+make -j 2
+
+// Get all the DLLs from:
+- MinGW DLLs: https://sourceforge.net/projects/openblas/files/v0.2.12/mingw64_dll.zip/download
+- DLLs from the compiled gsl-2.4
+- DLLs from openBLAS
+
+// Required DLLs:
++ libgcc_s_seh-1.dll
++ libgfortran-3.dll
++ libgsl-23.dll
++ libgslcblas-0.dll
++ libopenblas.dll
++ libquadmath-0.dll
diff --git a/doc/manual.pdf b/doc/manual.pdf
index b760cc1..1b7dc5d 100644
--- a/doc/manual.pdf
+++ b/doc/manual.pdf
Binary files differ
diff --git a/doc/manual.tex b/doc/manual.tex
index 1e042e7..8e5efe2 100644
--- a/doc/manual.tex
+++ b/doc/manual.tex
@@ -1373,6 +1373,10 @@ format. In addition, to fit MQS-LDW, you will need to add "-wcat
specifies the LD score file, which can be provided in a gzip
compressed format.
+A feature of MQS based variance component estimation is that one only need to use a subset of samples to estimate certain quantities. Using a subset of samples dramatically improves computation speed while maintaining variance component estimation accuracy. To take this strategy, one can use ``-sample [num]" to use a fixed number of random samples to perform estimation.
+
+Instead of using the genotype data from the study, one can also use genotype data from a reference panel. For example, one can use the genotype data from the 1000 genomes project as the reference. However, any population stratification in the reference panel should be dealt with first. For example, the individuals with European ancestry in the 1000 genomes project come from five subpopulations: CEU, FIN, GBR, IBS, and TSI. MQS computes SNP correlations across all SNP pairs as it should be under the LMM assumption. Therefore, any population stratification in the reference panel would increase the overall SNP correlation estimate, leading to down-ward bias in the final heritability estimate. To address the population stratification in the reference panel, one can include a few dummy variables in the model fitting step as covariates. These covariates represent, for example, the five subpopulations, and are used to effectively center the genotype mean in each subpopulation separately. To do this, one can create a covariate file containing five columns (no header): the first column is all 1 representing the intercept; the second column is 1 for CEU and 0 for others; the third column is 1 for FIN and 0 for others; ...; while the fifth column is 1 for IBS and 0 for others. Afterwards, one can add "-c [filename]" to include this covariate file in the command line.
+
\subsubsection{Detailed Information}
MQS-LDW uses an iterative procedure to update the variance