summaryrefslogtreecommitdiff
path: root/topics
diff options
context:
space:
mode:
authorFrederick Muriuki Muriithi2024-09-23 10:07:57 -0500
committerFrederick Muriuki Muriithi2024-09-23 10:08:28 -0500
commit022f1c084121a5a4938f016733201fade7a3afd2 (patch)
treed73b54d270b008c30090cab4d8d0a6e552bde4c4 /topics
parent77a2b9a0f342f82acb18812a38d4a37f1b48359b (diff)
downloadgn-gemtext-022f1c084121a5a4938f016733201fade7a3afd2.tar.gz
Documentation: Genotypes, Assemblies, Markers and GeneNetwork.
Diffstat (limited to 'topics')
-rw-r--r--topics/gn-uploader/genotypes-assemblies-markers-and-genenetwork.gmi40
1 files changed, 40 insertions, 0 deletions
diff --git a/topics/gn-uploader/genotypes-assemblies-markers-and-genenetwork.gmi b/topics/gn-uploader/genotypes-assemblies-markers-and-genenetwork.gmi
new file mode 100644
index 0000000..db0ddf3
--- /dev/null
+++ b/topics/gn-uploader/genotypes-assemblies-markers-and-genenetwork.gmi
@@ -0,0 +1,40 @@
+# Genotypes, Assemblies, Markers and GeneNetwork
+
+## Tags
+
+* type: documentation, docs, doc
+* keywords: genotype, assembly, markers, data, database, genenetwork, uploader
+
+## Markers
+
+```
+The marker is the SNP…
+
+— Rob (Paraphrased)
+```
+
+SNPs (Single Nucleotide Polymorphisms) are specific locations of interest within the genome, where the pair of nucleotides can take different forms.
+
+A SNP and its immediate neighbourhood (a number of megabase pairs before and after the SNP) form a sequence that is effectively the marker, e.g. for mouse (Mus musculus) you could have the following sequence from the GRCm38 genome assembly (mm10):
+
+```
+GAGATAAAGATGGGTCCCTTGGCACAGGACTGGCCCACATTTCCaatataaattacaacaattttttttaaatttttaaaCAAAACAAGCATCTCACACAC/TTGAAAAAGAAGATGCATTCAAAGAAAATAGATGTTTCAATGTATTTAAGATAATCAAGAGATAACCATGACCATATCATGAGGAAACTTAAGAATTGGCA
+```
+
+where the position with `C/T` represents the SNP of interest and thus the marker.
+
+You can search this on the UCSC Genome Browser, specifically the
+=> https://genome.ucsc.edu/cgi-bin/hgBlat BLAT search
+to get the name of the marker, and some extra details regarding it.
+
+## Genome Assemblies
+
+The genome assembly used will "determine" the position of the marker on the genome — newer assemblies will (generally) give a better position accounting for more of the issues discovered in older assemblies.
+
+With most of the newer assemblies, the positions do not shift very drastically.
+
+## GeneNetwork
+
+Currently (September 2024), GeneNetwork uses the GRCm38 (mm10) assembly for mice.
+
+Unfortunately, since the system was built for mice, the tables (e.g. Geno table) do not account for the fact that you could have markers (and other data) from species other than Mus musculus. You thus have the Geno table with fields like `Mb_mm8`, `Chr_mm8` which are very mouse-specific.