summaryrefslogtreecommitdiff
path: root/topics/next-gen-databases/design-doc.gmi
blob: f8463b3ff1023937489180ca952082a779b32a66 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
# LMDB Phenotype/Genotype Store

* lead: bonfacem
* team: fred, alex
* contact: pjotr

Please go through:

=> https://github.com/BonfaceKilz/gn-dataset-dump
=> https://github.com/genenetwork/dump-genenetwork-database

## Tasks
* Go through the above repository.  We'll have a check in on Friday 23 to discuss this.
* Write and package guile-lmdb.  Use cl-lmdb as inspiration
* Find out a more efficient way to store utf-8 strings
* Integrate LMDB in GN:
* Rewrite some statistical computations using LMDB+RDF
* Integrate editing LMDB entries using Fred's new auth system


## Actionable items (Week 1):
Bonface:
* [A] Setup a directoryallowed for dumping rdf files
* [A] Autogenerate documentation: trees, and labels.  How do you test for correctness and completeness in a data store?
* [C] guile bindings for lmdb for important stuff
* [B] Using hashes to track updates on database---proposal

Alex:
* Fetching all the phenotype data from the database using sql and genotype file
* Using LMBD as input for correlations---make proposal for how we can export data in a way we can use

Fred:
* Fetch metadata in RDF and possibly document things