summaryrefslogtreecommitdiff
path: root/issues/materialised-views-for-correlations.gmi
diff options
context:
space:
mode:
authorFrederick Muriuki Muriithi2022-10-19 11:18:52 +0300
committerFrederick Muriuki Muriithi2022-10-19 11:20:59 +0300
commit21f33beec687e71bef2b33238a14b4e907aa8ec3 (patch)
tree55170923c2bce926b52ccea8ef21f7fc001094f9 /issues/materialised-views-for-correlations.gmi
parent4e093c9e3ae7f49c07b3bc97b9c5141d8ed7cf45 (diff)
downloadgn-gemtext-21f33beec687e71bef2b33238a14b4e907aa8ec3.tar.gz
Issues (materialised-views-for-correlations): Update issue
* issues/materialised-views-for-correlations.gmi: Update issue with notes on the different types of traits, and also some candidate queries that could be materialized.
Diffstat (limited to 'issues/materialised-views-for-correlations.gmi')
-rw-r--r--issues/materialised-views-for-correlations.gmi22
1 files changed, 22 insertions, 0 deletions
diff --git a/issues/materialised-views-for-correlations.gmi b/issues/materialised-views-for-correlations.gmi
index 855a143..10b878d 100644
--- a/issues/materialised-views-for-correlations.gmi
+++ b/issues/materialised-views-for-correlations.gmi
@@ -22,6 +22,12 @@ There is some work on
=> /topics/genotype-database the genotype database
that should allow intermediate materialised views to be stored in lmdb
+There might need to be multiple materialised views for the different types of traits, i.e.
+* Phenotypes (Publish)
+* Genotypes (Geno)
+* mRNA (ProbeSet)
+* Temp (Not sure how this would be handled)
+
### Implementation Concerns
The implementation of the materialised views will need to concern itself with the following issues:
@@ -36,3 +42,19 @@ Maybe, if we are going down the materialised views road, we should also have mig
The reason for the concern above is, for example, The developers (alex, arun, bonz, fredm, etc) can implement whatever schema changes, triggers and views locally, but they do not (and probably should not) have access to production. As such, there should be provided a sort of CLI endpoint for the deployment system (or person) to run to implement the changes on the production (and CI/CD) database to get similar results.
The problem here, is that the migration might be moot, if the data is then moved out of the database, as is being planned.
+
+### Queries to Materialise
+
+Possible candidate queries for materialisation are:
+
+=> https://github.com/genenetwork/genenetwork2/blob/a2b837801d479ed2fb06ca33c07de9c271532c46/wqflask/base/data_set/dataset.py#L200-L292
+
+The method above is doing way too much - it should probably be split into separate methods for each class, to simplify the code a little and make it clearer what each part does before reworking the queries for the materialized view.
+
+
+=> https://github.com/genenetwork/genenetwork2/blob/a2b837801d479ed2fb06ca33c07de9c271532c46/wqflask/base/trait.py#L386-L610
+
+The method above is also doing way too much.
+
+
+Both methods above do not have the metadata, so probably also have a look at adding the metadata to the materialized views