summary refs log tree commit diff
diff options
context:
space:
mode:
authorFrederick Muriuki Muriithi2022-10-05 04:44:41 +0300
committerFrederick Muriuki Muriithi2022-10-05 04:45:27 +0300
commit965a104ccb7c7d23ecd33779b01222406855f8df (patch)
treeb2f794d9d823375f5915913bfd83753463fbdfad
parent2e14a28bff11416e1d18416ad7c337d7e0ca0323 (diff)
downloadgn-gemtext-965a104ccb7c7d23ecd33779b01222406855f8df.tar.gz
Issue: update with results of testing on GN1
-rw-r--r--issues/phenotype-correlation-error.gmi39
1 files changed, 37 insertions, 2 deletions
diff --git a/issues/phenotype-correlation-error.gmi b/issues/phenotype-correlation-error.gmi
index 8b37975..7bec588 100644
--- a/issues/phenotype-correlation-error.gmi
+++ b/issues/phenotype-correlation-error.gmi
@@ -71,8 +71,6 @@ KeyError: "'1422223_at' information is not found in the database."
 so far, triangulated the issue to possibly being the fact that the "target_dataset" value is not used
 => https://github.com/genenetwork/genenetwork2/blob/53aa084fd2c9c930ac791ee43affffb3f788547c/wqflask/wqflask/correlation/rust_correlation.py#L271-L289 in this function.
 
-@zsloan and @alexm: any ideas?
-
 ## Literature Correlation: Probeset Trait Against Publish/Genotype Dataset
 
 Run literature correlation for
@@ -107,6 +105,9 @@ The code seems to imply that we should not run literature correlations against a
 
 ## Some Reflections
 
+The `target_dataset` is not used in the
+=> https://github.com/genenetwork/genenetwork2/blob/c38bee43c1256c3515bbd1d805745d8dfb8ce390/wqflask/wqflask/correlation/rust_correlation.py#L271-L289 tissue correlations which seems like an error to me (fredm).
+
 In my (fredm) work on partial correlations, before doing the computations,
 => https://github.com/genenetwork/genenetwork3/blob/ff34aee0f39c2e91db243461d7d67405e7aea0e3/gn3/computations/partial_correlations.py#L704-L750 there were error checks
 that were run.
@@ -119,6 +120,40 @@ The failures above with the Publish/Genotype datasets implies one of two things:
 
 Better yet, we should probably not present invalid data to the user, i.e. do not present user with a dataset which would lead to errors if a correlation of a particular type is run against it with the given trait.
 
+## Trial Against GN1
+
+@zsloan @alexm: Running the failing tissue and literature correlations above with the same trait against the "BXD Published Phenotypes" and the "BXD Genotypes" on
+=> http://gn1.genenetwork.org/
+I got the error
+```
+Wrong correlation type
+
+    Sorry! Error occurred while processing your request.
+
+    The nature of the error generated is as follows:
+
+    Correlation Type Error :
+
+        It is not possible to compute the Tissue Correlation (Pearson's r) between your trait and data in this BXDGeno database. Please try again after selecting another type of correlation.
+```
+for the tissue correlations and
+```
+Wrong correlation type
+
+    Sorry! Error occurred while processing your request.
+
+    The nature of the error generated is as follows:
+
+    Correlation Type Error :
+
+        It is not possible to compute the SGO Literature Correlation between your trait and data in this BXDPublish database. Please try again after selecting another type of correlation.
+```
+for the literature correlations.
+
+My initial hunch was correct. We should not be running the tissue and literature correlations in the way we were in the cases above.
+
+We now need to check for these combinations and display an error for the user, as is done in GN1
+
 ## Tags
 * assigned: alexm, fredm, zsloan
 * type: bug