From 96454e93ef7fcce430419a812db576559c3bbefe Mon Sep 17 00:00:00 2001 From: Munyoki Kilyungi Date: Thu, 14 Dec 2023 15:22:42 +0300 Subject: Close issue. Signed-off-by: Munyoki Kilyungi --- issues/handling-resource-links-in-probeset-page.gmi | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) (limited to 'issues/handling-resource-links-in-probeset-page.gmi') diff --git a/issues/handling-resource-links-in-probeset-page.gmi b/issues/handling-resource-links-in-probeset-page.gmi index de214be..3b9e185 100644 --- a/issues/handling-resource-links-in-probeset-page.gmi +++ b/issues/handling-resource-links-in-probeset-page.gmi @@ -28,15 +28,24 @@ gn:probeset1435395_s_at gnt:hasGeneManiaResource http://genome.cse.ucsc.edu/cgi-bin/hgTracks?db=mm10&hgg_gene=NM_020582&hgg_chrom=chr5&hgg_start=145183705&hgg_end=145191592 +To construct the above link, we need to have the ProbeSet.refseq_transcriptid and the GeneList.TxStart/GeneList.TxEnd. However, doing a join between these 2 tables takes too long and the generated file get's too big. +### Resolution Transforming ProbeSet metadata takes long. The exact command: ```shell time guix shell guile-dbi \ -guile-hashing -m manifest.scm -- ./pre-inst-env ./examples/probeset.scm --settings conn.scm --output /export/data/genenetwork-virtuoso/probeset-metadata.ttl --documentation ./docs/probeset-metadata.md + guile-hashing -m manifest.scm \ + -- ./pre-inst-env \ + ./examples/probeset.scm --settings \ + conn.scm --output \ + /export/data/genenetwork-virtuoso/probeset-metadata.ttl \ + --documentation ./docs/probeset-metadata.md ``` The aforementioned command takes: @@ -54,3 +63,5 @@ A second try: The file-size of the transformed metadata is 6.0G which is to be expected. Optimisations---perhaps using guile-fibers---can be considered later. + +* closed -- cgit v1.2.3