From 4e8bdddb8e7d8f9cfbe801888bf7e4f3c92fab30 Mon Sep 17 00:00:00 2001 From: Frederick Muriuki Muriithi Date: Mon, 24 Oct 2022 11:20:10 +0300 Subject: Issues (pre-cache-datasets): New issue * issues/pre-cache-datasets.gmi: Add new issue to track the pre-caching of datasets. --- issues/pre-cache-datasets.gmi | 28 ++++++++++++++++++++++++++++ 1 file changed, 28 insertions(+) create mode 100644 issues/pre-cache-datasets.gmi (limited to 'issues/pre-cache-datasets.gmi') diff --git a/issues/pre-cache-datasets.gmi b/issues/pre-cache-datasets.gmi new file mode 100644 index 0000000..6f1c165 --- /dev/null +++ b/issues/pre-cache-datasets.gmi @@ -0,0 +1,28 @@ +# Pre-Cache Datasets + +## Tags + +* assigned: +* priority: medium +* type: enhancement +* status: open +* keywords: cache, optimisation + +## Description + +To improve the performance of the system when running computations (correlations, mappings, etc), we need to pre-cache the datasets in text files. + +The triggers for pre-caching could be: +* creation of a new dataset +* changes in data for (a) trait(s) in the dataset +* changes in sample list for a dataset + +I propose an external job be triggered whenever any of the triggers above happen. The job could, among other things: +* Delete existing cache file +* Create new cache file with new data + +Maybe, if possible, we could have the pre-cache service generate the cache files based on dates for latest changes without deleting older cache files -- we could look into whether this is possible. + +## Related Issues + +=> /issues/sample-data-caching-problem Sample Data Caching Bug -- cgit v1.2.3