From 4e8bdddb8e7d8f9cfbe801888bf7e4f3c92fab30 Mon Sep 17 00:00:00 2001
From: Frederick Muriuki Muriithi
Date: Mon, 24 Oct 2022 11:20:10 +0300
Subject: Issues (pre-cache-datasets): New issue

* issues/pre-cache-datasets.gmi: Add new issue to track the
  pre-caching of datasets.
---
 issues/pre-cache-datasets.gmi | 28 ++++++++++++++++++++++++++++
 1 file changed, 28 insertions(+)
 create mode 100644 issues/pre-cache-datasets.gmi

(limited to 'issues/pre-cache-datasets.gmi')

diff --git a/issues/pre-cache-datasets.gmi b/issues/pre-cache-datasets.gmi
new file mode 100644
index 0000000..6f1c165
--- /dev/null
+++ b/issues/pre-cache-datasets.gmi
@@ -0,0 +1,28 @@
+# Pre-Cache Datasets
+
+## Tags
+
+* assigned:
+* priority: medium
+* type: enhancement
+* status: open
+* keywords: cache, optimisation
+
+## Description
+
+To improve the performance of the system when running computations (correlations, mappings, etc), we need to pre-cache the datasets in text files.
+
+The triggers for pre-caching could be:
+* creation of a new dataset
+* changes in data for (a) trait(s) in the dataset
+* changes in sample list for a dataset
+
+I propose an external job be triggered whenever any of the triggers above happen. The job could, among other things:
+* Delete existing cache file
+* Create new cache file with new data
+
+Maybe, if possible, we could have the pre-cache service generate the cache files based on dates for latest changes without deleting older cache files -- we could look into whether this is possible.
+
+## Related Issues
+
+=> /issues/sample-data-caching-problem Sample Data Caching Bug
-- 
cgit v1.2.3