summaryrefslogtreecommitdiff
path: root/issues/systems
diff options
context:
space:
mode:
authorPjotr Prins2023-03-07 11:32:22 +0100
committerPjotr Prins2023-03-07 11:32:22 +0100
commit5586fc4f1ad5a3c410bc5183a8d9b9c1b81067c9 (patch)
treef0d033391ca34b9784bff9008794441c6127d5e5 /issues/systems
parentc2d9aa4a000da885e26f601cb641725f2b052d10 (diff)
downloadgn-gemtext-5586fc4f1ad5a3c410bc5183a8d9b9c1b81067c9.tar.gz
mariadb fixes/speedup
Diffstat (limited to 'issues/systems')
-rw-r--r--issues/systems/mariadb/move-to-innodb.gmi41
1 files changed, 40 insertions, 1 deletions
diff --git a/issues/systems/mariadb/move-to-innodb.gmi b/issues/systems/mariadb/move-to-innodb.gmi
index d846ade..dbf3918 100644
--- a/issues/systems/mariadb/move-to-innodb.gmi
+++ b/issues/systems/mariadb/move-to-innodb.gmi
@@ -22,6 +22,20 @@ I am starting with the two SE tables first - because they are small.
Actually ProbeData and ProbeSE (containing the Affy data) are not referenced in GN2. I need to check that.
+## Tasks
+
+* assigned: pjotrp
+* priority: high
+* status: in-progress
+* keywords: correlations, database
+
+For every table
+
+* [ ] Check for primary key
+* [ ] Check fulltext fields (see below)
+* [ ] Convert to innodb
+* [ ] Convert to utf8 and utf_general_ci (don't use utf8mb, see below)
+
## Report
With the SQL database we need to move from myisam to innodb format,
@@ -531,14 +545,39 @@ CREATE FULLTEXT INDEX ft_ProbeSet_Symbol ON ProbeSet(Symbol);
Now we have
```
+PRIMARY KEY ('Id'),
+UNIQUE KEY 'ProbeSetId' ('ChipId','Name'),
+KEY 'Name_IDX' ('Name'),
+KEY 'symbol_IDX' ('Symbol'),
+KEY 'RefSeq_TranscriptId' ('RefSeq_TranscriptId'),
+KEY 'GENBANK_IDX' ('GenbankId'),
+KEY 'TargetId' ('TargetId'),
+KEY 'Position' ('Chr'),
+KEY 'GeneId_IDX' ('GeneId'),
FULLTEXT KEY 'SEARCH_GENE_IDX' ('Symbol','alias'),
-FULLTEXT KEY 'SEARCH_FULL_IDX' ('Name','description','Symbol','alias','GenbankId','UniGeneId','Probe_Target_Description'),
FULLTEXT KEY 'RefSeq_FULL_IDX' ('RefSeq_TranscriptId'),
+FULLTEXT KEY 'SEARCH_FULL_IDX' ('Name','description','Symbol','alias','GenbankId','UniGeneId','Probe_Target_Description'),
FULLTEXT KEY 'ft_ProbeSet_Symbol' ('Symbol')
```
and the query works.
+Converting to unicode I was getting
+
+```
+Specified key was too long; max key length is 3072 bytes
+```
+
+Turned out simple utf8 worked:
+
+```
+ALTER TABLE ProbeSet CONVERT TO CHARACTER SET utf8 COLLATE utf8_general_ci;
+```
+
+and utf8mb is not recommended. We'll need fix that FIXME. See
+
+=> https://stackoverflow.com/questions/766809/whats-the-difference-between-utf8-general-ci-and-utf8-unicode-ci
+
pjotrp 16 Oct 2019 at 09:24 (edited)
Full text search is the least straightforward, see