diff options
author | Arun Isaac | 2023-03-14 20:01:42 +0000 |
---|---|---|
committer | Arun Isaac | 2023-03-14 20:01:42 +0000 |
commit | 08d350bab237ece0ee2d71ca561d334b7c5d832b (patch) | |
tree | 37aae761a23c9166e3428ecff1de4f6e94133df3 /issues | |
parent | f1382e04d971600017018d501c7275aa1b1fa18b (diff) | |
download | gn-gemtext-08d350bab237ece0ee2d71ca561d334b7c5d832b.tar.gz |
Add brca search issue.
Diffstat (limited to 'issues')
-rw-r--r-- | issues/search-for-brca.gmi | 10 |
1 files changed, 10 insertions, 0 deletions
diff --git a/issues/search-for-brca.gmi b/issues/search-for-brca.gmi new file mode 100644 index 0000000..c42c745 --- /dev/null +++ b/issues/search-for-brca.gmi @@ -0,0 +1,10 @@ +# Search for brca + +* assigned: arun + +Search for brca does not return results for brca1 and brca2. It should. +=> https://cd.genenetwork.org/gsearch?type=gene&terms=brca + +The xapian stemmer does not stem brca1 to brca. That's why when one searches for brca, results for brca1 are not returned. + +Perhaps we should write a custom stemmer that stems brca1 to brca. But, at the same time, we should be wary of stemming terms like p450 to p. Pjotr suggests the heuristic that we look for at least 2 or 3 alphabetic characters at the beginning. Another approach is to hard-code a list of candidates to look for. |