summaryrefslogtreecommitdiff
path: root/issues/search-for-brca.gmi
diff options
context:
space:
mode:
authorArun Isaac2023-03-14 20:01:42 +0000
committerArun Isaac2023-03-14 20:01:42 +0000
commit08d350bab237ece0ee2d71ca561d334b7c5d832b (patch)
tree37aae761a23c9166e3428ecff1de4f6e94133df3 /issues/search-for-brca.gmi
parentf1382e04d971600017018d501c7275aa1b1fa18b (diff)
downloadgn-gemtext-08d350bab237ece0ee2d71ca561d334b7c5d832b.tar.gz
Add brca search issue.
Diffstat (limited to 'issues/search-for-brca.gmi')
-rw-r--r--issues/search-for-brca.gmi10
1 files changed, 10 insertions, 0 deletions
diff --git a/issues/search-for-brca.gmi b/issues/search-for-brca.gmi
new file mode 100644
index 0000000..c42c745
--- /dev/null
+++ b/issues/search-for-brca.gmi
@@ -0,0 +1,10 @@
+# Search for brca
+
+* assigned: arun
+
+Search for brca does not return results for brca1 and brca2. It should.
+=> https://cd.genenetwork.org/gsearch?type=gene&terms=brca
+
+The xapian stemmer does not stem brca1 to brca. That's why when one searches for brca, results for brca1 are not returned.
+
+Perhaps we should write a custom stemmer that stems brca1 to brca. But, at the same time, we should be wary of stemming terms like p450 to p. Pjotr suggests the heuristic that we look for at least 2 or 3 alphabetic characters at the beginning. Another approach is to hard-code a list of candidates to look for.