# Full text search

Implement full text search on the dumped RDF using Xapian. This could be implemented in the dump-genenetwork-database repo using guile-xapian.

=> https://xapian.org/ Xapian search engine library
=> https://git.genenetwork.org/arunisaac/dump-genenetwork-database dump-genenetwork-database repository

* assigned: aruni, bonfacem

* [x] Add type hinting to the index-genenetwork script in gn3

## Virtuoso full text search

We also considered Virtuoso's built-in support for full text search in SPARQL. However, its query syntax is not as natural as Xapian's, and it may not support stemming. So, this option was rejected.

=> http://docs.openlinksw.com/virtuoso/rdfsparqlrulefulltext/ Virtuoso full text search

## Completion

An initial full text search has been implemented. This initial implementation indexes data from the SQL database and provides an API to search that index. See the index-genenetwork script and the search API implementation in the genenetwork3 repository.

=> https://github.com/genenetwork/genenetwork3/blob/main/scripts/index-genenetwork index-genenetwork script
=> https://github.com/genenetwork/genenetwork3/blob/main/gn3/api/search.py Search API implementation
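
For orientation, here is a minimal sketch of the index-then-search flow using the Xapian Python bindings. It is not the code in the genenetwork3 repository. The helper names (record_text, index_records, search), the record fields and the "Q" identifier prefix are assumptions made for illustration, and the sketch assumes the xapian Python module is installed.

```python
import json

try:
    import xapian  # Xapian Python bindings, usually installed system-wide
except ImportError:
    xapian = None  # the pure helper below still works without Xapian


def record_text(record):
    """Join the non-empty values of a record into one string to index.

    Hypothetical helper: the real script decides per field what to index.
    """
    return " ".join(str(v) for v in record.values() if v not in (None, ""))


def index_records(db_path, records):
    """Index (identifier, record) pairs, stemming terms as English."""
    db = xapian.WritableDatabase(db_path, xapian.DB_CREATE_OR_OPEN)
    termgenerator = xapian.TermGenerator()
    termgenerator.set_stemmer(xapian.Stem("en"))
    for identifier, record in records:
        doc = xapian.Document()
        termgenerator.set_document(doc)
        termgenerator.index_text(record_text(record))
        # Store the whole record so that search results can return it.
        doc.set_data(json.dumps(record))
        # A unique boolean term lets re-indexing replace the old document.
        idterm = "Q" + identifier
        doc.add_boolean_term(idterm)
        db.replace_document(idterm, doc)
    db.commit()


def search(db_path, querystring, limit=10):
    """Return the stored records matching a free-text query."""
    db = xapian.Database(db_path)
    parser = xapian.QueryParser()
    parser.set_stemmer(xapian.Stem("en"))
    parser.set_stemming_strategy(xapian.QueryParser.STEM_SOME)
    enquire = xapian.Enquire(db)
    enquire.set_query(parser.parse_query(querystring))
    return [json.loads(match.document.get_data())
            for match in enquire.get_mset(0, limit)]
```

A search API endpoint could then call something like search(db_path, "hedgehog signalling") and return the results as JSON. Using the same stemmer at index time and at query time is what lets inflected forms of a word match each other.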

In the future, full text search must index RDF rather than data from SQL. Please open a new issue or reopen this issue when working on that.

* closed