genenetwork3 - GeneNetwork3 REST API for data science and machine learning

Age	Commit message	Author
2023-01-18	Tolerate WEIGHT operator in parsed search queries.	Arun Isaac
	* gn3/api/search.py (remove_synteny_field): Tolerate WEIGHT operator in parsed search queries.
2023-01-18	Implement synteny search.	Arun Isaac
	* gn3/api/search.py: Import gzip, Path from pathlib and curry from pymonad.tools. (IntervalLiftoverFunction): New variable. (query_subqueries, query_terms, field_processor_or, liftover, liftover_interval, parse_synteny_field, is_synteny_on, remove_synteny_field): New functions. (parse_location_field): Generalize to support synteny searches. (parse_query): Support synteny search queries. (search_results): Pass synteny files directory to parse_query.
2023-01-18	Support location shorthands in search queries.	Arun Isaac
	* gn3/api/search.py: Import partial and reduce from functools. Import Callable from typing. (ChromosomalPosition, ChromosomalInterval, FieldProcessor): New classes. (apply_si_suffix, combine_queries, parse_location_field, interval_start, interval_end): New functions. (parse_query): Add field processors for location shorthands.
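To illustrate the location shorthands mentioned in the commit above, here is a minimal sketch of how an SI suffix might be expanded into a base-pair position. The function name `expand_si_suffix`, the accepted suffixes, and the behavior are assumptions made for illustration; the actual `apply_si_suffix` in gn3/api/search.py may work differently.

```python
def expand_si_suffix(location: str) -> int:
    """Expand a shorthand such as "5M" or "250k" into an integer position.

    Hypothetical helper: the suffix table below is an assumption,
    not taken from the GeneNetwork3 source.
    """
    suffixes = {"k": 1_000, "m": 1_000_000}
    text = location.strip()
    multiplier = suffixes.get(text[-1:].lower())
    if multiplier is None:
        # No recognized suffix: treat the whole string as a plain integer.
        return int(text)
    return int(float(text[:-1]) * multiplier)
```

With this sketch, a query value such as "5M" would be read as position 5000000 before being turned into a range query.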
2023-01-18	Split off search query parsing into separate function.	Arun Isaac
	* gn3/api/search.py (parse_query): New function. (search_results): Use parse_query.
2022-12-06	Use a dataset's name to fetch its metadata from RDF	Munyoki Kilyungi
	* gn3/api/metadata.py (jsonify_dataset_metadata): Rewrite metadata end-point to use a dataset's name instead of its accession_id. * gn3/db/rdf.py (get_dataset_metadata): Replace accession_id with name. Use a single RDF query instead of multiple queries.
2022-11-28	Catch a URLError when virtuoso isn't running at all	Munyoki Kilyungi
	* gn3/api/metadata.py: Import URLError. (jsonify_dataset_metadata): Add URLError in except block.
2022-11-28	Fix pylint errors	Munyoki Kilyungi
	* gn3/api/general.py: Delete rdf import. Delete trailing white spaces. * gn3/api/metadata.py: Delete trailing white spaces.
2022-11-28	Fail gracefully when virtuoso is not accessible	Munyoki Kilyungi
	* gn3/api/metadata.py: Import RemoteDisconnected. (jsonify_dataset_metadata): Wrap get_dataset_metadata in try block.
2022-11-23	Create a new REST endpoint for fetching a phenotype's metadata	Munyoki Kilyungi
	* gn3/api/general.py (dataset_metadata): Delete. * gn3/api/metadata.py: Import Blueprint, jsonify, current_app, SPARQLWrapper and get_dataset_metadata. (metadata): New Blueprint. (jsonify_dataset_metadata): New function/end-point. * gn3/app.py: Import metadata. (create_app): Register metadata blueprint.
2022-11-02	Re-implement RDF related code to use monads in its own module	Munyoki Kilyungi
	* gn3/api/general.py: Replace gn3.db.datasets import with gn3.db.rdf. (dataset_metadata) <jsonify>: Replace datasets.dataset_metadata with rdf.get_dataset_metadata. * gn3/db/datasets.py: Remove unused imports. (sparql_query, dataset_metadata): Delete. * gn3/db/rdf.py: (sparql_query, get_dataset_metadata): New functions.
2022-10-28	Use Xapian index path from app settings.	Arun Isaac
	App settings should be accessed from current_app. It should not be hard-coded to a variable in a module. * gn3/db_utils.py: Do not import XAPIAN_DB_PATH from gn3.settings. (xapian_database): Accept path argument. * gn3/api/search.py: Import current_app from flask. (search_results): Pass Xapian index path to xapian_database.
2022-10-28	Add search.	Arun Isaac
	* gn3/api/search.py: New file. * gn3/app.py: Register the search blueprint.
2022-07-28	Add command to run the sample correlations in an external process	Frederick Muriuki Muriithi

2022-05-26	Add Endpoint to get menu items for use in UI	Frederick Muriuki Muriithi

2022-05-24	Run partial correlations with external script	Frederick Muriuki Muriithi
	Use new external script to run the partial correlations for both cases, i.e. - against an entire dataset, or - against selected traits
2022-05-24	New script to compute partial correlations	Frederick Muriuki Muriithi
	* Add a new script to compute the partial correlations against: - a select list of traits, or - an entire dataset, depending on the specified subcommand. This new script is meant to supersede the `scripts/partial_correlations.py` script. * Fix the check for errors. * Reorganise the order of arguments for the `partial_correlations_with_target_traits` function: move the `method` argument before the `target_trait_names` argument so that the common arguments in the partial correlation computation functions share the same order.
2022-05-21	Fix linting errors	Frederick Muriuki Muriithi

2022-05-16	Run computation in one-shot asynchronous process	Frederick Muriuki Muriithi
	After reworking the worker/runner to have a one-shot mode, add a function that queues up the task and then runs the worker in the one-shot mode to process the computation in the background.
2022-05-06	Fix linting and typing errors	Frederick Muriuki Muriithi

2022-05-06	Hook up pcorrs with target traits computations	Frederick Muriuki Muriithi
	Enable the endpoint to actually compute partial correlations with selected target traits rather than against an entire dataset. Fix some issues, caused by a recent refactor, that broke pcorrs against a dataset.
2022-03-30	Revert "Run json.loads on request.get_json, since request.get_json was just returning a string"	Frederick Muriuki Muriithi
	This reverts commit b93b22386056347d8002dd2e403425beeb4657cd. The appropriate fix should have been in GN2. The original statement `args = request.get_json()` was correct, since `request.get_json()` should return a Python object parsed from the JSON string in the request. Unfortunately, GN2 was encoding the request data twice, which led to the call returning a JSON-encoded string instead of the expected object. The issue has been fixed in GN2, and therefore the "fix" here can be reverted.
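The double-encoding problem described in the revert above can be reproduced with the standard library alone. This is a minimal sketch; the payload contents are made up for illustration and do not come from GN2 or GN3.

```python
import json

payload = {"primary_trait": "example"}  # hypothetical request data

encoded_once = json.dumps(payload)
# The mistake: encoding a string that is already JSON text.
encoded_twice = json.dumps(encoded_once)

# A single decode now yields a str containing JSON, not a dict.
decoded = json.loads(encoded_twice)

# Decoding a second time hides the problem instead of fixing the sender.
decoded_again = json.loads(decoded)
```

Here `decoded` is a string while `decoded_again` equals the original dictionary, which is why the extra `json.loads` call appeared to work even though the real fault was in the sender.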
2022-03-28	Run json.loads on request.get_json, since request.get_json was just returning a string	zsloan

2022-03-22	Fixes pylint errors	zsloan

2022-03-22	Fix issue that causes R/qtl to always run pair-scan even if pair-scan isn't selected	zsloan

2022-03-22	Added genofile name to inputs for processing R/qtl pair-scan results, since it's needed to store the proximal/distal markers for each position	zsloan

2022-03-22	Fix imports to import both process_rqtl_mapping and process_rqtl_pairscan in api/rqtl.py	zsloan

2022-03-22	Added pairscan boolean kwarg and process_rqtl_pairscan function for reading in pairscan results + renamed process_rqtl_output to process_rqtl_mapping to distinguish between that and pairscan	zsloan

2022-03-11	Fix some linting issues	Frederick Muriuki Muriithi

2022-03-08	Fix tests, and issues caught by tests	Frederick Muriuki Muriithi
	Fix some issues caught by tests due to changes introducing the hand-off of the partial correlations computations to an external process. Fix some issues due to the changes that introduce context managers for database connections. Update some tests to take the above two changes into consideration.
2022-03-08	Create database connections within context managers	Frederick Muriuki Muriithi
	Use the `with` context manager to open database connections, so as to ensure that those connections are closed once the call is completed. This hopefully avoids the 'too many connections' error
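The connection-handling pattern described above can be sketched as follows. This example uses `sqlite3` from the standard library purely as a stand-in; the database driver, the function name `count_schema_objects`, and the query are assumptions and not taken from the GeneNetwork3 code.

```python
import sqlite3
from contextlib import closing


def count_schema_objects(db_path: str) -> int:
    """Open a connection, run one query, and always close the connection."""
    # closing() calls conn.close() on exit, even if the query raises.
    with closing(sqlite3.connect(db_path)) as conn:
        with closing(conn.cursor()) as cursor:
            cursor.execute("SELECT COUNT(*) FROM sqlite_master")
            return cursor.fetchone()[0]
```

Because the connection is released at the end of every call, repeated requests do not accumulate open connections.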
2022-03-04	Automatically decode Redis strings	Frederick Muriuki Muriithi

2022-03-03	Add endpoint for checking state of external processes	Frederick Muriuki Muriithi
	Long-running computations are handed off to external processes. This avoids timeouts in the webserver, and also reduces chances of instability of the webserver. The results of these long-running computations are needed eventually, so this commit provides a way to check for the state of the computation, and the results if any.
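As a rough illustration of handing work to an external process and checking its state later, here is a sketch built only on `subprocess`. The function names and the shape of the status dictionary are hypothetical; the actual endpoint and its job bookkeeping are not shown in this log and may differ substantially.

```python
import subprocess
import sys


def start_computation(code: str) -> subprocess.Popen:
    """Launch a separate Python interpreter to run the given code."""
    return subprocess.Popen(
        [sys.executable, "-c", code],
        stdout=subprocess.PIPE,
        text=True,
    )


def computation_state(process: subprocess.Popen) -> dict:
    """Report whether the external process is running or has finished."""
    if process.poll() is None:
        return {"status": "running"}
    status = "success" if process.returncode == 0 else "error"
    return {"status": status, "return_code": process.returncode}
```

A caller would start the computation, return immediately, and poll `computation_state` until the status is no longer "running" before reading the results.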
2022-03-03	Run partial correlations in an external process	Frederick Muriuki Muriithi
	Run the partial correlations code in an external Python process, decoupling it from the server and making it asynchronous. Summary of changes: * gn3/api/correlation.py: - Remove response processing code - Queue partial corrs processing - Create new endpoint to get results * gn3/commands.py: - Compose the pcorrs command to be run in an external process - Enable running of subprocess commands with list args * gn3/responses/__init__.py: new module indicator file * gn3/responses/pcorrs_responses.py: Hold response processing code extracted from ~gn3.api.correlations.py~ file * scripts/partial_correlations.py: CLI script to process the pcorrs * sheepdog/worker.py: - Add the genenetwork3 path at the beginning of the ~sys.path~ list to override any GN3 in the site-packages - Add any environment variables to be set for the command to be run
2022-02-21	Fix a myriad of linter issues	Frederick Muriuki Muriithi
	* Use `with` in place of plain `open` * Use f-strings in place of `str.format()` * Remove string interpolation from queries - provide data as query parameters * other minor fixes
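The point about providing data as query parameters, rather than interpolating it into the query string, can be sketched as below. The table, column names, and sample values are invented for the example, and `sqlite3` stands in for whichever driver the project uses (placeholder syntax varies between drivers).

```python
import sqlite3


def fetch_trait_ids(conn: sqlite3.Connection, trait_name: str) -> list:
    """Look up trait ids by name, passing the name as a bound parameter."""
    # The "?" placeholder lets the driver handle quoting and escaping.
    query = "SELECT id FROM traits WHERE name = ?"
    return [row[0] for row in conn.execute(query, (trait_name,))]


# Hypothetical data for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE traits (id INTEGER, name TEXT)")
conn.execute("INSERT INTO traits VALUES (1, 'bw10')")

# With string interpolation this input would match every row.
malicious = "bw10' OR '1'='1"
```

With a bound parameter, the malicious input is compared as a literal string and matches nothing.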
2022-02-18	Test partial correlations endpoint with non-existent primary traits	Frederick Muriuki Muriithi
	Test that the partial correlations endpoint responds with an appropriate "not-found" message and the corresponding 404 status code when the requested primary trait does not exist in the database. Summary of the changes in each file: * gn3/api/correlation.py: generalise the building of the response * gn3/computations/partial_correlations.py: return with a "not-found" if the primary trait does not exist in the database * gn3/db/partial_correlations.py: Fix a number of bugs that led to exceptions in the case that the primary trait did not exist * pytest.ini: register a `slow` pytest marker * tests/integration/test_partial_correlations.py: Add a new test to check for an appropriate 404 response in case of a primary trait that does not exist in the database.
2022-02-17	Test partial correlations endpoint with missing data in POST request	Frederick Muriuki Muriithi
	Add a test for the partial correlations endpoint, with: - no data in the request - missing items in the data Fix the bugs caught by the test
2022-02-02	response object error fix	Alexander Kabui

2022-02-02	pep8 formatting	Alexander Kabui

2022-02-02	return 401 on request fail	Alexander Kabui

2022-02-02	pep8 formatting	Alexander Kabui

2022-02-02	new line fix	Alexander Kabui

2022-01-22	generate required json data for ctl api	Alexander Kabui

2022-01-22	add endpoint for ctl	Alexander Kabui

2022-01-10	Convert NaN to None	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/partial-correlations.gmi Comment: https://github.com/genenetwork/genenetwork3/pull/67#issuecomment-1000828159 * Convert NaN values to None to avoid possible bugs with the string replace method used before.
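A minimal sketch of converting NaN values to None before JSON encoding is shown below. The helper name `nan_to_none` and the sample data are assumptions for illustration, not the code from the commit.

```python
import json
import math


def nan_to_none(value):
    """Recursively replace float NaN values with None."""
    if isinstance(value, float) and math.isnan(value):
        return None
    if isinstance(value, dict):
        return {key: nan_to_none(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [nan_to_none(item) for item in value]
    return value


# Hypothetical results containing NaN values.
results = {"corr": float("nan"), "p_values": [0.05, float("nan")]}
encoded = json.dumps(nan_to_none(results))
```

Converting the values before encoding avoids editing the JSON text afterwards, where a plain string replace could also alter unrelated strings that happen to contain "NaN".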
2021-12-24	Replace `NaN` with `null` in JSON string	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/partial-correlations.gmi * `NaN` is not a valid JSON value, and leads to errors in the code. This commit replaces all `NaN` values with `null`.
2021-12-24	Encode the data to JSON and set the status code	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/partial-correlations.gmi * Encode bytes objects to string * Encode NaN values to "null" * gn3/api/correlation.py:
2021-12-24	Add API endpoint for partial correlations	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/partial-correlations.gmi * Add an API endpoint for the partial correlation. * gn3/api/correlation.py:
2021-12-17	Add API endpoint for partial correlations	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/partial-correlations.gmi * Add an API endpoint for the partial correlation.
2021-12-02	Implement dataset metadata API endpoint.	Arun Isaac
	* guix.scm: Import (gnu packages rdf). (genenetwork3)[propagated-inputs]: Add python-sparqlwrapper. * gn3/settings.py (SPARQL_ENDPOINT): New variable. * gn3/api/general.py: Import datasets from gn3.db. (dataset_metadata): New API endpoint. * gn3/db/datasets.py: Import re, Template from string, Dict and Optional from typing, JSON and SPARQLWrapper from SPARQLWrapper, SPARQL_ENDPOINT from gn3.settings. (sparql_query, dataset_metadata): New functions.
2021-10-19	Enable vertical and horizontal heatmaps	Frederick Muriuki Muriithi
	Issue: https://github.com/genenetwork/gn-gemtext-threads/blob/main/topics/gn1-migration-to-gn2/non-clustered-heatmaps-and-flipping.gmi * Update the request endpoint, so that it produces a vertical or horizontal heatmap depending on the user's request.