genenetwork3 - GeneNetwork3 REST API for data science and machine learning

Age	Commit message (Collapse)	Author
2022-04-12	Delete "get_allowable_sampledata_headers"	BonfaceKilz
	* gn3/csvcmp.py (get_allowable_sampledata_headers): Delete it. * tests/unit/test_csvcmp.py: Remove "get_allowable_sampledata_headers" import. (test_get_allowable_csv_headers): Delete it.
2022-04-12	Strip any newline, tab or carriage-return chars from sample data	BonfaceKilz
	* gn3/db/sample_data.py (get_trait_csv_sample_data): Strip out "\n", "\t", or "\r" from the sample data. See: <https://issues.genenetwork.org/issues/csv-error-ITP_10001-longevity-data-set.html>
2022-04-07	Fix pylint errors	BonfaceKilz

2022-04-07	Fix mypy error	BonfaceKilz

2022-04-07	Use case attribute id inside brackets if present during insertions	BonfaceKilz
	* gn3/db/sample_data.py (delete_sample_data): If an id is present in the column header, use it. * tests/unit/db/test_sample_data.py (test_delete_sample_data): Update tests to capture the above.
2022-04-07	Use case attribute id inside brackets if present during insertions	BonfaceKilz
	* gn3/db/sample_data.py (insert_sample_data): If an id is present in the column header, use it. * tests/unit/db/test_sample_data.py (test_insert_sample_data): Update tests to capture the above.
2022-04-07	Use case attribute id inside brackets if present during updates	BonfaceKilz
	* gn3/db/sample_data.py: Import "parse_csv_column". (update_sample_data): If an id is present in the column header, use it. * tests/unit/db/test_sample_data.py (test_update_sample_data): Update tests to capture the above.
2022-04-07	Add method for fetching the case_attributes	BonfaceKilz
	* gn3/db/sample_data.py (get_case_attributes): New function. * tests/unit/db/test_sample_data.py (test_get_case_attributes): Test case for the above.
2022-04-07	Run python black on file	BonfaceKilz
	* gn3/db/sample_data.py: Run "python black -l 79 ..."
2022-04-07	Add method for parsing a csv header from uploaded sample-data file	BonfaceKilz
	* gn3/csvcmp.py (parse_csv_column): New function. * tests/unit/test_csvcmp.py: Test case for the above.
2022-04-01	Run python-black in file	BonfaceKilz
	* gn3/csvcmp.py: Run "black -l 79 ..." * tests/unit/db/test_sample_data.py: Ditto. * tests/unit/test_csvcmp.py: Ditto.
2022-03-30	Revert "Run json.loads on request.get_json, since request.get_json was just ↵	Frederick Muriuki Muriithi
	returning a string" This reverts commit b93b22386056347d8002dd2e403425beeb4657cd. The appropriate fix should have been in GN2. The original statement args = request.get_json() was correct, since `request.get_json()` should return a python object parsed from the JSON string in the request. Unfortunately, GN2 was encoding the request data two times, which led to the call returning a JSON-encoded string instead of the expected object. The issue has been fixed in GN2 and therefore, the "fix" here can be reverted.
2022-03-29	Remove unused module	Frederick Muriuki Muriithi
	* Remove a module that is no longer in use
2022-03-28	Run json.loads on request.get_json, since request.get_json was just ↵	zsloan
	returning a string
2022-03-23	Run python-black on file and fix other pylint issues.	BonfaceKilz
	See: <https://ci.genenetwork.org/jobs/genenetwork3-pylint/126> * gn3/computations/rqtl.py: Run `black gn3/computations/rqtl.py`. Also, manually fix other pylint issues.
2022-03-22	Fixes pylint errors	zsloan

2022-03-22	Fixed mypy typing errors	zsloan

2022-03-22	Tried to make the docstrings more consistent	zsloan

2022-03-22	Add typing to some functions	zsloan

2022-03-22	Add functions for getting proximal/distal markers for each pseudomarker ↵	zsloan
	position in pair-scan results + return only the sorted top 500 results
2022-03-22	Fix issue that causes R/qtl to always run pair-scan even if pair-scan isn't ↵	zsloan
	selected
2022-03-22	Added genofile name to inputs for processing R/qtl pair-scan results, since ↵	zsloan
	it's needed to store the proximal/distal markers for each position
2022-03-22	Removed quotes from beginning and end of chromosome string	zsloan

2022-03-22	Fixed a cople function calls to use the updated function names	zsloan

2022-03-22	Create pairscan_for_figure and pairscan_for_table functions that return the ↵	zsloan
	Dict and List respectively used for the pair scan figure and the table showing the results
2022-03-22	Fix imports to import both process_rqtl_mapping and process_rqtl_pairscan in ↵	zsloan
	api/rqtl.py
2022-03-22	Added pairscan boolean kwarg and process_rqtl_pairscan function for reading ↵	zsloan
	in pairscan results + renamed process_rqtl_output to process_rqtl_mapping to distinguish between that and pairscan
2022-03-18	Clean all csv fields before diffing	BonfaceKilz
	There was a subtle bug where "csvdiff" generated an error related to "different column headings" caused something akin to diffing: "a, b \n, ..." with "a, b\n, ...". * gn3/csvcmp.py (csv_diff): Clean csv texts before any diffing. * tests/unit/test_csvcmp.py (test_csv_diff_same_columns): Modify test case to capture aforementioned bug.
2022-03-18	Create new function for cleaning individual fields in csv text	BonfaceKilz
	* gn3/csvcmp.py (clean_csv_text): New function. * tests/unit/test_csvcmp.py: Import "csv_text". (test_clean_csv_text): Test case for the above.
2022-03-15	Feature/refactored pca (#79)	Alexander Kabui
	* compute zscore function * test case for computing zscore * function to compute pca * generate scree plot data * generate new pca trait data from zscores and eigen_vec * remove redundant functions * generate factor loading table data * generate pca temp dataset dict * variable naming and error fixes * unit test for processing factor loadings * minor fixes for generating temp pca dataset * pass datetime as argument to generate_pca temp dataset function * add unittest for caching pca datasets * cache temp datasets * ignore missing imports for sklearn * mypy fixes * pylint fixes * refactor tests for pca * remove ununsed imports * fix for generating pca traits vals * mypy and code refactoring * pep8 formatting and add docstrings * remove comments /pep8 formatting * sort eigen vectors based on eigen values * minor fix for zscores * fix for rounding variance ratios * refactor tests * rename module to pca * rename datasets to traits * fix failing tests * fix caching function * fixes return x and y coordinates for scree plot * expand exception scope * fix for deprecated numpy.matrix function * fix for failing tests * pep8 fixes * remove comments * fix merge conflict * pylint fixes * rename module name to test_pca
2022-03-14	Dummy White Space commit to fix laminar	BonfaceKilz

2022-03-14	Only loop through the diff's modifications if it exists	BonfaceKilz

2022-03-14	Given a csv text and permissible headers, extract invalid headers	BonfaceKilz
	* gn3/csvcmp.py (extract_invalid_csv_headers): New function. * tests/unit/test_csvcmp.py: Import "extract_invalid_csv_headers". (test_extract_invalid_csv_headers_with_some_wrong_headers): Test case for the above.
2022-03-14	Get all permissible column data	BonfaceKilz
	* gn3/csvcmp.py: Import "Any" and "List". (get_allowable_sampledata_headers): New function. * tests/unit/test_csvcmp: Import "get_allowable_sampledata_headers". (test_get_allowable_csv_headers): Test case for the above.
2022-03-12	Remove unused imports	BonfaceKilz

2022-03-12	Fix mypy issues	BonfaceKilz

2022-03-12	Fix pylint issues	BonfaceKilz

2022-03-12	Compose csv-diff command within single quotes	BonfaceKilz
	* gn3/csvcmp.py (csv_diff): Use single quotes. There was a change in 6d39c92 that broke this.
2022-03-12	Delete noisy "print" statement	BonfaceKilz

2022-03-12	Store the first element as strain_id	BonfaceKilz

2022-03-12	Append the strain name when extracting "actions"	BonfaceKilz
	* gn3/db/sample_data.py (__extract_actions): During updates, make sure that the strain name is part of the returned string when extracting "actions". * tests/unit/db/test_sample_data.py: Add test cases for the above.
2022-03-12	Apply auto-pep8 to sample_data.py and it's test file	BonfaceKilz

2022-03-12	Add missing return type-annotations	BonfaceKilz
	* tests/unit/db/test_sample_data.py (delete_sample_data): Add missing return type for type annotations. (insert_sample_data): Ditto.
2022-03-12	Update how data is updated by re-using existing functions	BonfaceKilz
	* gn3/db/sample_data.py (get_sample_data_ids): Re-use "delete_sample_data" and "insert_sample_data" when updating data; and also add logic for updating modified data. * tests/unit/db/test_sample_data.py: Add tests for the above.
2022-03-12	Create action dict that's created when updating data	BonfaceKilz
	* gn3/db/sample_data.py (__extract_actions): An update on a vector of data can contain: inserts, deletes and updates. This functions extracts these actions during an update. * tests/unit/db/test_sample_data.py (test_extract_actions): Add test-case for the above.
2022-03-12	Remove check for inserted data when inserting individual data	BonfaceKilz
	* gn3/db/sample_data.py (insert_sample_data)[__insert_data]: Move check to the main body. With this check here, you have 3 redundant checks. For a successful insert, it will insert the first value to the `PublishData` table and ignore the rest of the inserts.
2022-03-12	Make `_map` a constant	BonfaceKilz
	* gn3/db/sample_data.py: Now constant, `_MAP`. (delete_sample_data)[__delete_data]: Replace `_map` with `_MAP`. (insert_sample_data)[__insert_data]: Ditto.
2022-03-12	Fix faulty SQL query string when deleting case-attributes	BonfaceKilz

2022-03-12	Explicitly get CaseAttributeId and fix broken sql query	BonfaceKilz
	* gn3/db/sample_data.py (insert_sample_data): Use correct query string. Also, use CaseAttributeId to determine whether case-attributes were inserted. If so, do not attempt an insert.
2022-03-12	Remove duplicate params	BonfaceKilz
	* gn3/db/sample_data.py (insert_sample_data)[__insert_case_attribute]: Remove extra parameters.