aboutsummaryrefslogtreecommitdiff
path: root/wqflask/base/data_set.py
AgeCommit message (Collapse)Author
2022-02-21Fix incorrect dataset trait data cachingzsloan
Trait data caching wasn't working correctly because it didn't account for the samplelist, causing caching to work incorrect in any situation where the target dataset's samplelist wasn't the same as that of the trait being correlated against. Trait data is stored as a dictionary where the keys are trait IDs and values are *lists* of sample values. This means that the caching needs to account for the exact same set of samples; otherwise you'll end up with samples being mismatched (since "the third sample with a value" for one dataset's trait might not be the same as "the third sample with a value" for another dataset's trait). To fix this, I added the samplelist to the functions that generate and fetch the hash file. This will require more cache files, though, so this should probably be reexamined later to make the code work with only a single cache file for each dataset.
2021-11-25Delete noisy loggingBonfaceKilz
2021-11-23resolve merge conflictsAlexander Kabui
2021-11-23pep8 formatting remove dead variablesAlexander Kabui
2021-11-17refactor sql query & date formattingAlexander Kabui
2021-11-17replace redis fetch for cached timestamp with a queryAlexander Kabui
2021-11-17isolate SQL_URI parse to a functionAlexander Kabui
2021-11-17use a dynamic value for the db_nameAlexander Kabui
2021-11-17temp disable query cachingAlexander Kabui
2021-11-16use comprehension list;fix typoAlexander Kabui
2021-11-16pep8 formatting remove debug statementsAlexander Kabui
2021-11-16test generate new filesAlexander Kabui
2021-11-16refactor:fix on the query :modify cache pointAlexander Kabui
2021-11-04check for null timestampAlexander Kabui
2021-11-04store table timestamp on redisAlexander Kabui
2021-11-04init function to get the table timestampAlexander Kabui
2021-11-04pep8 formatting ,minor fixesAlexander Kabui
2021-11-04generate filename hashAlexander Kabui
2021-11-04fetch cached resultsAlexander Kabui
2021-11-04function to cache sample resultsAlexander Kabui
2021-08-17Merge pull request #598 from zsloan/feature/add_filter_by_study_sampleszsloan
Feature/add filter by study samples
2021-08-13base: data_set: Remove unnecessary comments and logging statementsBonfaceKilz
2021-08-12Return empty list instead of None in get_study_samplelistszsloan
2021-08-12Fix the way the study_sample_lists path is set and checkedzsloan
2021-08-12add function for reading in JSON file that lists sample lists unique to each ↵zsloan
study within a group (in this case only BXD Longevity for now)
2021-06-17delete loggers and commentsAlexander Kabui
2021-06-17sort sample name by sample_idsAlexander Kabui
2021-06-17remove unused functions + minor fixesAlexander Kabui
2021-06-17minor fixAlexander Kabui
2021-06-17pep8 formattingAlexander Kabui
2021-06-17add query for fetching probeset dataAlexander Kabui
2021-06-16Added function as_dict to data_set.py for situations where we don't want to ↵zsloan
pass around the whole dataset object
2021-05-10Merge branch 'testing' into feature/integrate-correlation-apiAlexander Kabui
2021-05-10add previous endpoint for correlationAlexander Kabui
2021-04-30autopep8: Fix E121,E122,E123,EI24,E125,E126,E127,E128,E129,E131,E133BonfaceKilz
2021-04-30autopep8: Fix W504BonfaceKilz
2021-04-30autopep8: Fix E501BonfaceKilz
2021-04-30autopep8: Fix E301,E302,E303,E304,E305,E306BonfaceKilz
2021-04-30autopep8: Fix E20-E27BonfaceKilz
Run: python -m autopep8 --in-place --recrusive ./ --select\ E20,E211,E22,E224,E224,E225,E226,E227,E228,E231,E241,\ E242,E251,E252,E26,E265,E266,E27 -p 3
2021-04-29Run `sed -i 's/(object)//g'`BonfaceKilz
See: https://is.gd/pL7IJF Ran: find . \( -type d -name .git -prune \) -o -type f -print0 | xargs -0 sed -i 's/(object)//g'
2021-04-29base: data_set: Remove unused methodBonfaceKilz
* wqflask/base/data_set.py (Dataset.get_accession_id): Delete it.
2021-04-29base: data_set: Apply pep-8BonfaceKilz
2021-04-29base: data_set: Remove unused methodBonfaceKilz
* wqflask/base/data_set.py (riset): Delete class method.
2021-04-29base: data_set: Rewrite data_set using @dataclassBonfaceKilz
@dataclasses should be used to hold only data. Since dataset only encapsulates data, re-writing it using a dataclass makes it more readable and easier to follow.
2021-04-23debugging for fetching probe dataAlexander Kabui
2021-04-23Remove stale commentsBonfaceKilz
2021-02-11Removed some log statementszsloan
2020-10-12Merge branch 'testing' into build/python3-migrationBonfaceKilz
2020-10-08Fixed remaining issue that applies to all dataset typeszsloan
* wqflask/base/data_set.py - Fixed issue where there was an error when trying to take the len of the query results when there were no results
2020-10-08Fixed issue where new phenotype groups wouldn't be saved to the ↵zsloan
self.datasets property because the group name was used as the key (instead of the group name + "Publish", which is the full dataset name for phenotypes) * wqflask/base/data_set.py - Set "group_name" as a separate variable from "name" to avoid it being used as the key in self.datasets