aboutsummaryrefslogtreecommitdiff
path: root/gn3/computations
AgeCommit message (Collapse)Author
2022-11-08Fix issue where only primary samples were used when all samples arezsloan
selected This is because base_samples was set to all_samples_ordered, which only includes primary samples + parents/f1s. Setting this to an empty list fixed the issue and caused it to use all samples again.
2022-10-22fix issue;parsing sample data (#102)Alexander Kabui
2022-10-06DEBUG: Collect more info for debuggingFrederick Muriuki Muriithi
2022-10-06DEBUG: Collect more information on possible failure.Frederick Muriuki Muriithi
2022-10-03Bug: Do not process data for output too earlyFrederick Muriuki Muriithi
* gn3/computations/rust_correlation.py: (parse_tissue_corr_data): The function was parsing data from a file, and then converting the parsed values into a comma-separated string for output, despite the fact that the data parsed is not output in this specific function. This is a really bad idea, and leads to errors showing up down the line, far removed from the cause.
2022-09-30Add escape char to avoid error notice.Frederick Muriuki Muriithi
2022-09-29Use the built-in csv module for file outputFrederick Muriuki Muriithi
Use the built-in csv module for file output in order to avoid subtle errors in the formats.
2022-08-25revert for commit 6db3b2af8fc09999996d3acc09890a06f179c669;BugAlexander_Kabui
2022-08-12Add mandatory/expected fields to parsed data.Frederick Muriuki Muriithi
2022-08-10Fix returned value for 'parse_correlation_output'Frederick Muriuki Muriithi
* Always return dict * Fix internal '__parse_line__' to always have a valid return
2022-08-10code refactoringFrederick Muriuki Muriithi
2022-08-10remove unnecessary parsing of floatsAlexander_Kabui
2022-08-10code refactoringFrederick Muriuki Muriithi
2022-08-10remove unnecessary parsing of floatsAlexander_Kabui
2022-08-10enhancements for rust tissueFrederick Muriuki Muriithi
2022-08-10pylint fixesAlexander_Kabui
2022-08-10code refactoringAlexander_Kabui
2022-08-10remove unnessary codeAlexander_Kabui
2022-08-10remove unnecessary parsing of floatsAlexander_Kabui
2022-08-10merge for resultsAlexander_Kabui
2022-08-10enhancements for rust tissueAlexander_Kabui
2022-08-03Refactor: Remove unnecessary whitespace and nameFrederick Muriuki Muriithi
2022-08-03Refactor: Remove unnecessary iterations and nameFrederick Muriuki Muriithi
* remove extra iteration that is unnecessary * remove unnecessary variables
2022-07-29Fix failing testsFrederick Muriuki Muriithi
* gn3/computations/correlations.py: rename function * mypy.ini: deactivate mypy error about missing imports * tests/integration/test_correlation.py: mock correct function * tests/unit/computations/test_correlation.py: test correct function
2022-07-28Update sample correlations code to use multiprocessingFrederick Muriuki Muriithi
* To help speed up the processing of the correlations, convert the `compute_all_sample_correlation` function to use the multiprocessing module.
2022-07-22mypy fixesAlexander_Kabui
2022-07-22pep8 fixesAlexander_Kabui
2022-07-22linting and tests fixesAlexander_Kabui
2022-07-22compute num overlapAlexander_Kabui
2022-07-22fix for bug splitting list dataAlexander_Kabui
2022-07-22minor fixes for parsing dataAlexander_Kabui
2022-07-22code to parse tissue correlation inputAlexander_Kabui
2022-07-22mypy pylint fixesAlexander_Kabui
2022-07-22code to fetch correct sample corr dataAlexander_Kabui
2022-07-22delete dead_code;rename variablesAlexander_Kabui
2022-07-22minor fixesAlexander_Kabui
2022-06-29use explicit parametersAlexander_Kabui
2022-06-29fix issuesAlexander_Kabui
2022-06-29pylint fixesAlexander
2022-06-29minor fixesAlexander
2022-06-29handle tmp files generationAlexander
2022-06-29parse output dataAlexander
2022-06-29pep8 formattingAlexander
2022-06-29init rust correlation moduleAlexander
2022-05-30Pass trait data as args to `fix_strains` and fix some bugsFrederick Muriuki Muriithi
The `fix_strains` function works on the trait data, not the basic trait info. This commit fixes the arguments passed to the function, and also some bugs in the function.
2022-05-24New script to compute partial correlationsFrederick Muriuki Muriithi
* Add a new script to compute the partial correlations against: - a select list of traits, or - an entire dataset depending on the specified subcommand. This new script is meant to supercede the `scripts/partial_correlations.py` script. * Fix the check for errors * Reorganise the order of arguments for the `partial_correlations_with_target_traits` function: move the `method` argument before the `target_trait_names` argument so that the common arguments in the partial correlation computation functions share the same order.
2022-05-21Use multiprocessing to improve performanceFrederick Muriuki Muriithi
2022-05-21Process primary, target and control traits in a single iterationFrederick Muriuki Muriithi
Rework the code to process the traits in a single iteration to improve performance.
2022-05-21Return generator object rather than tuplesFrederick Muriuki Muriithi
Return generator objects rather than pre-computed tuples to reduce the number of iterations needed to process the data, and thus improve the performance of the system somewhat.
2022-05-06Fix linting and typing errorsFrederick Muriuki Muriithi