Age | Commit message (Collapse) | Author |
|
* remove extra iteration that is unnecessary
* remove unnecessary variables
|
|
* gn3/computations/correlations.py: rename function
* mypy.ini: deactivate mypy error about missing imports
* tests/integration/test_correlation.py: mock correct function
* tests/unit/computations/test_correlation.py: test correct function
|
|
|
|
* To help speed up the processing of the correlations, convert the
`compute_all_sample_correlation` function to use the multiprocessing module.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Allow the resources (endpoints) to be accessible from anywhere on the internet
by default, while still allowing for restriction via CLI environment variable
on a case by case basis.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
To reduce the chances of the system failing due to the external process being
launched with the wrong parameters, add a parsing stage that converts the
method from the UI into a form acceptable by the CLI script.
* gn3/commands.py: parse the method from UI
* scripts/partial_correlations.py: simplify the acceptable methods
|
|
* gn3/fs_helpers.py: Remove unused "pathlib" import.
(cache_ipfs_file): Disable "unused-argument" warting.
|
|
This library pollutes the Genenetwork2 profile with an old version
"dataclasses" thereby causing it to fail.
|
|
* gn3/db/correlations.py (__fetch_data__): Use a more readable code as
opposed to an error code.
|
|
* gn3/db/correlations.py (__fetch_data__): Ignore "Too many args" [R0913]
error.
|
|
* gn3/db/correlations.py (__build_query__): Ignore the "sample_ids" and
"joins" types when calling build_query_sgo_lit_corr
(fetch_all_database_data): Ignore the return type.
TODO: Ping Alex/Arun to fix this.
|
|
ATM, it's very difficult to work the correct type that is returned. Ignore
this for now and fix this later.
|
|
* gn3/genodb.py (get): Delete function.
(matrix): Use db.txn.get instead of get.
|
|
* gn3/genodb.py (GenotypeMatrix): Match class and function names.
|
|
db is unused. nrows and ncols are available in the array and transpose numpy
arrays.
* gn3/genodb.py (GenotypeMatrix)[db, nrows, ncols]: Delete fields.
* gn3/genodb.py (matrix): Do not initialize db, nrows and ncols fields.
|
|
* gn3/genodb.py: Mention reading entire matrix in module docstring.
|
|
- Have "Pearson's r" and "Spearman's rho" as the only valid choices for the
partial correlations
|
|
* gn3/genodb.py (Matrix): Rename to GenotypeMatrix.
(matrix): Update invocation of Matrix.
|
|
* gn3/genodb.py: Document nparray in the module docstring.
(nparray): New function.
|
|
The genotype database now stores the current version of the matrix alone in a
read-optimized form, while storing the older versions of the matrix in a more
compressed form. We are only interested in the current version of the
matrix. So, always use the read optimized storage.
* gn3/genodb.py (Matrix)[row_pointers, column_pointers]: Delete fields.
[array, transpose]: New fields.
* gn3/genodb.py (matrix, row, column): Read from read-optimized storage.
(vector_ref): Delete function.
|
|
* gn3/genodb.py: Remove blank line in module docstring.
|
|
We rewrite genodb using only functions. This makes for much more readable
code.
* gn3/genodb.py: Rewrite without classes.
|
|
* gn3/genodb.py (Matrix.__init__): Retrieve column pointers from database.
(row): Abstract out vector access code to ...
(Matrix.__vector): ... here.
(Matrix.column): New method.
|
|
The genotype database format now supports versioning of matrices. So, we
update genodb.py to return only the most recent genotype matrix.
* gn3/genodb.py (GenotypeDatabase.matrix): Return only the most recent
genotype matrix.
|
|
* gn3/genodb.py (GenotypeDatabase.__init__): Open genotype database in
read-only mode.
|
|
* gn3/genodb.py (GenotypeDatabase.__init__): Do not create genotype database
if it does not exist.
|
|
It has been decided that the genotype database will use little endianness
wherever applicable.
* gn3/genodb.py (Matrix.__init__): Remove TODO note to decide on endianness.
|
|
* gn3/genodb.py (GenotypeDatabase.get_metadata, GenotypeDatabase.matrix): Do
not terminate database strings with the null character.
|
|
genodb is a tiny library to read our new genotype database file format.
* gn3/genodb.py: New file.
|