aboutsummaryrefslogtreecommitdiff
path: root/scripts/rqtl2
AgeCommit message (Collapse)Author
2024-12-03Improve error textFrederick Muriuki Muriithi
Move the list to the end of the text to make it easier to clip text in the UI.
2024-12-03Update file metadata using a mappingFrederick Muriuki Muriithi
Update the metadata using a mapping rather than updating a field at a time to make it easier to ensure everything is updated in one go.
2024-12-03Provide consistent way to build file-specific fully-qualified keysFrederick Muriuki Muriithi
2024-12-03Save 'pheno' errors in redis as they are found.Frederick Muriuki Muriithi
2024-12-03Add some phenocovar metadata.Frederick Muriuki Muriithi
2024-12-03Save phenocovar errors in redis as they are found.Frederick Muriuki Muriithi
2024-12-03Pass the redis connection and fully qualified job idFrederick Muriuki Muriithi
Pass the redis connection on to the function used to build main since it might need to use a connection to redis. Also pass the computed fully qualified job id rather than recomputing it every time. Update dependent functions to take the new arguments.
2024-12-03Wrap everything in try-catch to handle errors gracefully.Frederick Muriuki Muriithi
Put everything in the build_main function within a try-catch block to ensure we capture all exceptions that might occur in different scripts and log them out. This helps with debugging errors in the asynchronous scripts.
2024-12-02Fix typo.Frederick Muriuki Muriithi
2024-12-02Send logs to STDOUT too.Frederick Muriuki Muriithi
2024-12-02Return 0 on completion of QC processing.Frederick Muriuki Muriithi
2024-12-02Change from 'pending' to 'started' on entry to script.Frederick Muriuki Muriithi
Update the status of the job from 'pending' to 'started' at the entry-point of the script.
2024-11-22Use gn-libs code for db connection.Frederick Muriuki Muriithi
Use the code in gn-libs to connect to the database, rather than a local module.
2024-10-24Reduce number of local variables created in function.Frederick Muriuki Muriithi
Use the values in the argparse.Namespace object directly to avoid creating many local variables.
2024-10-24Add `jobid` to extraction directoryFrederick Muriuki Muriithi
Add the job id to help "group" files that belong to the same job.
2024-10-24Check pheno, phenose, and phenonum files with loggingFrederick Muriuki Muriithi
Add logging to the `qc_pheno_file` function so that we get the messages pushed to redis for every file that is being checked for errors.
2024-10-24Check phenocovar files with loggingFrederick Muriuki Muriithi
Build a function-scope logger using the new `scripts.redis_logger.RedisMessageListHandler` log handler to log-out the messages for each phenocovar file being processed in a more thread-safe way.
2024-10-24Fix linting and typing errors.Frederick Muriuki Muriithi
2024-10-24Parse entire namespace object rather than individual values.Frederick Muriuki Muriithi
2024-10-24Move logger creation to `build_main` functionFrederick Muriuki Muriithi
Since the module-level loggers are built mostly the same, move the creation of the logger to the more general function to reduce repetition.
2024-10-22Refactor `qc_pheno_file` and reuse it for different file types.Frederick Muriuki Muriithi
The QC/QA steps taken by the `qc_pheno_file` function are very similar across the "pheno", "phenose" and "phenonum" files. This commit makes the `qc_pheno_file` function a higher-order function and we pass the file-type specific check(s) as a callable (function) to be used for the QC/QA process.
2024-10-22Check for errors in `pheno` files.Frederick Muriuki Muriithi
2024-10-21Check `phenocovar` files for errors.Frederick Muriuki Muriithi
2024-10-17Cleanup: Delete all extracted files after processing.Frederick Muriuki Muriithi
2024-10-17Leave TODO notes for what needs to be done.Frederick Muriuki Muriithi
2024-10-17Fetch samples from databaseFrederick Muriuki Muriithi
Fetch the samples from the database. These will be used to verify that the samples in the phenotype files already exist in the database and are valid.
2024-10-17Undo transpose for any transposed filesFrederick Muriuki Muriithi
To reduce the complexity involved in the processing of the files, we undo any transposition of the CSV files for those files that are marked as transposed.
2024-10-17Pass new arguments to QC function.Frederick Muriuki Muriithi
2024-10-17Add `speciesid` and `populationid` arguments to the script.Frederick Muriuki Muriithi
2024-10-17Add the working directory argument to the script.Frederick Muriuki Muriithi
Add a `--working-dir` argument to allow passing a directory where the script process the files within. If not provided, it defaults to a directory within the systems temporary directory.
2024-10-17Extract the R/qtl2 bundle for processing.Frederick Muriuki Muriithi
To enable processing of the files individually, this commit will enable the extraction of the files into a known working directory in which all further processing will take place.
2024-10-17Extract common functions.Frederick Muriuki Muriithi
2024-10-14Initialise background script for running QC on phenotype bundles.Frederick Muriuki Muriithi
2024-10-14BugFix: Use provided prefixFrederick Muriuki Muriithi
Use the provided prefix rather than calling `jobs.jobsnamespace()` function that depends of an app context existing.
2024-10-14Make addition of arguments independent of each other.Frederick Muriuki Muriithi
2024-08-16Log out correct parameters.Frederick Muriuki Muriithi
2024-08-13Bug: cross reference with NULL cM when "gmap" file is absentFrederick Muriuki Muriithi
The "gmap" file might not exist in some bundles. In those instances, cross-reference the data without including the genotypes' physical positions (cM).
2024-08-08Fix bugs and pass in logger to functions.Frederick Muriuki Muriithi
2024-08-06Pass logger on to inner functionsFrederick Muriuki Muriithi
Pass the logger forward to inner functions to help with debugging things.
2024-07-25Rename module: qc_app --> uploaderFrederick Muriuki Muriithi
2024-07-05bug: Return a hashable key, not a dict.Frederick Muriuki Muriithi
2024-07-02Call correct method.Frederick Muriuki Muriithi
2024-07-02Ensure no duplicated values for the query.Frederick Muriuki Muriithi
2024-04-03Reduce size of data inserted per queryFrederick Muriuki Muriithi
Reduce the size of data inserted per query since MariDB allows a packet with a maximum size of 1GB. This should hopefully resolve the …OperationalError: (2006, 'Server has gone away') error.
2024-02-15Filter out NULL valuesFrederick Muriuki Muriithi
2024-02-05Fix linting and type errors.Frederick Muriuki Muriithi
2024-01-23Update scripts to use redis prefix.Frederick Muriuki Muriithi
2024-01-22scripts: Pass in logger to get detailed updates.Frederick Muriuki Muriithi
2024-01-19scripts: Process R/qtl2 bundleFrederick Muriuki Muriithi
Build script to start the processing of the R/qtl2 bundle.
2024-01-16Filter out items with values of `None`.Frederick Muriuki Muriithi
The ProbeSetData table does not allow `NULL` for the `value` column for good reason - if there is no value, just don't have a record for that particular sample.