aboutsummaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
8 daysCompute the data_link_ids from the request results.Frederick Muriuki Muriithi
8 daysFix bug in data for creating a new resource.Frederick Muriuki Muriithi
8 daysCompute request data for …/data/link/phenotypes endpoint.Frederick Muriuki Muriithi
8 daysAdd debug logging to track process.Frederick Muriuki Muriithi
8 daysFix return: Don't use `RETURNING *`.Frederick Muriuki Muriithi
Fetching the results of the insert query with `RETURNING *` does not work as expected with MySQLdb. This commit, thus, returns the explicitly computed values instead.
12 daysBreak up long lines.Frederick Muriuki Muriithi
12 days`update_auth()` scaffoldingFrederick Muriuki Muriithi
Add untested scaffolding for how the `update_auth()` function will work. Include notes to help with figuring the feature out.
12 daysRequest token for use with script to update authorisationsFrederick Muriuki Muriithi
After the script has completed loading the data into the database, the user who uploads the data will need to access the data they've uploaded. This commit requests a new token to use with the script for updating the authorisations as appropriate, to grant the user access to their data.
12 daysOnly update authorisation after all data is entered.Frederick Muriuki Muriithi
Pull the auth update function out of the database connection contextmanager to prevent any problems with an auth update from causing a rollback of all the data. We can always manually update the authorisation later, therefore, we do not want a failure in that causing the (potentially) time-consuming data entry process from being rolled back.
2025-06-03Add more logging to track progress.Frederick Muriuki Muriithi
2025-06-03Save data with `LOAD DATA INFILE …` queryFrederick Muriuki Muriithi
To help speed up the saving of the data (for really huge files) into the database, use the `LOAD DATA INFILE …` command if available, and if not fallback to the one using raw queries.
2025-06-03Extract building of data items into helper function.Frederick Muriuki Muriithi
2025-06-03Increase number of rows per query.Frederick Muriuki Muriithi
2025-06-03Debug the query: useful to figure out what's happening.Frederick Muriuki Muriithi
2025-06-03Memory saving: Don't reload saved dataFrederick Muriuki Muriithi
We already have all the data in memory, so we do not need to reload it and keep yet another copy.
2025-06-02Compute data in files lazily.Frederick Muriuki Muriithi
2025-06-02Process NA-strings in the worker script.Frederick Muriuki Muriithi
2025-06-02Add note in documentation.Frederick Muriuki Muriithi
2025-06-02Revert "Convert N/A values to NoneType objects."Frederick Muriuki Muriithi
This reverts commit 6870c08d484f482cc2f2501d28b474636dd0810d. The "read_csv_file" should return the data in the CSV file, as is, before processing the N/A values.
2025-06-02Update placeholder functionFrederick Muriuki Muriithi
We'll probably provide a token to use to update the authorisation for the new data. The commit also explicitly raises a `NotImplement` exception for now.
2025-06-02Add missing import.Frederick Muriuki Muriithi
2025-06-02Fix bug: Compute data correctly. Replace N/A strings with NoneType.Frederick Muriuki Muriithi
2025-06-02Add some debugging statements.Frederick Muriuki Muriithi
2025-06-02Parametrize the "DataId" column for different tables.Frederick Muriuki Muriithi
2025-06-02Explicitly query for newly entered data.Frederick Muriuki Muriithi
The `cursor.executemany()` call only returned the last row that was inserted, rather than ALL the rows. This is not the correct thing to do, therefore, this commit fixes that by explicitly querying for all the newly entered data.
2025-06-02Convert N/A values to NoneType objects.Frederick Muriuki Muriithi
2025-06-02Pass the phenotypes name-to-id map to functions.Frederick Muriuki Muriithi
2025-06-02Compute the map from a phenotype's name to its ID.Frederick Muriuki Muriithi
2025-05-30Refactor: Extract common pattern into separate functionFrederick Muriuki Muriithi
Extract the common pattern into a separate, more generalized function and pass the new function the data it requires to perform its tasks for the different file types.
2025-05-30Add typing to load_data function and invoke it.Frederick Muriuki Muriithi
2025-05-30Save standard errors and counts.Frederick Muriuki Muriithi
2025-05-30Get the species, population and dataset.Frederick Muriuki Muriithi
2025-05-30Add more tables to lock list.Frederick Muriuki Muriithi
2025-05-30Fix query: Add missing field and parameter.Frederick Muriuki Muriithi
2025-05-30Fix function names.Frederick Muriuki Muriithi
2025-05-30Fix processing of numerical data to save to the database.Frederick Muriuki Muriithi
2025-05-30Fix file paths: append the path to the extraction directory.Frederick Muriuki Muriithi
2025-05-30Add new file type.Frederick Muriuki Muriithi
2025-05-30Add dataset_id and bundle path to job metadata.Frederick Muriuki Muriithi
2025-05-30Fix typo: Fix table name.Frederick Muriuki Muriithi
2025-05-30Bug: Loop until the data is exhausted.Frederick Muriuki Muriithi
2025-05-30Fix typo, and bug.Frederick Muriuki Muriithi
2025-05-30Fix indentation.Frederick Muriuki Muriithi
2025-05-30Refactor out common `__row_to_dataitems__` function.Frederick Muriuki Muriithi
2025-05-30Use full table names rather than aliasesFrederick Muriuki Muriithi
Using aliases leads to errors when you have to use table locking to prevent data corruption. This commit updates queries to use the full table names rather than aliases, in order to prevent such troubles.
2025-05-28Bug: Fix actual data value, not result set.Frederick Muriuki Muriithi
2025-05-28Bug: Add missing commas.Frederick Muriuki Muriithi
2025-05-28Maintain single list of tables — build logs and query from listFrederick Muriuki Muriithi
Maintain a list of tables to lock when running the script and build the logging messages, and lock query from the list for easier maintenance.
2025-05-28Add missing import.Frederick Muriuki Muriithi
2025-05-28Pass path to bundle extraction dir, and make paths relative to it.Frederick Muriuki Muriithi
Pass the path where the bundle was extracted to the function doing the work. Make the paths for the files being used be relative to the extraction directory thus passed.