Age | Commit message (Collapse) | Author |
|
`read_datavalues(…)` function returns a dict of the form:
```
{
ProbeSetName01: ({…}, …),
ProbeSetName02: ({…}, …),
︙
}
```
Previously, the generator would thus try to index into the keys of the
datavalues, which were strings, leading to an error.
This commit changes the generator to return the values of the
datavalues dict as a flattened list of values.
|
|
|
|
Provide a better error message that can help the user figure out what
they did wrong and fix it for themselves instead of the older, cryptic
and extremely unhelpful error message.
|
|
|
|
Fix the wrong assumption that the ID key will always be
"ProbeSetID". The key for the ID column could change from study to
study or dataset to dataset.
|
|
Fix a bug that was causing the data insertion to fail.
|
|
|
|
Log out any DB errors that occur to help with tracking and debugging
failures.
|
|
|
|
Insert the ProbeSets first before inserting the average data and
cross-referencing it.
|
|
|
|
|
|
* Avoid overwriting existing data by adding a random string to the
name of the ProbeSet
* Read ProbeSet names separately from the means for the strains, to
avoid entering the same ProbeSet record over and over with varying
names.
|
|
The "ProbeSetId" value in the file is actually stored in the database
in the `ProbeSet.Name` field, whereas the `ProbeSetXRef` table
actually uses the `ProbeSet.Id` value to define the
relationships. This commit fixes that issue.
|
|
|
|
|
|
|
|
* Display the status of the job, as it is running
* Display STDERR output if an error occurs
* Display STDOUT output as job is running and on successful completion
of the job
|
|
|
|
|
|
* quality_control/file_utils.py (new file): extract common file
utilities.
* quality_control/parsing.py (refactor): extract common file
utilities.
* scripts/insert_data.py: Implement data insertion for averages/means
|