summaryrefslogtreecommitdiff
path: root/topics
diff options
context:
space:
mode:
authorMunyoki Kilyungi2024-04-30 14:38:01 +0300
committerMunyoki Kilyungi2024-04-30 14:38:01 +0300
commit493954515b34888dee2622a8e6ae2dca47e7b4be (patch)
treec79458a71823aadb0917be97efe3e88dd3c9c993 /topics
parentdc829d9824cc95db6e9f93f948feca157261132a (diff)
downloadgn-gemtext-493954515b34888dee2622a8e6ae2dca47e7b4be.tar.gz
Minor updates.
Diffstat (limited to 'topics')
-rw-r--r--topics/gn-uploader/data-upload-validation.gmi14
1 files changed, 9 insertions, 5 deletions
diff --git a/topics/gn-uploader/data-upload-validation.gmi b/topics/gn-uploader/data-upload-validation.gmi
index 9c9596f..221cee5 100644
--- a/topics/gn-uploader/data-upload-validation.gmi
+++ b/topics/gn-uploader/data-upload-validation.gmi
@@ -6,11 +6,15 @@
* keywords: documentation, gn-uploader
The process of uploading data to the GeneNetwork platform began with retrieving the dataset from the Gene Network page:
+
=> https://genenetwork.org/show_trait?trait_id=24668&dataset=BXDPublish BXDPublish
-which comprised a phenotype matrix of the BXD mouse group identified by the phenotype ID: BXD_24668. This research paper associated with this dataset is titled "Novel pre-clinical model to identify genetic modifiers of triple negative breast cancer," accessible at => https://aacrjournals.org/cancerres/article/81/13_Supplement/2919/668784/Abstract-2919-Novel-pre-clinical-model-to-identify.
+which comprised a phenotype matrix of the BXD mouse group identified by the phenotype ID: BXD_24668. This research paper associated with this dataset is titled "Novel pre-clinical model to identify genetic modifiers of triple negative breast cancer," accessible at:
+
+=> https://aacrjournals.org/cancerres/article/81/13_Supplement/2919/668784/Abstract-2919-Novel-pre-clinical-model-to-identify.
+
+This dataset contained both "standard error" and "average" values. To begin upload of the data to the uploader at:
-This dataset contained both “standard error” and “average” values. To begin upload of the data to the uploader at:
=> https://staging-uploader.genenetwork.org/ Staging GN Uploader
I selected the suitable file type from the expression data options, whether average or standard error. Following this, I selected the species - mouse, platform - 54- Affymetrix Clariom S Array Mouse, created a new study - "UTHSC BXD Mice Group for Breast Cancer Dataset", group - BXD family, type - mammary gland mRNA and defined the dataset - "BXD Mice Group Study for Breast Cancer Dataset".
@@ -22,7 +26,7 @@ During the data upload process, I encountered several challenges that required s
Another error surfaced during the dataset upload process:
```
-"ERROR: no annotations found for platform 54 and dataset 1068. Quiting’."
+ERROR: no annotations found for platform 54 and dataset 1068. Quiting
```
@@ -34,7 +38,7 @@ The original format of the data was in an Excel file which contained 8 columns,
## Modifications
-* To clean up the data, I used Python to remove the metadata fields: "index", "status," "RRID," "epoch," and "SeqCvge." in order to remain with only the relevant fields "name", “value” and “SE”.
+* To clean up the data, I used Python to remove the metadata fields: "index", "status," "RRID," "epoch," and "SeqCvge." in order to remain with only the relevant fields "name", "value" and "SE".
* I ensured the first row in the matrix contains the headings to so that the dataset complies with the system requirements.
@@ -72,6 +76,6 @@ The specific code used for fetching data from the API endpoints was:
curl -k https://staging.genenetwork.org/api/v_pre1/datasets/bxd
```
-The “-k” option in this command permits insecure SSL connections, while the provided API endpoint “datasets/bxd’ fetches information about the specific BXD dataset.
+The "-k" option in this command permits insecure SSL connections, while the provided API endpoint “datasets/bxd’ fetches information about the specific BXD dataset.
The information retrieved by accessing the specified API endpoint included fields such as create time, ProbeFreeze Id, Id, public, confidentiality, full name, short name, long abbreviation, short abbreviation and data scale, confirming the data has been successfully uploaded.