HBP/Rosen Striatum M430v2 (Nov04) RMA
Accession number: GN52
Summary:
This November 2004 data freeze provides estimates of mRNA expression
in the striatum (caudate nucleus of the forebrain) of BXD recombinant inbred mice measured using
Affymetrix Mouse Genome 430 2.0 short oligomer microarrays. Data were generated at Beth Israel Deaconess Medical Center (BIDMC, Boston MA) by Glenn D. Rosen with the support of a Human Brain Project (HBP) grant. Approximately 125 brain samples from 24 strains were used in this initial experiment. Data were processed using the Robust Multiarray (RMA) transform (a Bioconductor module). To simplify comparison among Nov04 data sets, values of each array have been log2 transformed and adjusted to an average expression of 8 units.
About the cases used to generate this set of data:
This data set includes estimate of gene expression for 24 genetically uniform lines of mice: C57BL/6J (B6, or simply B), DBA/2J (D2 or D), and 22 BXD recombinant inbred (RI) strains derived by crossing female B6 mice with male D2 mice and then inbreeding progeny for over 21 generations. This set of RI strains is a remarkable resource because these strains have been extensively phenotyped for hundreds of interesting traits over a 25-year period (see the WebQTL BXD Published Phenotypes database). A significant advantage of this RI set is that both parental strains (B6 and D2) have been extensively sequenced and are known to differ at approximately 1.8 million SNPs. Coding variants (mostly single nucleotide polymorphisms and insertion-deletions) that may produce interesting phenotypes can be rapidly identified in this particular RI set.
BXD1 through BXD32 were produced by Benjamin A. Taylor starting in the late 1970s. BXD33 through BXD42 were also produced by Taylor, but from a second set of crosses initiated in the early 1990s. These strains are all available from the Jackson Laboratory, Bar Harbor, Maine.
The table below lists the arrays
by strain, sex, and age. Each array was hybridized to a pool of mRNA
from 3 to 4 mice.
Select the strain name in the table above to review details about the specific cases and to view the array quality control image processed using the PerfectMatch program by Li Zhang.
About the samples used to generate these data:
Animals were obtained from The Jackson Laboratory and housed for several weeks at BIDMC until they reached ~2 months of age (range from 55 to 62 days). Mice were killed by cervical dislocation and brains were removed and placed in RNAlater for 5 to 10 minutes prior to dissection. Cerebella and olfactory bulbs were removed; brains were hemisected, and both striata were dissected using a medial approach by Rosen that typically yields 5 to 7 mg of tissue per side. The purity of this dissection has been validated by an analysis of acetylcholinestase activity. A pool of dissected tissue from 3 or 4 adults (approximately 25-30 mg of tissue from 6 striata) of the same strain, sex, and age was collected in one session and used to generate cRNA samples. Rought 90 to 95% of all cells in the striatum are medium spiny neurons (Gerfen, 1992, for a review of the structure and function of the neostriatum).
mRNA processing: We used the Amersham Biosciences cRNA synthesis kit protocol.
About the array platfrom :
Affymetrix Mouse Genome 430 2.0 array: The 430v2 array consists of 992936 useful 25-nucleotide probes that estimate the expression of approximately 39,000 transcripts (many are actually duplicates). The array sequences were selected late in 2002 using Unigene Build 107. The array nominally contains the same probe sequence as the 430A and B series. However, we have found that roughy 75000 probes differ from those on A and B arrays.
About data processing:
Affymetrix CEL files obtained from the BIDMC Genomics Core were processed as follows.
- Step 1: We added an offset of 1 to the CEL expression values
for each cell to ensure that all values could be logged without
generating negative values.
- Step 2: We took the log base 2 of each cell.
- Step 3: We computed the Z score for each cell.
- Step 4: We multiplied all Z scores by 2.
- Step 5: We added 8 to the value of all Z scores. The consequence
of this simple set of transformations is to produce a set of Z scores
that have a mean of 8, a variance of 4, and a standard deviation of
2. The advantage of this modified Z score is that a two-fold
difference in expression level corresponds approximately to a 1 unit
difference.
- Step 6: We plotted these modified Z score probe level expression estimates in DataDesk. Male-female scatterplots of the probe signals were compared strain by strain to highlight poor array data sets. A total of 36 arrays passed this stringent quality control step.
- Step 7: We computed the arithmetic mean of the values for the set
of microarrays for each of the individual strains. We have not corrected for background beyond the
background correction implemented by Affymetrix in generating the
CEL file.
Probe set data from the CHP file: The expression values were
generated using RMA. The same simple steps described above
were also applied to these values. Every microarray data set
therefore has a mean expression of 8 with a standard deviation of 2.
A 1 unit difference therefor represents roughly a two-fold difference
in expression level. Expression levels below 5 are usually close to
background noise levels.
Data source acknowledgment:
Data were generated with funds to Glenn Rosen from P20
MH62009 (see below for specifics). Samples and arrays were processed by the
Genomics Core at Beth Israel Deaconess Medical Center by Towia Libermann and colleagues.
About this test file:
This text file originally generated by GDR, RWW, and YHQ Nov 2004. Updated by RWW Nov 17, 2004
|