U74Av2 MAS5 Database (September/03 Freeze)
Accession number: GN13
About the mice used to map microarray data:
The set of animals used for mapping (a mapping panel) consists of 30 groups of genetically uniform mice of the BXD type. The parental strains are C57BL/6J (B6 or B) and DBA/2J (D2 or D). The first generation hybrid is labeled F1. The F1 hybrids were made by crossing B6 females to D2 males.
All other lines are recombinant inbred strains derived from C57BL/6J and DBA/2J crosses. BXD2 through BXD32 were produced by Dr. Benjamin Taylor starting in the late 1970s. BXD33 through BXD42 were also produced by Dr. Taylor, but they were generated in the 1990s. Lines BXD67 and BXD68 are two partially inbred advanced recombinant strains (F8 and F9) that are part of a large set of BXD-Advanced strains being produced by Drs. Robert Williams, Lu Lu, Guomin Zhou, Lee Silver, and Jeremy Peirce. There will eventually be 45 of these strains. For additional background on recombinant inbred strains, please see http://www.nervenet.org/papers/bxn.html.
The table below lists the arrays by strain, sex, and age. Each array was hybridized to a pool of mRNA from 3 mice.
Strain
|
Age
|
Strain
|
Age
|
8 Wks
|
20 Wks
|
52 Wks
|
8 Wks
|
20 Wks
|
52 Wks
|
C57BL/6J (B6) |
♂♂♂ |
♀ |
♀ |
DBA/2J (D2) |
♀ |
♂♂♀ |
|
B6D2F1 (F1) |
♀ ♀ |
♀ |
|
BXD1 |
♀♀ |
|
♀ |
BXD2 |
♂ |
♀ |
♀ |
BXD5 |
♂♂♀ |
|
|
BXD6 |
♀ |
|
|
BXD8 |
♀ |
♂♀ |
|
BXD9 |
♂ |
♀ |
♀ |
BXD11 |
♀♀ |
|
♀ |
BXD12 |
|
♂ |
♀ |
BXD13 |
♀ |
|
|
BXD14 |
|
♀♀ |
♀ |
BXD15 |
♀ |
|
♀ |
BXD16 |
♀ |
♀ |
|
BXD18 |
♀ |
♂ |
♀ |
BXD19 |
♀ |
♀ |
♀ |
BXD21 |
♀♀ |
♂♂ |
|
BXD22 |
♀ |
♀♀ |
|
BXD23 |
♀ |
|
|
BXD24 |
♀♀ |
|
♀ |
BXD25 |
♀♀ |
♀♀ |
|
BXD27 |
|
|
♀♀ |
BXD28 |
♀ |
♀ |
♀ |
BXD29 |
♂ |
|
♀ |
BXD31 |
♀♀ |
♀♀ |
|
BXD32 |
♀ |
♂♀ |
♀ |
BXD33 |
♂♀ |
♀ |
|
BXD34 |
♂♀ |
♀ |
|
BXD38 |
♂♀♀ |
|
|
BXD39 |
♂♀ |
♂ |
|
BXD40 |
♂♂♀ |
|
|
BXD42 |
♂♂ ♀ |
|
|
BXD67 |
♀ ♀ |
|
|
BXD68 |
♀ ♀ |
♂ |
|
|
|
|
|
|
About the tissue used to generate these data:
Most expression data are averages based on three microarrays (U74Av2). Each individual array experiment involved a pool of brain tissue (forebrain plus the midbrain, but without the olfactory bulb) that was taken from three adult animals usually of the same age. A total of 83 arrays were used: 67 were female pools and 16 were male pools. Animals ranged in age from 56 to 441 days, usually with a balanced design (one pool at 8 weeks, one pool at ~20 weeks, one pool at approximately 1 year).
About data processing:
Probe set data from the .TXT file: These .TXT files
were generated using the MAS 5.0.
- Step 1: We added an offset of 1.0 to the .TXT expression values
for each cell to ensure that all values could be logged without
generating negative values.
- Step 2: We took the log base 2 of each cell.
- Step 3: We computed the Z-score for each cell.
- Step 4: We multiplied all Z scores by 2.
- Step 5: We added 8 to the value of all Z-scores. The consequence
of this simple set of transformations is to produce a set of Z-scores
that have a mean of 8, a variance of 4, and a standard deviation
of 2. The advantage of this modified Z-score is that a two-fold
difference in expression level corresponds approximately to a 1
unit difference.
- Step 6: We computed the arithmetic mean of the values for the
set of microarrays for each of the individual strains.
Every microarray data set therefore has a mean expression of 8 with
a standard deviation of 2. A 1-unit difference therefor represents
roughly a two-fold difference in expression level. Expression levels
below 5 are usually close to background noise levels.
About the chromosome and megabase position values:
The chromosomal locations of probe sets and gene markers were determined by BLAT analysis using the Mouse Genome Sequencing Consortium Oct 2003 (mm4) Assembly (see http://genome.ucsc.edu/cgi-bin/hgBlat?command=start&org=mouse). We thank Dr. Yan Cui (UTHSC) for allowing us to use his Linux cluster to perform this analysis.
About the array probe set names:
In addition to the _at (anti-sense target) and _st (sense target) probe set name designations, there are other designations that reflect special characteristics of a particular probe set based on probe design and selection crieteria. These designaions are listed below.
Probe set name designations
_f_at (sequence family): Includes probes that target identical and/or slightly polymorphic regions of different transcripts.
_s_at (similarity constraint): Probes all target common sequences found in multiple transcripts.
_g_at (common groups): Some of the probes target identical sequences and some target unique sequences regions .
_r_at (rules dropped): "Designates sequences for which it was not possible to pick a full set of unique probes using Affymetrix' probe selection rules. Probes were picked after dropping some of the selection rules."
_i_at (incomplete): "Designates sequences for which there are fewer than the required numbers of unique probes specified in the design."
Most of the descriptions for the probe set ID extensions above were taken from the Affymetrix GeneChip Expression Analysis Fundamentals.
Data source acknowledgment:
Data were generated with funds to RWW from the Dunavant Chair of
Excellence, University of Tennessee Health Science Center, Department
of Pediatrics. The majority of arrays were processed at Genome Explorations by Dr. Divyen Patel.
|