OHSU/VA B6D2F2 Brain mRNA M430A PDNN Database (March/04 Freeze)
Summary
Accession number: GN33
This March 2004 data freeze provides estimate of mRNA expression in adult brains of F2 intercross mice (C57BL/6J x DBA/2J F2) measured using Affymetrix M430A microarrays. Data were generated at The Oregon Health Sciences University (OHSU) in Portland, Oregon, by John Belknap and Robert Hitzemann. Data were processed using the Position-Dependent Nearest Neighbor (PDNN) method developed by Zhang and colleagues (2003. To simplify comparison between transforms, PDNN values of each array were adjusted to an average of 8 units and a variance of 2 units.
This data set was run as a single large batch with careful consideration to balancing samples by sex, age, and environment.
About the cases used to generate this set of data:
Fifty-six B6D2F2 samples, each taken from a single brain hemisphere from an individual mouse, were assayed using 56 M430A Affymetrix short oligomer microarrays. [The remaining hemisphere will be used later for an anaysis of specific brain regions.] Each array ID (see table below) includes a three letter code; the first letter usually denotes sex of the case (note that we have made a few corrections and there are therefore several sex-discordant IDs), the second letter denotes the hemisphere (R or L), and the third letter is the mouse number within each cell. The F2 mice were experimentally naive, born within a 3-day period from second litters of each dam, and housed at weaning (20- to 24-days-of-age) in like-sex groups of 3 to 4 mice for females and 2 to 3 mice for males in standard mouse shoebox cages within Thoren racks. All 56 F2 mice were killed at 77 to 79 days-of-age by cervical dislocation on December 17, 2003. The brains were immediately split at the midline and then quickly frozen on dry ice. The brains were stored for about two weeks at -80 degrees C until further use.
The F2 was derived as follows: C57BL/6J (B6) and DBA/2J (D2) breeders were obtained from The Jackson Laboratory, and two generations later their progeny were crossed to produce B6D2F1 and D2B6F1 hybrid at the Portland VA Veterinary Medical Unit (AAALAC approved). The reciprocal F1s were mated to create an F2 population with both progenitor X and Y chromosomes about equally represented.
About the tissue used to generate these data:
Brain samples were from 31 male and 25 females and between 28 right and 28 left hemispheres distributed with good balance across the two sexes. The tissue arrayed included the forebrain, midbrain, one olfactory bulb, the cerebellum; and the rostral part of the medulla. The medulla was trimmed transversely at the caudal aspect of the cerebellum. The sagittal cut was made from a dorsal to ventral direction. (Note that several of the other brain transcriptome databases do not include olfactory bulb or cerebellum.) Total RNA was isolated with TRIZOL Reagent (Life Technologies Inc.) using a modification of the single-step acid guanidinium isothiocyanate phenol-chloroform extraction method according to the manufacturer’s protocol. The extracted RNA was then purified using RNeasy (Qiagen, Inc.). RNA samples were evaluated by UV spectroscopy for purity; only samples with an A260/280 ratio greater than 1.8 were used. RNA quality was monitored by visualization on an ethidium bromide-stained denaturing formaldehyde agarose gel. Samples containing at least 10 micrograms of total RNA were sent to the OHSU Gene Microarray Shared Resource facility for analysis. The procedures used at the facility precisely follow the manufacturer’s specifications. Details can be found at http://www.ohsu.edu/gmsr/amc. Following labeling, all samples were hybridized to the GeneChip Test3 array for quality control. If target performance did not meet recommended thresholds, the sample would have been discarded. All labeled samples passed the threshold and were hybridized to the 430A array.
About the arrays:
All 56 430A arrays used in this project were purchased at one time and had the same Affymetrix lot number.
The table below lists the arrays by Case ID, Array ID, Side, Cage ID and Sex.
Order
|
CaseID
|
ArrayID
|
Side
|
CageID
|
Sex
|
1 | 20 | FL10 | L | H1 | F |
2 | 2 | FL11 | L | H2 | F |
3 | 5 | FL12 | L | H3 | F |
4 | 63 | FL13 | L | H4 | F |
5 | 6 | FL14 | L | K2 | F |
6 | 10 | FL15 | L | Q2 | F |
7 | 52 | FL2 | L | E1 | F |
8 | 53 | FL3 | L | E2 | F |
9 | 42 | FL4 | L | E3 | F |
10 | 31 | FL5 | L | E4 | F |
11 | 14 | FL6 | L | F1 | M |
12 | 48 | FL7 | L | F2 | F |
13 | 60 | FL8 | L | F3 | M |
14 | 54 | FL9 | L | F4 | F |
15 | 35 | FR10 | R | K3 | F |
16 | 11 | FR11 | R | O1 | F |
17 | 21 | FR12 | R | O2 | F |
18 | 23 | FR13 | R | Q1 | F |
19 | 15 | FR14 | R | Q3 | F |
20 | 4 | FR15 | R | Q4 | F |
21 | 41 | FR2 | R | A2 | F |
22 | 44 | FR3 | R | A3 | F |
23 | 37 | FR4 | R | C1 | F |
24 | 8 | FR5 | R | C2 | F |
25 | 19 | FR6 | R | C3 | F |
26 | 40 | FR7 | R | C4 | F |
27 | 62 | FR8 | R | D2 | M |
28 | 39 | FR9 | R | D3 | F |
29 | 13 | ML1 | L | B1 | M |
30 | 22 | ML10 | L | L2 | M |
31 | 38 | ML11 | L | L4 | M |
32 | 43 | ML12 | L | M1 | M |
33 | 58 | ML13 | L | N2 | M |
34 | 7 | ML14 | L | R1 | M |
35 | 30 | ML15 | L | R3 | M |
36 | 46 | ML3 | L | G1 | M |
37 | 57 | ML4 | L | G2 | M |
38 | 51 | ML5 | L | I1 | M |
39 | 27 | ML6 | L | I2 | M |
40 | 50 | ML7 | L | J2 | M |
41 | 16 | FL1 | L | O2 | M |
42 | 3 | ML9 | L | L1 | M |
43 | 47 | MR10 | R | R2 | M |
44 | 56 | MR11 | R | S1 | M |
45 | 1 | MR12 | R | S2 | M |
46 | 55 | MR13 | R | T1 | M |
47 | 34 | MR14 | R | U1 | M |
48 | 25 | MR15 | R | U2 | M |
49 | 59 | MR2 | R | J1 | M |
50 | 32 | MR3 | R | M2 | M |
51 | 24 | MR4 | R | M3 | M |
52 | 12 | MR5 | R | M4 | M |
53 | 9 | MR6 | R | N1 | M |
54 | 36 | MR7 | R | N3 | M |
55 | 28 | MR8 | R | P1 | M |
56 | 33 | MR9 | R | P2 | M |
|
About the marker set:
The 56 mice were each genotyped at 309 MIT microsatellite markers distributed across the genome, including the Y chromosome. The genotyping error check routine (Lincoln and Lander, 1992) implemented within R/qtl (Broman et al., 2003) showed no likely errors at p <.01 probability. Initial genotypes were generated at OHSU. Approximately 200 genotypes were generated at UTHSC by Jing Gu and Shuhua Qi.
About data processing:
Probe (cell) level data from the CEL file: These CEL values produced by GCOS are the 75% quantiles from a set of 91 pixel values per cell. Probe values were processed as follows:
- Step 1: We added an offset of 1 to the CEL expression values for each cell to ensure that all values could be logged without generating negative values.
- Step 2: We took the log2 of each probe signal.
- Step 3: We computed the Z score for each signal within array.
- Step 4: We multiplied all Z scores by 2.
- Step 5: We added 8 to the value of all Z scores. The consequence of this simple set of transformations is to produce a set of Z scores that have a mean of 8, a variance of 4, and a standard deviation of 2. The advantage of this modified Z score is that a two-fold difference in expression level corresponds approximately to a 1 unit difference.
- Step 6: We computed the arithmetic mean of the values for the set of microarrays for each of the individual strains.
Probe set data: The uncorrected, untransformed CEL files were subject to probe (low) level processing using both the RMA (Robust Multiarray Average; Irizarry et al. 2003) and PDNN (Position Dependent Nearest Neighbor; Zhang et al. 2003) methods because these two performed the best of four methods tested in a recent four inbred strain comparison using the M430A chip on whole brain samples (Hitzemann et al, submitted). RMA was implemented by the Affy package (11/24/03 version) within Bioconductor (http://www.bioconductor.org) and PDNN by the PerfectMatch v. 2.1 program from Li Zhang (PDNN ). For sake of comparison with other data sets, MAS 5 files have also been generated.
To better compare data sets, the same simple steps (1 through 6 above) were applied to PDNN and RMA values. Every microarray data set therefore has a mean expression of 8 units with a standard deviation of 2 units. A 1-unit difference therefore represents roughly a 2-fold difference in expression level. Expression levels below 5 are usually close to background noise levels.
About the chromosome and megabase position values:
The chromosomal locations of M430A and M430B probe sets were determined by BLAT analysis of concatenated probe sequences using the Mouse Genome Sequencing Consortium May 2004 (mm5) assembly. This BLAT analysis is performed periodically by Yanhua Qu as each new build of the mouse genome is released. We thank Yan Cui (UTHSC) for allowing us to use his Linux cluster to perform this analysis. It is possible to confirm the BLAT alignment results yourself simply by clicking on the Verify link in the Trait Data and Editing Form (right side of the Location line).
Data source acknowledgment:
This project was supported by two Department of Veterans Affairs Merit Review Awards (to JK Belknap and R Hitzemann, respectively), AA10760 (Portland Alcohol Research Center), AA06243, AA13484, AA11034, DA05228 and MH51372.
Please contact either John Belknap or Robert Hitzemann at the Dept. of Behavioral Neuroscience, Oregon Health & Science University (L470), or Research Service (R&D5), Portland VA Medical Ctr., Portland, OR 97239 USA.
References:
Hitzemann, R, McWeeney, S, Harrington, S, Malmanger, B, Lawler, M, Belknap, JK (2004) Brain gene expression among four inbred mouse strains: The development of an analysis strategy for the integration of QTL and gene expression data. Submitted.
Irizarry, RA, Bolstad, BM, Collin, F, Cope, LM, Hobbs, B, Speed, TP (2003) Summaries of Affymetrix GeneChip probe level data. Nuc Acids Res 31:1-15.
Lincoln, SE, Lander, ES (1992) Systematic detection of errors in genetic linkage data. Genomics 14:604-610.
Zhang, L, Miles, MF, Aldape, KD (2003) A model of molecular interactions on short oligonucleotide microarrays. Nat Biotech 21:818-821.
Information about this text file:
This text file was originally generated by John Belknap, March 2004. Updated by RWW, October 31, 2004.
|