| 
INIA Brain mRNA M430 (January06) PDNN   
 Accession number: GN101
     Summary: 
RECOMMENDED, HIGHLY SELECTIVE DATA SET: This January 2006 data freeze provides estimates of mRNA expression in adult forebrain and midbrain from 43 lines of mice including C57BL/6J, DBA/2J, reciprocal F1 hybrids, and 39 BXD recombinant inbred strains. Data were generated at UTHSC and the University of Memphis with support from grants from the NIAAA Integrative Neuroscience Initiative on Alcoholism (INIA). Samples were hybridized in small pools (n = 3) to a total of 121 Affymetrix M430A and B array pairs. This data set only includes a high quality subset of 76 arrays. Arrays were processed using the PDNN method of Zhang and colleagues. To simplify comparison among transforms, PDNN values of each array were adjusted to an average of 8 units and a standard deviation of 2 units. A total of 583 probe sets have LRS values above 50.
     About the cases used to generate this set of data: 
We have used a set of BXD recombinant inbred strains generated by crossing C57BL/6J (B6 or B) with DBA/2J (D2 or D). The BXDs are particularly useful for systems genetics because both parental strains have been sequenced (8x coverage of B6 and 1.5x coverage of D). Physical maps in WebQTL incorporate approximately 2 million B vs D SNPs from Celera Genomics and from the Perlegen-NIEHS sequencing effort. BXD2 through BXD32 were bred by Benjamin A. Taylor starting in the late 1970s. BXD33 through 42 were bred by Taylor in the 1990s. These strains are available from The Jackson Laboratory. BXD43 through BXD99 were bred by Lu Lu, Jeremy Peirce, Lee M. Silver, and Robert W. Williams in the late 1990s and early 2000s using advanced intercross progeny (Peirce et al. 2004). Many of the 50 new BXD strains are available from Lu Lu and colleagues.
 All stock was obtained originally from The Jackson Laboratory between 1998 and 2003. Most BXD animals were born and housed at the University of Tennessee Health Science Center. Some cases were bred at the University of Memphis (Douglas Matthews) or the University of Alabama (John Mountz and Hui-Chen Hsu).
     About the tissue used to generate this set of data: The INIA M430 brain Database (January06) consists of 76 Affymetrix 430A and 430B microarray pairs. Each pair was hybridized in sequence (A array first, B array second) with a pool of brain tissue (forebrain minus olfactory bulb, plus the entire midbrain) taken from three adult animals of closely matched age and the same sex. RNA was extracted at UTHSC by Lu Lu, Zhiping Jia, and Hongtao Zhai at UTHSC.
Tissue preparation protocol. Animal were killed by rapid cervical dislocation. Eyes were removed immediately and placed in RNAlater at room temperature. Usually six eyes from animals with a common sex, age, and strain were stored in a single tube. The body was sprayed lightly with 70% ethanol to wet the hair. the following standard approach was used to extract the brain:
 
At this point the protocol divides. If tissue is to be saved for RNA extraction at a later time, the whole brain is placed directly in RNAlater (Ambion, Inc.) and treated per the manufacturer’s directions.  Step 7 is still very important because RNAlater may not fully penetrate the forebrain if the lobes are not separated. If tissue is to be used for immediate RNA extraction, one lobe of the forebrain is removed for processing and the rest of the brain is stored in RNAlater.Using small surgical scissors make an incision under the skin on the dorsal side of the neck. Cut the skin overlying the skull close to the midsagittal plane towards the nose. Pull and reflect the skin to expose the entire dorsal skull.
Slip the points of the scissors through into the cisterna magna just caudal to the cerebellum and gently enlarge this opening until is it possible to cut through the skull overlying the cerebellum.
Cut rostrally through the skull along the midsagittal line almost all the way to the nasal opening, taking care not to damage the dorsal surface of the brain.
Approximately midway along this incision, make a lateral cut. Repeat along the incision and peel back the resulting strips of skull.
Using small forceps, free the olfactory bulbs rostrally and ventrally, taking care to retain their connection to the rest of the forebrain.
Gently lift the brain from the base the skull starting from the olfactory bulbs, pulling the brain toward a nearly vertical position. Cut the optic and trigeminal nerves. Separate the brain from the spinal cord about 2 mm distal to the medulla.
Spread the hemispheres of the forebrain gently with forceps and then cut from dorsal to ventral using a straight scalpel, separating the hemispheres from each other (but not from the cerebellum). Take care to retain both paraflocculi.
 
Dissecting and preparing forebrain and midbrain for RNA extraction
 
Remove the left or right hemisphere of the forebrain and midbrain (referred to here as the forebrain for simplicity), either fresh or preserved in RNAlater by cutting from the caudal border of the inferior colliculus on the dorsal side and extending the cut ventrally to the basis pedunculi and the pons (cut just rostral of the pons) on the ventral side. See steps 7 and 8 here
Place tissue for RNA extraction in RNA STAT-60 (Tel-Test Inc.) and process per manufacturer's instructions (in brief form below).
Store RNA in 75% ethanol at -80 deg. C until use.
 Total RNA was extracted with RNA STAT-60 (Tel-Test) according to the manufacturer’s instructions. Briefly we:
 
homogenize tissue samples in the RNA STAT-60 (1 ml/50 to 100 mg tissue)
allowed the homogenate to stand for 5 min at room temperature
added 0.2 ml of chloroform per 1 ml RNA STAT-60
shook the sample vigorously for 15 sec and let the sample sit at room temperature for 3 min
centrifuged at 12,000 G for 15 min
transfered the aqueous phase to a fresh tube
added 0.5 ml of isopropanol per 1 ml RNA STAT-60
vortexed and allowed sample to stand at room temperature for 5-10 min
centrifugeed at 12,000 G for 10-15 min
removed the supernatant and washed the RNA pellet with 75% ethanol
stored the pellet in 75% ethanol at -80 deg C until use
 Sample Processing. Samples were processed in the INIA Bioanalytical Core at the W. Harry Feinstone Center of Excellence, The University of Memphis, lead by Thomas R. Sutter. All processing steps were performed by Shirlean Goodwin. In brief, samples were quality control checked for RNA purity using 260/280 ratios (samples had to be greater than 1.8, but the majority were 1.9 or higher). RNA integrity was assessed using the Agilent Bioanalyzer 2100. We required an RNA integrity number (RIN) of greater than 8, based on the intensity ratio and amplitude of 18S and 28S rRNA signals. The standard Eberwine T7 polymerase method was used to catalyze the synthesis of cDNA template from polyA-tailed RNA using Superscript II RT (Invitrogen Inc.).  The Enzo LIfe Sciences, Inc., BioArray High Yield RNA Transcript Labeling Kit (T7, Part No. 42655) was used to synthesize labeled cRNA. At this point the cRNA was evaluated again using both the 260/280 ratio (values of 2.0 or above are acceptable) and the Bioanalyzer output (a dark cRNA smear on the 2100 output centered roughly between 600 and 2000 nt is required). Those samples that passed both QC steps (10% usually fail) were then sheared using a fragment buffer included in the Affymetrix GeneChip Sample Cleanup Module (Part No.900371). After fragmentation, samples were either stored at -80 deg. centrigrade until use or were immediately injected onto the array.
 Replication and Sample Balance: Our goal was to obtain data for independent biological sample pools from at least one of sample from each sex for all BXD strains. While we achieved this goal technically, not all of the replicates were of sufficient quality to be included in this highly selected set. This data set is now complete and includes more than 20 replicates. Despite the lack of replicates for about 20 strains we still recommend this data set strongly over earliers data sets that included more arrays, many of which are suboptimal.
 Batch Structure: Before running the first batch of 30 pairs of array (dated Jan04), we ran four test samples (Nov03). The main batch of 30 includes the four test samples (four technical replicates). The Nov03 data was combined with the Jan04 data and was treated as a single batch that consists of one male and one female pool from C57BL/6J, DBA/2J, the B6D2F1 hybrid, 11 female BXD samples, and 11 male BXD samples. The second large batch was run February 2005 (Feb05) and consists of 71 pairs of arrays. Two more batches were run; the final in December 2005 (16 arrays pairs). Batch effects were corrected at the individual probe level as described below.
 The table below summarizes information on strain, sex, age, sample name, batch result date, the grouping to which an arrays data set belongs based on expression similarity, and source of mice.
 
| 
| Id | Strain | Sex | Age | Sample name | Batch | Fixed Batch | Source |  | 1 | B6D2F1 | F | 127 | R0919F1 | 2 | e_2 | UTM JB |  | 2 | B6D2F1 | F | 127 | R0919F2 | 2 | e_2 | UTM JB |  | 3 | B6D2F1 | F | 64 | R1053F1 | 3 | g_3 | UTM RW |  | 4 | B6D2F1 | F | 64 | R1053F1 | 3 | e_3 | UTM RW |  | 5 | B6D2F1 | M | 66 | R1057F1 | 3 | e_3 | UTM RW |  | 6 | D2B6F1 | F | 57 | R1066F1 | 3 | e_3 | UTM RW |  | 7 | C57BL/6J | F | 65 | R0903F1 | 1 | se_1 | UTM RW |  | 8 | C57BL/6J | F | 65 | R0903F1 | 2 | e_2 | UTM RW |  | 9 | C57BL/6J | M | 66 | R0906F1 | 1 | e_1 | UTM RW |  | 10 | C57BL/6J | M | 76 | R0997F1 | 3 | g_3 | UTM RW |  | 11 | DBA/2J | F | 60 | R0917F1 | 1 | e_1 | UTM RW |  | 12 | DBA/2J | F | 64 | R1123F1 | 3 | g_3 | UTM RW |  | 13 | DBA/2J | M | 60 | R0918F1 | 2 | sgA_2 | UTM RW |  | 14 | DBA/2J | M | 73 | R1009F1 | 3 | w_3 | UTM RW |  | 15 | BXD1 | M | 181 | R0956F1 | 3 | e_3 | UTM JB |  | 16 | BXD2 | F | 142 | R0907F1 | 3 | e_3 | UAB |  | 17 | BXD5 | F | 56 | R0744F1 | 3 | o_3 | UMemphis |  | 18 | BXD5 | M | 71 | R0728F1 | 2 | e_2 | UMemphis |  | 19 | BXD6 | F | 57 | R1711F1 | 3 | g_3 | JAX |  | 20 | BXD8 | M | 71 | R2664F1 | 4 | se_4 | JAX |  | 21 | BXD11 | F | 97 | R0745F1 | 3 | gA_3 | UAB |  | 22 | BXD12 | F | 64 | R0896F1 | 3 | o_3 | UMemphis |  | 23 | BXD12 | M | 64 | R0897F1 | 2 | e_2 | UMemphis |  | 24 | BXD13 | F | 86 | R0748F1 | 2 | e_2 | UMemphis |  | 25 | BXD13 | F | 86 | R0730F1 | 3 | e_3 | UMemphis |  | 26 | BXD13 | M | 76 | R0929F1 | 3 | e_3 | UMemphis |  | 27 | BXD14 | M | 68 | R1051F1 | 3 | e_3 | UTM RW |  | 28 | BXD15 | F | 80 | R0928F1 | 3 | e_3 | UMemphis |  | 29 | BXD18 | F | 108 | R0771F1 | 2 | e_2 | UAB |  | 30 | BXD19 | M | 157 | R1229F1 | 3 | gA_3 | UTM JB |  | 31 | BXD21 | F | 67 | R0740F1 | 3 | gA_3 | UAB |  | 32 | BXD23 | F | 88 | R0815F1 | 3 | gA_3 | UAB |  | 33 | BXD23 | F | 66 | R1035F1 | 3 | gA_3 | UTM RW |  | 34 | BXD23 | M | 66 | R1256F1 | 4 | e_4 | UTM RW |  | 35 | BXD23 | M | 66 | R1037F1 | 3 | gA_3 | UTM RW |  | 36 | BXD24 | F | 71 | R0914F1 | 3 | e_3 | UMemphis |  | 37 | BXD24 | M | 71 | R0913F1 | 2 | e_2 | UMemphis |  | 38 | BXD25 | F | 74 | R0373F1 | 2 | e_2 | UTM RW |  | 39 | BXD25 | M | 58 | R2623F1 | 4 | e_4 | UTM RW |  | 40 | BXD27 | M | 54 | R2660F1 | 4 | e_4 | UTM RW |  | 41 | BXD28 | F | 113 | R0892F1 | 3 | e_3 | UTM RW |  | 42 | BXD28 | M | 79 | R0911F1 | 3 | g_3 | UMemphis |  | 43 | BXD31 | M | 61 | R1141F1 | 3 | e_3 | UTM RW |  | 44 | BXD32 | F | 93 | R0898F1 | 2 | e_2 | UAB |  | 45 | BXD32 | F | 76 | R1214F1 | 3 | w_3 | UMemphis |  | 46 | BXD32 | M | 76 | R1217F2 | 4 | e_4 | UMemphis |  | 47 | BXD32 | M | 65 | R1478F1 | 3 | e_3 | UMemphis |  | 48 | BXD34 | M | 72 | R0916F1 | 2 | e_2 | UMemphis |  | 49 | BXD34 | F | 92 | R0900F1 | 3 | e_3 | UMemphis |  | 50 | BXD36 | F | 79 | R2654F1 | 4 | e_4 | UTM RW |  | 51 | BXD36 | F | 61 | R1145F1 | 3 | e_3 | UTM RW |  | 52 | BXD36 | M | 77 | R0926F1 | 2 | e_2 | UMemphis |  | 53 | BXD38 | F | 69 | R0729F1 | 3 | e_3 | UMemphis |  | 54 | BXD38 | F | 83 | R1208F1 | 3 | g_3 | UMemphis |  | 55 | BXD39 | F | 76 | R1712F1 | 3 | e_3 | JAX |  | 56 | BXD39 | M | 71 | R0602F1 | 3 | w_3 | UAB |  | 57 | BXD40 | F | 184 | R0741F1 | 3 | e_3 | UAB |  | 58 | BXD40 | M | 56 | R0894F1 | 3 | e_3 | UMemphis |  | 59 | BXD42 | F | 100 | R0742F1 | 3 | e_3 | UAB |  | 60 | BXD43 | F | 61 | R1199F1 | 3 | e_3 | UTM RW |  | 61 | BXD43 | F | 59 | R0980F1 | 4 | e_4 | UTM RW |  | 62 | BXD44 | M | 58 | R1072F1 | 3 | e_3 | UTM RW |  | 63 | BXD45 | F | 58 | R1398F1 | 3 | o_3 | UTM RW |  | 64 | BXD45 | M | 81 | R1658F2 | 4 | e_4 | UTM RW |  | 65 | BXD48 | F | 59 | R0946F1 | 3 | e_3 | UTM RW |  | 66 | BXD51 | F | 63 | R1430F1 | 3 | e_3 | UTM RW |  | 67 | BXD51 | M | 65 | R1001F1 | 3 | e_3 | UTM RW |  | 68 | BXD60 | M | 59 | R1075F1 | 3 | g_3 | UTM RW |  | 69 | BXD62 | M | 58 | R1027F1 | 3 | e_3 | UTM RW |  | 70 | BXD69 | F | 60 | R1438F1 | 3 | e_3 | UTM RW |  | 71 | BXD69 | M | 64 | R1193F1 | 3 | o_3 | UTM RW |  | 72 | BXD73 | F | 60 | R1275F1 | 3 | e_3 | UTM RW |  | 73 | BXD73 | M | 76 | R1442F1 | 3 | g_3 | UTM RW |  | 74 | BXD77 | M | 61 | R1426F1 | 3 | g_3 | UTM RW |  | 75 | BXD87 | F | 89 | R1713F1 | 3 | e_3 | UTM RW |  | 76 | BXD90 | F | 71 | R2628F1 | 4 | e_4 | UTM RW |  | 77 | BXD90 | M | 61 | R1452F | 3 | g_3 | UTM RW |  | 78 | BXD92 | F | 58 | R1299F1 | 3 | e_3 | UTM RW |  |      About data access: 
Normalized data are available for this INIA data set at 
Jan 2006, PDNN normalization (17 Mb file with strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0106_PDNN.txt 
Jan 2006, RMA normalization (17 Mb file with strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0106_RMA.txt 
June 2006, QTL results from RMA normalized data (5.7 Mb, no strain means): ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_M_0606_RMA.txt 
All data in ZIP format: ftp://atlas.utmem.edu/Public/Mouse_bxd/INIA_mRNA_data_sets.zip 
     About the array platfrom : 
Affymetrix Mouse Genome 430A and 430B array pairs: The 430A and B array pairs collectively consist of 992936 25-nucleotide probes that estimate the expression of approximately 39,000 transcripts (many are essentially duplicates). The array sequences were selected late in 2002 using Unigene Build 107. The arrays nominally contain the same probe sequence as the 430 2.0 series. However, we have found that roughy 75000 probes differ between those on A and B arrays and those on the 430 2.0.     About data processing: Probe (cell) level data from the CEL file: These CEL values produced by GCOS are 75% quantiles from a set of 91 pixel values per cell.
Step 1: We added an offset of 1.0 unit to each cell signal to ensure that all values could be logged without generating negative values. We then computed the log base 2 of each cell.
Step 2: We performed a quantile normalization of the log base 2 values for the total set of 105 arrays (processed as two batches) using the same initial steps used by the RMA transform.
Step 3: We computed the Z scores for each cell value.
Step 4: We multiplied all Z scores by 2.
Step 5: We added 8 to the value of all Z scores. The consequence of this simple set of transformations is to produce a set of Z scores that have a mean of 8, a variance of 4, and a standard deviation of 2. The advantage of this modified Z score is that a two-fold difference in expression level corresponds approximately to a 1 unit difference.
Step 6: We eliminated much of the systematic technical variance introduced by the two batches (n = 34 and n = 71 array pairs) at the probe level. To do this we calculated the ratio of each batch mean to the mean of both batches and used this as a single multiplicative probe-specific batch correction factor. The consequence of this simple correction is that the mean probe signal value for each batch is the same.
Step 7a: The 430A and 430B arrays include a set of 100 shared probe sets (a total of 2200 probes) that have identical sequences. These probes and probe sets provide a way to calibrate expression of the 430A and 430B arrays to a common scale. To bring the two arrays into alignment, we regressed Z scores of the common set of probes to obtain a linear regression correction to rescale the 430B arrays to the 430A array. In our case this involved multiplying all 430B Z scores by the slope of the regression and adding or subtracting a small offset. The result of this step is that the mean of the 430A expression is fixed at a value of 8, whereas that of the 430B chip is typically reduced to 7. The average of the merged 430A and 430B array data set is approximately 7.5.
Step 7b: We recentered the merged 430A and 430B data sets to a mean of 8 and a standard deviation of 2. This involved reapplying Steps 3 through 5.
Step 8: Finally, we computed the arithmetic mean of the values for the set of microarrays for each strain. Technical replicates were averaged before computing the mean for independent biological samples. Note, that we have not (yet) corrected for variance introduced by differences in sex, age, source of animals, or any interaction terms. We have not corrected for background beyond the background correction implemented by Affymetrix in generating the CEL file. We eventually hope to add statistical controls and adjustments for some of these variables.
     Data source acknowledgment: 
Support for acquisition of microarray data were generously provided by the NIAAA and its INIA grant program to RWW, Thomas Sutter, and Daniel Goldowitz (U01AA013515, U01AA013499-03S1, U01AA013488, U01AA013503-03S1). Support for the continued development of the GeneNetwork and WebQTL was provided by a NIMH Human Brain Project grant (P20MH062009). All arrays were processed at the University of Memphis by Thomas Sutter and colleagues with support of the INIA Bioanalytical Core.
     Information about this text file: This text file originally generated by RWW, YHQ, and EJC, Oct 2004. Updated by RWW, Nov 5, 2004; April 7, 2005; RNA/tissue preparation protocol updatedby JLP, Sept 2, 2005; Sept 26, 2005; by RWW Jan 2006.
 |