Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
cDNA Microarrays and some of their applications to Clinical Medicine Pascale F. Macgregor, Ph. D. LMP 1019S April 12, 2002 Outline of the lecture • What are microarrays? **Manufacturing **Different types of microarrays **Possible applications **Data analysis • Application of the microarray technology to the study of ovarian cancer • Combination with cytogenetics • Laser Capture Microdissection • Validation: tissue arrays, RT-PCR, ISH, IHC ©2002 University Health Network/Ontario Microarray Network What are Microarrays? • Microarrays are simply small glass or silicon slides upon the surface of which are arrayed thousands of genes (usually between 50020,000) • Via a conventional DNA hybridization process, the level of expression/activity of those genes is measured • Data are read using laser-activated fluorescence readers • The process is “ultra-high throughput” ©2002 University Health Network/Ontario Microarray Network Glass slide microarray of thousands of genes for evaluation as new biomarkers for cancer prognostic assessment ©2002 University Health Network/Ontario Microarray Network Yeast genome: 12,800 points Diameter: 120 microns Slide size: 170 mm x 340 mm ©2002 University Health Network/Ontario Microarray Network Why analyze so many genes? • <10% of the human genome has been studied at the level of gene function. 40,000 odd genes represent the pool of remaining drug targets. • Patterns/clusters of expression are more predictive than looking at one or two prognostic markers. • Increased accuracy/confidence. • Unbiased. Empiric. Holistic. Independent of “flawed” hypotheses ©2002 University Health Network/Ontario Microarray Network Microarray technologies There are two primary microarray technologies which are somewhat complimentary: 1. On-chip oligonucleotide synthesis: e.g. Affymetrix® Gene Chips 2. Direct DNA deposition: “Pat Brown” approach (oligos or gene fragments - Expressed Sequence Tags, “ESTs”) ©2002 University Health Network/Ontario Microarray Network Affymetrix Gene Chip TM • Produced by synthesizing tens of thousands of short oligonucleotides in situ onto glass wafers • 16-20 nucleotides representing each gene on the array • Each oligo on the chip is matched with an almost identical one, differing only by one single base mismatch • Comparison of target intensity between the 2 partners oligonucleotides • Measure of the absolute level of expression of genes ©2002 University Health Network/Ontario Microarray Network Affymetrix Gene Chip TM Nature Genetics 21:2-24 1999 ©2002 University Health Network/Ontario Microarray Network On-chip oligonucleotide synthesis Nature Genetics 21:5-9 1999 ©2002 University Health Network/Ontario Microarray Network Spotted cDNA microarrays Advantages • Lower price and flexibility • Simultaneous comparison of two related biological samples (tumor versus normal, treated cells versus untreated cells) • ESTs allow discovery of new genes Disadvantages • Needs sequence verification • Measures the relative level of expression between 2 samples ©2002 University Health Network/Ontario Microarray Network steel Spotted arrays spotting pin chemically modified slides 384 well source plate 1 nanolitre spots 90-120 um diameter ©2002 University Health Network/Ontario Microarray Network ©2002 University Health Network/Ontario Microarray Network The 6 steps of a microarray experiment (1-3) • 1- Manufacturing of the microarray: clone collection acquisition (+ sequencing), PCR amplification and insert verification, spotting, QC. • 2- Experimental design and choice of reference: what to compare to what? • 3- Target preparation (labeling) and hybridization ©2002 University Health Network/Ontario Microarray Network The 6 steps of a microarray experiment (4-6) • 4- Image acquisition (scanning) and quantification (signal intensity to numbers) • 5- Database building, filtering and normalization • 6- Statistical analysis and data mining ©2002 University Health Network/Ontario Microarray Network From Macgregor and Squire, Clinical Chemistry, 2002 ©2002 University Health Network/Ontario Microarray Network ©2002 University Health Network/Ontario Microarray Network ©2002 University Health Network/Ontario Microarray Network Cy3 Cy5 ©2002 University Health Network/Ontario Microarray Network Cy-3 Cy-5 ©2002 University Health Network/Ontario Microarray Network Gene D Overexpressed in normal tissue Gene E Overexpressed in tumour • Biomarkers of prognosis • Genes affecting Treatment Response ©2002 University Health Network/Ontario Microarray Network Microarray Applications (some) • Can identify new genes implicated in disease progression and treatment response (90% of our genes have yet to be ascribed a function) • Can assess side-effects or drug reaction profiles • Can extract prognostic information, e.g. classify tumours based on hundreds of parameters rather than 2 or 3. • Can detect gene copy number changes in cancer (array CGH) • Can identify new drug targets and accelerate drug discovery and testing • ??? ©2002 University Health Network/Ontario Microarray Network The challenges of microarrays • Acquisition of high quality clinical samples, tumor and normal tissues • High Quality RNA • Experimental design: what to compare to what? • Data analysis -1: what to do with the data? • Data analysis -2: How do to it? – Very large number of data points – Size of data files – Choice of data analysis strategy/algorithm/software ©2002 University Health Network/Ontario Microarray Network Experimental Design • Choice of reference: Common (nonbiologically relevant) reference, or paired samples? • Number of replicates: how many are needed? (How many are affordable...?). Are the replicate results going to be averaged or treated independently? • Dye switches? • Choice of data base: where and how to store the data? ©2002 University Health Network/Ontario Microarray Network What is a “dye switch”: – One slide with experimental sample labeled with Cy5, and reference sample labeled with Cy3 (“straight”). – Replicate slide with experimental sample labeled with Cy3, and reference sample labeled with Cy5 (“switch”) ©2002 University Health Network/Ontario Microarray Network Data Pre-processing • Filtering: background subtraction? Low intensity spots? Saturated spots? Low quality spots (ghosts spots, dust spots etc). • Filtering or flagging? • Outliers? ©2002 University Health Network/Ontario Microarray Network • Data Pre-processing : Normalization – Housekeeping genes/ control genes – Intensity dependent (most commonly used): global intensity or global ratio, calculates a single normalization factor – Intensity independent (LOWESS – Locally Weighted Scatter plot Smoother) calculates a function – Global array or Sub-array http://stat-www.berkeley.edu/users/terry ©2002 University Health Network/Ontario Microarray Network Microarray data analysis 101 • Scatter plots of intensities of tumor samples versus normal samples: quick look at the changers and overall quality of microarray • Supervised versus unsupervised analysis – Clustering: organization of genes that are similar to each other and samples that are similar to each other using clustering algorithms – Statistical analysis: how significant are the results? ©2002 University Health Network/Ontario Microarray Network UP log/log scatter plot DOWN ©2002 University Health Network/Ontario Microarray Network 2 dimensional hierarchical (“Eisen”) clustering (Eisen et al, PNAS (1998), 95, p. 14863) • Unsupervised: no assumption on samples • The algorithm successively joins gene expression profiles to form a dendrogram based on their pair-wise similarities. • Two-dimensional hierarchical clustering first reorders genes and then reorders tumors based on similarities of gene expression between samples. ©2002 University Health Network/Ontario Microarray Network Two dimensional hierarchical (“Eisen”) Clustering From: Dhanasekaran et al. Nature, 421, p.822. ©2002 University Health Network/Ontario Microarray Network Significance Analysis of Microarrays (SAM) • Supervised learning software for genomic expression data mining • Developed at Stanford University, based on the paper of Tusher et al PNAS (2001) 98, p. 5116. ©2002 University Health Network/Ontario Microarray Network What SAM does: • SAM assigns a score to each genes on the basis of the change in gene expression relative to the standard deviation of repeated measurements. • For genes with scores above a certain threshold (set by the user), SAM uses permutations of the repeated measurements to estimate the % of genes identified by chance = the false discovery rate (FDR). ©2002 University Health Network/Ontario Microarray Network SAM example: ovarian tumor versus normal ovary Significant: 210 Delta 0.80000 SAM Plot Median # false significant: 6.15764 Fold Change 1.50000 12 UP 10 8 6 4 2 0 -4 -3 -2 -1 0 1 2 -2 -4 -6 DOWN -8 -10 ©2002 University Health Network/Ontario Microarray Network 3 4 Every cell in the body has the same genetic constitution. In cancer cells there is usually acquired aberrant DNA Breast Ovary Prostate Development of “cancer genomics” to better understand the molecular basis of acquired genetic change ©2002 University Health Network/Ontario Microarray Network TEAM WORK in ovarian cancer at the UHN • Clinicians / Pathologists / Basic Scientists / Computer Scientists • Tissue bank (tumor tissue and normal ovary tissue) of ~ 500 samples • New patients treated at Princess Margaret’s Hospital for ovarian cancer (~100/ year) • Microarray Centre / Basic Research labs ©2002 University Health Network/Ontario Microarray Network Epithelial Ovarian Cancer • First cause of death by gynecological cancer • 5th cause of death by cancer for women • Early Diagnostic (20%): 5 year survival rate is 80% • Late diagnostic (80%): 5 year survival rate is 10-15% • Treatment: Carboplatin + Taxol • Recurrence after treatment in 80% of cases ©2002 University Health Network/Ontario Microarray Network Urgent need for: • Improved detection • Better tumor classification • Better evaluation of response to currently used and experimental chemotherapy • New therapeutic targets • New treatments ©2002 University Health Network/Ontario Microarray Network My lab’s approach: • Combine the powers of chromosome/DNA analysis (cytogenetics/genomic) and RNA expression (cDNA microarrays) to facilitate the identification of smaller subsets of genes pertinent to epithelial ovarian cancer. ©2002 University Health Network/Ontario Microarray Network Microarray study design • All microarrays experiments were done on 19.2 K arrays, in duplicate, with “dye switches”. • 15 serous epithelial ovarian tumors, 7 normal ovaries • Stratagene Universal Reference RNA was used as an internal standard in all experiments. • 52 successful microarrays were hybridized and analyzed, for a total of 2 million data points. ©2002 University Health Network/Ontario Microarray Network Intensities scatter plot for normal sample Stratagene Reference: Ambion normal ovary Normalized Normal ovary #1 versus normal ovary #2 Cy5 intensity (Ambion normal) 100000 10000 1000 100 Tumor ovary versus normal ovary #1 10 10 100 1000 10000 100000 Cy3 intensity (Stratagene normal) Intesities scatter plot for tumor sample OCA21B Reference: Ambion normal ovary Normalized Cancer Research 2002 100000 Cy5 intensity (Ambion normal) From Bayani et al, 10000 1000 100 10 10 100 1000 ©2002 University Health Cy3 intensity (OCA21B) Network/Ontario Microarray Network 10000 100000 Combining supervised with unsupervised strategies From Macgregor and Squire, Clinical Chemistry, 2002 ©2002 University Health Network/Ontario Microarray Network Conclusions of microarray analysis of ovarian samples (1) • Normal ovary samples clustered separately from tumor samples and early EOC clustered as a group. • Separation between early EOC and late EOC less evident, but a group of late EOCs clustered together. ©2002 University Health Network/Ontario Microarray Network Conclusions (2) • Several new up-regulated genes identified in regions of chromosomal gains – Cis-platinum resistance related protein: Is differential gene expression between patients a predictor of treatment response? ©2002 University Health Network/Ontario Microarray Network Going further… • Narrowing down the number of candidate genes by combining approaches: RNA and DNA / chromosome analysis • Are the genes that are overexpressed in regions of gene copy number gains? ©2002 University Health Network/Ontario Microarray Network Chromosome CGH ©2002 University Health Network/Ontario Microarray Network SKY Method • Uses chromosome ‘paints’ labelled with combinations of fluorochromes so that unique spectral signal will identify chromosome region involved in aberration ©2002 University Health Network/Ontario Microarray Network CGH and SKY Analysis of Ovarian Patient 13A (OCA13A) From Bayani et al, Cancer Research 2002 ©2002 University Health Network/Ontario Microarray Network Tumor heterogeneity challenge **Some cancers, such as prostate cancer, are multifocal / heterogeneous. **For these tumors, “bulk” extraction of genetic material from tumor tissue will produce microarray results that are “contaminated” by normal or premalignant tissue. ©2002 University Health Network/Ontario Microarray Network Laser Capture Microdissection (LCM) ©2002 University Health Network/Ontario Microarray Network Before LCM After LCM Microdissected PIN (prostatic intraepithelial neoplasia) ©2002 University Health Network/Ontario Microarray Network Microarray data validation • • • • • cDNA microarrays: one patient - 20,000 genes Tissue arrays: one gene -1000 patients RT-PCR Immunohistochemistry (IHC) In situ Hybridization (ISH) • Cancer profiling arrays: one gene - 10 tumor/normal sample pairs for different tumors ©2002 University Health Network/Ontario Microarray Network Tissue Arrays Monni et al, Seminars in Cancer Biology, (2001), 11, p.395 ©2002 University Health Network/Ontario Microarray Network Northern Blot, Tissue Arrays a. Northern blot b. Tissue array c. IHC, anti-hepsin antibody (1:benign2:cancer) X 100 d. X 400 Dhanasekaran et al, Nature, (2001) 421, p.822. ©2002 University Health Network/Ontario Microarray Network cDNA array, Northern, ISH, IHC Mousses et al, Cancer Research (2002), 62, p. 1256 ©2002 University Health Network/Ontario Microarray Network Cancer Profiling Array (Clontech) Wiechen et al, American Journal of Pathology, (2001), 159, p.1635 ©2002 University Health Network/Ontario Microarray Network Acknowledgements • Dr. Jim Woodgett, Director,UHN microarray Centre and Head of the Division of Experimental Therapeutics, OCI • Dr. Jeremy Squire, Senior Scientist, OCI • Drs J. Murphy, B. Rosen and P. Shaw, PMH/TGH • Ms. Monique Albert, M. Sc., Dan Grisaru, M.D., Ph. D., and Ms. Sabrina Allegro • Mr. Jason Gonçalves, OCI ©2002 University Health Network/Ontario Microarray Network