posted on 2004-06-01, 00:00authored byClark Glymour, Daniel Handley, Nicoleta Serban, David Peters, Robert O'Doherty, Melvin Field, Larry Wasserman, Peter Spirtes, Richard Scheines
We present evidence of a potentially serious source of error intrinsic to all spotted cDNA microarrays that use IMAGE clones of expressed
sequence tags (ESTs). We found that a high proportion of these EST sequences contain 5'-end poly(dT) sequences that are remnants from the
oligo(dT)-primed reverse transcription of polyadenylated mRNA templates used to generate EST cDNA for sequence clone libraries. Analysis of expression data from two single-dye cDNA microarray experiments showed that ESTs whose sequences contain repeats of consecutive 5'-end dT residues appeared to be strongly coexpressed, while expression data of all other sequences exhibited no such pattern. Our analysis
suggests that expression data from sequences containing 5' poly(dT) tracts are more likely to be due to systematic cross-hybridization of these
poly(dT) tracts than to true mRNA coexpression. This indicates that existing data generated by cDNA microarrays containing IMAGE clone
ESTs should be filtered to remove expression data containing significant 5' poly(dT) tracts.