Exon CGH array design
The dataset come from arrays hybridized with DNA from subjects with epilepsy, autism, heart defects and mental disorders. Each experiment was performed on the 180 K exon targeted oligonucleotide array.
The design of the chip involved two stages. First, the prototype covering only exonic and microRNA regions was constructed. The main aim at this stage was to develop the array that allows detecting DNA copy number changes of the single exon. Therefore, it was postulated to cover each exon by the same number of oligos. For a given set of selected genes (including those related to epilepsy, autism, heart defects, mental disorders and other known pathologies) it was decided that each exon would be covered by approximately probes. All oligos were taken from M Agilent database of preselected probes. The distribution of probe coverage inside long exons, for which the number of available probes was greater than , was fixed as uniform. For short exons, with less than probes available, missing oligos was selected from surrounding region keeping the balance between proximity and symmetricity. Note, that oligo may overlap with each other at most in of its length. Figure 2 presents the example of probe coverage for short and long exons. Figure 3 shows the distribution of probes in exons and in exon neighborhood (bp).
Moreover, every second probe was taken from opposite strand to obtain better performance. The exemplary output of aCGH experiment for these two designs is presented in Figure 4.
|
Next step involved combining the prototype design with backbone, i.e. probes putted uniformly across the genome. Densely covered regions, exonic double covered regions were thinned with heuristic approach which considered previously assigned scores and uniformity of nascent coverage (sizes of introduced gaps).