Table IVa: Position-dependent pentamers: Clustered pentamers

Pentamers and/or reverse complements

Phase Pentamer Non-uniform distribution Number of sequence elements in the interval
    Kolmogorov-Smirnov test
p-value <
      -600 to -551 -550 to -501 -500 to -451 -450 to -401 -400 to -351 -350 to -301 -300 to -251 -250 to -201 -200 to -151 -150 to -101 -100 to -51 -50 to -1 totals
Early G1 ATATA/TATAT 0.001 16 22 22 22 29 32 26 17 50 44 47 23 350
GGCCA/TGGCC 0.001 2 5 7 8 4 6 3 1 0 0 0 0 36
Late G1 ACGCG/CGCGT 0.001 5 9 7 16 20 14 20 21 70 84 15 7 288
CGCGA/TCGCG 0.010 6 5 3 8 20 16 10 13 24 28 6 3 142
AACGC/GCGTT 0.001 8 11 8 15 13 15 12 17 30 50 19 5 203
GCGAA/TTCGC 0.010 12 7 13 11 19 9 14 15 24 24 16 10 174
G2, Late G1 CAGCC/GGCTG 0.005 13 15 20 9 12 9 10 6 7 5 5 3 114

Clicking on a pentamer retrieves a list of the genes whose upstream regions contain one or more copies of the pentamer in the specified intervals.

We performed a Kolmogorov-Smirnov test on each pentamer listed in Table II. Shown here are those hexamers whose p-value from this test is � 0.05. The dataset on which we performed the test is listed in the left column. We also counted the number of upstream regions in that dataset which contain one or more copies of the hexamer over the 50 nt intervals shown. The interval(s) with the highest numbers is in bold. Intervals are measured by the distance from the ATG start codon.