GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:36:16 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs: 1
Total sequence length: 448
Minimum sequence length: 448
Maximum sequence length: 448
Length distribution (number of sequences of specified length):
  < 100: 0
  < 200: 0
  < 300: 0
  < 400: 0
  < 500: 1
  < 600: 0
  < 700: 0
  < 800: 0
  < 900: 0
  < 1000: 0
  >=1000: 0

________________________________________________________________________________
Sequence 1: 24337GAATTCGGTAATACCTAAGATAGCAGCCAAACGTCTTGAGCTACTTGGCTATAGGTCCTT, from 1 to 109681, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 1
EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

********************************************************************************
EST sequence 1 +strand (File: gi+)

  1 TAAGCGTAAA AGTGGAAAGC ATCAGAAGCA ATCTTCATCC AGATAGCGGT CTGATTCGGA
 61 AGAAGATTCT GGTGAAGAGA ATAACGGAAG AAAATCTCAC CATCAGAAAA CGTCAGGGAC
121 TCATGACAGA CACTATGAAA GACCAAGGTC AGATTTAGAA GATGAGTCAA AAGGAAGAGA
181 GAGTCGTGAT AGGCACTATG AGAAACGAAG GTCAGAACTA GATGATGGGC ACAAAAGAAG
241 AGAAAGTCAA GATAAGAGAC GAAGGTCACA TATTGATGAT GAACCAAAAA GAAGAGATGC
301 TCGACCGAAT GAGAAATATC GAAATCGCTC CCCTAAAGGC GGTGTGGAAA GGGAAAATCT
361 TAAGAGTTAT GGTCAAGAGG ATAAAAAGAG GAGAGCCGAG GATTTAGACA GTGGAAAACC
421 CAATGAATAC CAGAATAGAC CCCCGAAA

Predicted gene structure (within gDNA segment 107217 to 108490):

 Exon 1 107517 107753 ( 237 n); cDNA 1 237 ( 237 n); score: 0.996
 Intron 1 107754 107942 ( 189 n); Pd: 0.000 (s: 1.00), Pa: 0.001 (s: 0.98)
 Exon 2 107943 108153 ( 211 n); cDNA 238 448 ( 211 n); score: 0.972

MATCH 24337GAATTCGGTAATACCTAAGATAGCAGCCAAACGTCTTGAGCTACTTGGCTATAGGTCCTT+ gi+ 0.984 448 1.000 C
PGS_24337GAATTCGGTAATACCTAAGATAGCAGCCAAACGTCTTGAGCTACTTGGCTATAGGTCCTT+_gi+ (107517 107753,107943 108153)

Alignment (genomic DNA sequence = upper lines):

TAAGCGTAAA AGTGGAAAGC ATCAGAAGCA ATCTTCATCC AGACAGCGGT CTGATTCGGA 107576
|||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| ||||||||||
TAAGCGTAAA AGTGGAAAGC ATCAGAAGCA ATCTTCATCC AGATAGCGGT CTGATTCGGA 60

AGAAGATTCT GGTGAAGAGA ATAACGGAAG AAAATCTCAC CATCAGAAAA CGTCAGGGAC 107636
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AGAAGATTCT GGTGAAGAGA ATAACGGAAG AAAATCTCAC CATCAGAAAA CGTCAGGGAC 120

TCATGACAGA CACTATGAAA GACCAAGGTC AGATTTAGAA GATGAGTCAA AAGGAAGAGA 107696
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TCATGACAGA CACTATGAAA GACCAAGGTC AGATTTAGAA GATGAGTCAA AAGGAAGAGA 180

GAGTCGTGAT AGGCACTATG AGAAACGAAG GTCAGAACTA GATGATGGGC ACAAAAGAAG 107756
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||
GAGTCGTGAT AGGCACTATG AGAAACGAAG GTCAGAACTA GATGATGGGC ACAAAAG... 237

AGAGAGACAT GATACGCACT ATGAGAGACG AAGGTCAGAA ATGGATGATG AGTCAAAAAG 107816
.......... .......... .......... .......... .......... .......... 237

AAGAGAAAGT AGGGATAATC ACTATGAGAG ACGAAGGTCA GATTTGGATG ATGAGTCCAA 107876
.......... .......... .......... .......... .......... .......... 237

AAGAAGAGAA AGTCATGATA AGCACTTTGA GAGACAAAGG TCAGATTTGG ATGATGAGTA 107936
.......... .......... .......... .......... .......... .......... 237

CAAAAGAAGA GAAAGTCAAG ATAAGAGACG AAGGTCAGAT ATTGATGATG AACCAAAAAG 107996
      |||| |||||||||| |||||||||| ||||||| || |||||||||| ||||||||||
......AAGA GAAAGTCAAG ATAAGAGACG AAGGTCACAT ATTGATGATG AACCAAAAAG 291

AAGAGATGCT CGACCGAATG AGAAATATCG AAATCGCTCC CCTAAAGGCG GTGTGGAAAG 108056
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AAGAGATGCT CGACCGAATG AGAAATATCG AAATCGCTCC CCTAAAGGCG GTGTGGAAAG 351

GGAAAATCTT AAGAGTTATG GTCAAGAGGA TAAAAAGAGG AAAGCAGAGG ATTTAGACAG 108116
|||||||||| |||||||||| |||||||||| |||||||||| | ||| |||| ||||||||||
GGAAAATCTT AAGAGTTATG GTCAAGAGGA TAAAAAGAGG AGAGCCGAGG ATTTAGACAG 411

TGGAAAACCG AATGAATACC AGAATAGACG CCGGAAA 108153
|||||||||  |||||||||| |||||||||  || ||||
TGGAAAACCC AATGAATACC AGAATAGACC CCCGAAA 448

Total number of EST alignments reported: 1

________________________________________________________________________________
Predicted gene locations (1):

PGL 1 (+ strand): 107517 108153
AGS-1 (107517 107753,107943 108153)
SCR (e 0.996 d 0.000 a 0.001,e 0.972)

 Exon 1 107517 107753 ( 237 n); score: 0.996
 Intron 1 107754 107942 ( 189 n); Pd: 0.000 Pa: 0.001
 Exon 2 107943 108153 ( 211 n); score: 0.972

PGS (107517 107753,107943 108153) gi+

3-phase translation of AGS-1 (+strand):

          .         .         .         .         .         .
107517 TAAGCGTAAAAGTGGAAAGCATCAGAAGCAATCTTCATCCAGACAGCGGTCTGATTCGGA
        -  A  -  K  W  K  A  S  E  A  I  F  I  Q  T  A  V  -  F  G
         K  R  K  S  G  K  H  Q  K  Q  S  S  S  R  Q  R  S  D  S  E
          S  V  K  V  E  S  I  R  S  N  L  H  P  D  S  G  L  I  R

          .         .         .         .         .         .
107577 AGAAGATTCTGGTGAAGAGAATAACGGAAGAAAATCTCACCATCAGAAAACGTCAGGGAC
        R  R  F  W  -  R  E  -  R  K  K  I  S  P  S  E  N  V  R  D
         E  D  S  G  E  E  N  N  G  R  K  S  H  H  Q  K  T  S  G  T
       K  K  I  L  V  K  R  I  T  E  E  N  L  T  I  R  K  R  Q  G

          .         .         .         .         .         .
107637 TCATGACAGACACTATGAAAGACCAAGGTCAGATTTAGAAGATGAGTCAAAAGGAAGAGA
        S  -  Q  T  L  -  K  T  K  V  R  F  R  R  -  V  K  R  K  R
         H  D  R  H  Y  E  R  P  R  S  D  L  E  D  E  S  K  G  R  E
       L  M  T  D  T  M  K  D  Q  G  Q  I  -  K  M  S  Q  K  E  E

          .         .         .         .         .         .    :
107697 GAGTCGTGATAGGCACTATGAGAAACGAAGGTCAGAACTAGATGATGGGCACAAAAG : AAG
        E  S  -  -  A  L  -  E  T  K  V  R  T  R  -  W  A  Q  K  :  K
         S  R  D  R  H  Y  E  K  R  R  S  E  L  D  D  G  H  K  R :   R
       R  V  V  I  G  T  M  R  N  E  G  Q  N  -  M  M  G  T  K   : E

           .         .         .         .         .         .
107946 AGAAAGTCAAGATAAGAGACGAAGGTCAGATATTGATGATGAACCAAAAAGAAGAGATGC
        R  K  S  R  -  E  T  K  V  R  Y  -  -  -  T  K  K  K  R  C
         E  S  Q  D  K  R  R  R  S  D  I  D  D  E  P  K  R  R  D  A
       E  K  V  K  I  R  D  E  G  Q  I  L  M  M  N  Q  K  E  E  M

           .         .         .         .         .         .
108006 TCGACCGAATGAGAAATATCGAAATCGCTCCCCTAAAGGCGGTGTGGAAAGGGAAAATCT
        S  T  E  -  E  I  S  K  S  L  P  -  R  R  C  G  K  G  K  S
         R  P  N  E  K  Y  R  N  R  S  P  K  G  G  V  E  R  E  N  L
       L  D  R  M  R  N  I  E  I  A  P  L  K  A  V  W  K  G  K  I

           .         .         .         .         .         .
108066 TAAGAGTTATGGTCAAGAGGATAAAAAGAGGAAAGCAGAGGATTTAGACAGTGGAAAACC
        -  E  L  W  S  R  G  -  K  E  E  S  R  G  F  R  Q  W  K  T
         K  S  Y  G  Q  E  D  K  K  R  K  A  E  D  L  D  S  G  K  P
       L  R  V  M  V  K  R  I  K  R  G  K  Q  R  I  -  T  V  E  N

           .         .         .
108126 GAATGAATACCAGAATAGACGCCGGAAA
        E  -  I  P  E  -  T  P  E
         N  E  Y  Q  N  R  R  R  K
       R  M  N  T  R  I  D  A  G

Maximal non-overlapping open reading frames (>= 64 codons):

>24337GAATTCGGTAATACCTAAGATAGCAGCCAAACGTCTTGAGCTACTTGGCTATAGGTCCTT+_PGL-1_AGS-1_PPS_1 (107518 107753,107943 108153) (frame '2'; 447 bp, 149 residues)
  1 KRKSGKHQKQ SSSRQRSDSE EDSGEENNGR KSHHQKTSGT HDRHYERPRS DLEDESKGRE
 61 SRDRHYEKRR SELDDGHKRR ESQDKRRRSD IDDEPKRRDA RPNEKYRNRS PKGGVERENL
121 KSYGQEDKKR KAEDLDSGKP NEYQNRRRK
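
The exon and intron coordinates reported above can be cross-checked mechanically. The following is a minimal sketch, not part of GeneSeqer: the regular expression and the helper names `parse_exons` and `introns_between` are assumptions made for illustration, inferred only from the `Exon` lines shown in this report. It reads each `Exon` line, confirms that `end - start + 1` equals the printed length, and derives the intron coordinates from the gaps between consecutive exons.

```python
import re

# Assumed pattern, inferred from lines such as
# "Exon 1 107517 107753 ( 237 n); ..." in this report.
EXON_RE = re.compile(r"Exon\s+(\d+)\s+(\d+)\s+(\d+)\s+\(\s*(\d+)\s+n\)")


def parse_exons(text):
    """Return (number, start, end) for every Exon line, checking its length."""
    exons = []
    for match in EXON_RE.finditer(text):
        number, start, end, length = map(int, match.groups())
        if end - start + 1 != length:
            raise ValueError(f"Exon {number}: coordinates disagree with length")
        exons.append((number, start, end))
    return exons


def introns_between(exons):
    """Derive (start, end) of each intron from consecutive exon pairs."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, _, prev_end), (_, next_start, _) in zip(exons, exons[1:])
    ]


# The three gene-structure lines from the report above.
report = """
Exon 1 107517 107753 ( 237 n); cDNA 1 237 ( 237 n); score: 0.996
Intron 1 107754 107942 ( 189 n); Pd: 0.000 (s: 1.00), Pa: 0.001 (s: 0.98)
Exon 2 107943 108153 ( 211 n); cDNA 238 448 ( 211 n); score: 0.972
"""

exons = parse_exons(report)
introns = introns_between(exons)
spliced_length = sum(end - start + 1 for _, start, end in exons)
print(exons, introns, spliced_length)
```

For this report the derived intron matches the printed `Intron 1` line (107754 to 107942, 189 n), and the spliced exon length equals the 448-nucleotide EST. Note that the same exon lines appear a second time under "Predicted gene locations", so running the parser over the whole file would return each exon twice unless the input is restricted to one section.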