GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:37:38 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs:      1
Total sequence length:     587
Minimum sequence length:   587
Maximum sequence length:   587

Length distribution (number of sequences of specified length):
  <  100: 0
  <  200: 0
  <  300: 0
  <  400: 0
  <  500: 0
  <  600: 1
  <  700: 0
  <  800: 0
  <  900: 0
  < 1000: 0
  >=1000: 0

________________________________________________________________________________

Sequence 1: 43304AAGCTTCAGCTTAGCAGAGAAGGTCTGGAAGCAATTAGCAGAATTACAACTCCTATTTCT, from 1 to 96767, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 1
EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 1
Matches indexed, elapsed seconds = 1
HitsTableSize = 0

********************************************************************************

EST sequence 1 -strand (File: gi-)

    1  GCTTTGATAA CGGAGAATGC AACGAGGATT CTCCAGTAGT GCCACTGGGA AAGAGTCCAA
   61  GAACTGATGT GCTTTTGAAA GCTGAAAAAG TAGTGTTGGA ATGTGGAGGG ACTGTCCTTA
  121  GACTAGCAGG GCTTTACACA GAAACTAGAG GTGCACATAC TTACTGGTTG AGTAAGGAGA
  181  CAATTGATGC TCGTCCTGAT CATATTCTAA ATCTCATACA CTATGAGGAT GCAGCATCGC
  241  TGGCAGTTGC AATCATGAAG AAGAAAGCCG GTGCTCGGAT TTTCTTGGGT TGTGACAACC
  301  ATCCTTTGTC AAGGCAAGAG GTGATGGACC TGATGGCTCA AAGCGGAAAA TTTGATAAGA
  361  AGTTCAAAGG TTTTACAAGC ACCAGTGGTC CTTTAGGGAA GAAGCTGAAC AACTCTAAGA
  421  CACGAGCGGA GATAGGATGG GAGCCGAAGT ATCCAAGCTT TGCCCAATTT TTTGGAGTAT
  481  CGACATAATA TTTTTACTTG GATTGATTAA GAATGTCTCT AGCGCTGAAG AATCCAATAA
  541  TGTGAAGCAT TATTTATGTT TGGTACAAAC AAACTCATGT ATCCTCT

Predicted gene structure (within gDNA segment 83509 to 81629):

 Exon   1  83204 83179 (  26 n);  cDNA   1  26 (  26 n);  score: 1.000
 Intron 1  83178 82886 ( 293 n);  Pd: 0.937 (s: n/a), Pa: 0.994 (s: 1.00)
 Exon   2  82885 82771 ( 115 n);  cDNA  27 141 ( 115 n);  score: 0.983
 Intron 2  82770 82664 ( 107 n);  Pd: 0.000 (s: 0.96), Pa: 0.809 (s: 1.00)
 Exon   3  82663 82578 (  86 n);  cDNA 142 227 (  86 n);  score: 1.000
 Intron 3  82577 82453 ( 125 n);  Pd: 0.983 (s: 1.00), Pa: 1.000 (s: 1.00)
 Exon   4  82452 82367 (  86 n);  cDNA 228 313 (  86 n);  score: 1.000
 Intron 4  82366 82284 (  83 n);  Pd: 0.994 (s: 1.00), Pa: 0.937 (s: 1.00)
 Exon   5  82283 82219 (  65 n);  cDNA 314 378 (  65 n);  score: 1.000
 Intron 5  82218 82138 (  81 n);  Pd: 0.900 (s: 1.00), Pa: 0.957 (s: 1.00)
 Exon   6  82137 81929 ( 209 n);  cDNA 379 587 ( 209 n);  score: 1.000

MATCH 43304AAGCTTCAGCTTAGCAGAGAAGGTCTGGAAGCAATTAGCAGAATTACAACTCCTATTTCT- gi- 0.996 561 0.956 C
PGS_43304AAGCTTCAGCTTAGCAGAGAAGGTCTGGAAGCAATTAGCAGAATTACAACTCCTATTTCT-_gi- (83204 83179,82885 82771,82663 82578,82452 82367,82283 82219,82137 81929)

Alignment (genomic DNA sequence = upper lines):

GCTTTGATAA CGGAGAATGC AACGAGGTTT GTCTCAGCAC CTGAATACCT ATTTTCTTTC  83145
|||||||||| |||||||||| ||||||
GCTTTGATAA CGGAGAATGC AACGAG.... .......... .......... ..........     26

AGTTTTCTCC ATCCTAAAAT TCCTAGTTTT AGACTTTAAC ACCACATTTA GTATTATTCT  83085

.......... .......... .......... .......... .......... ..........     26

ATATCAAAGT TGTTTAACAC AATTACTTGT TGAGGTAATT TGGGGAAAAG AGTCTCTCAC  83025

.......... .......... .......... .......... .......... ..........     26

ATTTAAAATC TATGAAAATC AAGTATACAA TCAATAAAAA AATTTGATAT TTGAGACCAC  82965

.......... .......... .......... .......... .......... ..........     26

AATGCAAATT CTTGTTCATT GTGAATATAT ATATGTGCTA TTCAATAAGA ATTTCTAAGT  82905

.......... .......... .......... .......... .......... ..........     26

TCGTTACCCT TCACTGCAGG ATTCTCCAGT AGTGCCACTG GGAAAGAGTC CAAGAACTGA  82845
                    | |||||||||| |||||||||| |||||||||| ||||||||||
.......... .........G ATTCTCCAGT AGTGCCACTG GGAAAGAGTC CAAGAACTGA     67

TGTGCTTTTG AAAGCTGAAA AAGTAGTGTT GGAATGTGGA GGGACTGTCC TTAGACTAGC  82785
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TGTGCTTTTG AAAGCTGAAA AAGTAGTGTT GGAATGTGGA GGGACTGTCC TTAGACTAGC    127

AGGGCTTTAC ATATCCTTTT GGATTTCGAT ACTGAAACTG AATGAGCAAG TATTGAAAGA  82725
|||||||||| | |
AGGGCTTTAC ACAG...... .......... .......... .......... ..........    141

CTTTTTTCTT GGTCTGTAGA AGATTTGAGC ATTATGTTTC TCCTGAACTT TTTATACACA  82665

.......... .......... .......... .......... .......... ..........    141

GAAACTAGAG GTGCACATAC TTACTGGTTG AGTAAGGAGA CAATTGATGC TCGTCCTGAT  82605
 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
.AAACTAGAG GTGCACATAC TTACTGGTTG AGTAAGGAGA CAATTGATGC TCGTCCTGAT    200

CATATTCTAA ATCTCATACA CTATGAGGTA TATTTGTGAT TTTGTTCACT CTCATAACTA  82545
|||||||||| |||||||||| |||||||
CATATTCTAA ATCTCATACA CTATGAG... .......... .......... ..........    227

ATTAGTTGAT CTGCAGATTG AACTCATTTT CTAGAATGGC AGTGTACTAT ACATTAGTCT  82485

.......... .......... .......... .......... .......... ..........    227

CTTATCAGGT TCTCTAATGT GTTTTTGAAC AGGATGCAGC ATCGCTGGCA GTTGCAATCA  82425
                                   |||||||| |||||||||| ||||||||||
.......... .......... .......... ..GATGCAGC ATCGCTGGCA GTTGCAATCA    255

TGAAGAAGAA AGCCGGTGCT CGGATTTTCT TGGGTTGTGA CAACCATCCT TTGTCAAGGT  82365
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||
TGAAGAAGAA AGCCGGTGCT CGGATTTTCT TGGGTTGTGA CAACCATCCT TTGTCAAG..    313

AATGCTGCTT TCGTTCTAAC TTTGAGATTA CTTTCTAACT TTTGATGAGA CATCAAATGT  82305

.......... .......... .......... .......... .......... ..........    313

TTGACCCCTC ACTCTTCTCA GGCAAGAGGT GATGGACCTG ATGGCTCAAA GCGGAAAATT  82245
                       ||||||||| |||||||||| |||||||||| ||||||||||
.......... .......... .GCAAGAGGT GATGGACCTG ATGGCTCAAA GCGGAAAATT    352

TGATAAGAAG TTCAAAGGTT TTACAAGTGA GTTTCTTCAG ACATTGACAT GTAGTTGTCT  82185
|||||||||| |||||||||| ||||||
TGATAAGAAG TTCAAAGGTT TTACAA.... .......... .......... ..........    378

TGAAAAACCT TAAGCTTCAT GGAGTCTGAC AACTTTGGCT TTTGTAGGCA CCAGTGGTCC  82125
                                                   ||| ||||||||||
.......... .......... .......... .......... .......GCA CCAGTGGTCC    391

TTTAGGGAAG AAGCTGAACA ACTCTAAGAC ACGAGCGGAG ATAGGATGGG AGCCGAAGTA  82065
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TTTAGGGAAG AAGCTGAACA ACTCTAAGAC ACGAGCGGAG ATAGGATGGG AGCCGAAGTA    451

TCCAAGCTTT GCCCAATTTT TTGGAGTATC GACATAATAT TTTTACTTGG ATTGATTAAG  82005
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TCCAAGCTTT GCCCAATTTT TTGGAGTATC GACATAATAT TTTTACTTGG ATTGATTAAG    511

AATGTCTCTA GCGCTGAAGA ATCCAATAAT GTGAAGCATT ATTTATGTTT GGTACAAACA  81945
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AATGTCTCTA GCGCTGAAGA ATCCAATAAT GTGAAGCATT ATTTATGTTT GGTACAAACA    571

AACTCATGTA TCCTCT  81929
|||||||||| ||||||
AACTCATGTA TCCTCT    587

Total number of EST alignments reported: 1

________________________________________________________________________________

Predicted gene locations (1):

PGL 1 (- strand): 83204 81929
AGS-1 (83204 83179,82885 82771,82663 82578,82452 82367,82283 82219,82137 81929)
SCR (e 1.000 d 0.937 a 0.994,e 0.983 d 0.000 a 0.809,e 1.000 d 0.983 a 1.000,e 1.000 d 0.994 a 0.937,e 1.000 d 0.900 a 0.957,e 1.000)

 Exon   1  83204 83179 (  26 n);  score: 1.000
 Intron 1  83178 82886 ( 293 n);  Pd: 0.937 Pa: 0.994
 Exon   2  82885 82771 ( 115 n);  score: 0.983
 Intron 2  82770 82664 ( 107 n);  Pd: 0.000 Pa: 0.809
 Exon   3  82663 82578 (  86 n);  score: 1.000
 Intron 3  82577 82453 ( 125 n);  Pd: 0.983 Pa: 1.000
 Exon   4  82452 82367 (  86 n);  score: 1.000
 Intron 4  82366 82284 (  83 n);  Pd: 0.994 Pa: 0.937
 Exon   5  82283 82219 (  65 n);  score: 1.000
 Intron 5  82218 82138 (  81 n);  Pd: 0.900 Pa: 0.957
 Exon   6  82137 81929 ( 209 n);  score: 1.000

PGS (83204 83179,82885 82771,82663 82578,82452 82367,82283 82219,82137 81929) gi-

3-phase translation of AGS-1 (-strand):

       . . . : . . .
83204  GCTTTGATAACGGAGAATGCAACGAG : GATTCTCCAGTAGTGCCACTGGGAAAGAGTCCAA
       A L I T E N A T R : I L Q - C H W E R V Q
       L - - R R M Q R : G F S S S A T G K E S K
       F D N G E C N E : D S P V V P L G K S P

       . . . . . .
82851  GAACTGATGTGCTTTTGAAAGCTGAAAAAGTAGTGTTGGAATGTGGAGGGACTGTCCTTA
       E L M C F - K L K K - C W N V E G L S L
       N - C A F E S - K S S V G M W R D C P -
       R T D V L L K A E K V V L E C G G T V L

       . . . : . . .
82791  GACTAGCAGGGCTTTACATAT : AAACTAGAGGTGCACATACTTACTGGTTGAGTAAGGAGA
       D - Q G F T Y : K L E V H I L T G - V R R
       T S R A L H I : N - R C T Y L L V E - G D
       R L A G L Y I : - T R G A H T Y W L S K E

       . . . . . : .
82624  CAATTGATGCTCGTCCTGATCATATTCTAAATCTCATACACTATGAG : GATGCAGCATCGC
       Q L M L V L I I F - I S Y T M R : M Q H R
       N - C S S - S Y S K S H T L - : G C S I
       A T I D A R P D H I L N L I H Y E : D A A S

       . . . . . .
82439  TGGCAGTTGCAATCATGAAGAAGAAAGCCGGTGCTCGGATTTTCTTGGGTTGTGACAACC
       W Q L Q S - R R K P V L G F S W V V T T
       G S C N H E E E S R C S D F L G L - Q P
       L A V A I M K K K A G A R I F L G C D N

       . . : . . . .
82379  ATCCTTTGTCAAG : GCAAGAGGTGATGGACCTGATGGCTCAAAGCGGAAAATTTGATAAGA
       I L C Q : G K R - W T - W L K A E N L I R
       S F V K : A R G D G P D G S K R K I - - E
       H P L S R : Q E V M D L M A Q S G K F D K

       . . : . . . .
82236  AGTTCAAAGGTTTTACAA : GCACCAGTGGTCCTTTAGGGAAGAAGCTGAACAACTCTAAGA
       S S K V L Q : A P V V L - G R S - T T L R
       V Q R F Y K : H Q W S F R E E A E Q L - D
       K F K G F T : S T S G P L G K K L N N S K

       . . . . . .
82095  CACGAGCGGAGATAGGATGGGAGCCGAAGTATCCAAGCTTTGCCCAATTTTTTGGAGTAT
       H E R R - D G S R S I Q A L P N F L E Y
       T S G D R M G A E V S K L C P I F W S I
       T R A E I G W E P K Y P S F A Q F F G V

       . . . . . .
82035  CGACATAATATTTTTACTTGGATTGATTAAGAATGTCTCTAGCGCTGAAGAATCCAATAA
       R H N I F T W I D - E C L - R - R I Q -
       D I I F L L G L I K N V S S A E E S N N
       S T - Y F Y L D - L R M S L A L K N P I

       . . . . .
81975  TGTGAAGCATTATTTATGTTTGGTACAAACAAACTCATGTATCCTCT
       C E A L F M F G T N K L M Y P
       V K H Y L C L V Q T N S C I L
       M - S I I Y V W Y K Q T H V S S

Maximal non-overlapping open reading frames (>= 64 codons):

>43304AAGCTTCAGCTTAGCAGAGAAGGTCTGGAAGCAATTAGCAGAATTACAACTCCTATTTCT-_PGL-1_AGS-1_PPS_1 (82661 82578,82452 82367,82283 82219,82137 82028) (frame '0'; 342 bp, 114 residues)
    1  TRGAHTYWLS KETIDARPDH ILNLIHYEDA ASLAVAIMKK KAGARIFLGC DNHPLSRQEV
   61  MDLMAQSGKF DKKFKGFTST SGPLGKKLNN SKTRAEIGWE PKYPSFAQFF GVST-