GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:37:32 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 612 Minimum sequence length: 612 Maximum sequence length: 612 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 0 < 700: 1 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA, from 1 to 118136, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 ATGGAGTTTT GGGGTATCGA GATTAAGCCA GGGAAGCCAT TTAAGGTGAT ACAAAAAGAT 61 GGATTCATGG TCCATGCCTC TCAGGTTACC CTTGGTGACG TTGAGAAGGT TAAAAAAGAT 121 GAGACTTTTG CCGTTTATGT GAAGATTGGT GATGATGAGA ATGGGTTTAT GATTGGAAAT 181 CTCTCACAGA AGTTTCCTCA ATTTTCTATT GATCTCTACT TAGGGCACGA GTTTGAGATT 241 TCTCACAACA GTACAAGCAG TGTCTATCTT ATTGGTTACA GGACCTTTGA TGCTTTTGAC 301 GAACTGGATG AGGAGATTGA TTCTGATTCT GAGTTAGATG AATATATGGA ACAACAAATT 361 GCTGCTTTGC CTCAAAATGA GATCAATCCT GAAGAAGATG ATGAATCCGA CTCAGATGAG 421 ATGGGTTTGG ACGAGGATGA TGACTCTTCA GATGAAGAAG ATGTAGAGGC TGAAGCACCT 481 TTAAAGGTGG CTCCTCCGAG CAAAAAGATG CCAAATGGTG CATTTGAGAT AGCTAAAGGT 541 GGAAAGAAGA ACAAGTCATC AGGAGGGAAG AAGAGATGCC CATTCCCTTG TGGTCCCTCT 601 TGCAAAAAGT AG Predicted gene structure (within gDNA segment 17832 to 15605): Exon 1 17532 17520 ( 13 n); cDNA 1 13 ( 13 n); score: 1.000 Intron 1 17519 17076 ( 444 n); Pd: 0.997 (s: n/a), Pa: 0.997 (s: 1.00) Exon 2 17075 17005 ( 71 n); cDNA 14 84 ( 71 n); score: 1.000 Intron 2 17004 16913 ( 92 n); Pd: 0.995 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 16912 16692 ( 221 n); cDNA 85 305 ( 221 n); score: 1.000 Intron 3 16691 16580 ( 112 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: n/a) Exon 4 16579 16560 ( 20 n); cDNA 306 325 ( 20 n); score: 1.000 Intron 4 16559 16476 ( 84 n); Pd: 0.996 (s: n/a), Pa: 0.980 (s: n/a) Exon 5 16475 16440 ( 36 n); cDNA 326 361 ( 36 n); score: 1.000 Intron 5 16439 16347 ( 93 n); Pd: 0.950 (s: n/a), Pa: 0.996 (s: 1.00) Exon 6 16346 16288 ( 59 n); cDNA 362 420 ( 59 n); score: 1.000 Intron 6 16287 16197 ( 91 n); Pd: 0.999 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 7 16196 16055 ( 142 n); cDNA 421 562 ( 142 n); score: 1.000 Intron 7 16054 15960 ( 95 n); Pd: 0.993 (s: 1.00), Pa: 0.987 (s: 1.00) Exon 8 15959 15910 ( 50 n); cDNA 563 612 ( 50 n); score: 1.000 MATCH 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA- gi+ 1.000 543 0.887 C PGS_42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_gi+ (17532 17520,17075 17005,16912 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15910) Alignment (genomic DNA sequence = upper lines): ATGGAGTTTT GGGGTATGTC TCTCTCAAAA TCATATCTTT ATTAAGTTCT ACTGAATACC 17473 |||||||||| ||| ATGGAGTTTT GGG....... .......... .......... .......... .......... 13 TCAATGACTC CTATTGACTG TTTTCGCCAT TAAAGTTAAT CATTTTGGTA TTCCTAAGTT 17413 .......... .......... .......... .......... .......... .......... 13 CGTTCTGTGA TTAAAAATCT GGTCTTTATA CAAAAGCTTG CCTTTCTTCT GTATTGAATT 17353 .......... .......... .......... .......... .......... .......... 13 CTCTCTATTA CTCAATACGC ATTTTCTTGT GTTCAAAACT ATACAATTGT TTGGGAATAT 17293 .......... .......... .......... .......... .......... .......... 13 TAGCATTCTT AGTGTTTAGG TTTCAATGCA TCTATCTTTG TGAGGTTATG TTTCAAAGTA 17233 .......... .......... .......... .......... .......... .......... 13 TCTATCTTTA AGAGTTTATG TTTCAAAGCA TCTATCTTTG TGAGTTTAGG TGTCAAAGTA 17173 .......... .......... .......... .......... .......... .......... 13 TCTATATCTT TGTGAGTTTA GGTGTCAAAG CATCTATCTT TGTGAGATTT TGTATGAATT 17113 .......... .......... .......... .......... .......... .......... 13 TGTACTCAAA TGTGTGATGA ATAATGTTCA ATTTTAGGTA TCGAGATTAA GCCAGGGAAG 17053 ||| |||||||||| |||||||||| .......... .......... .......... .......GTA TCGAGATTAA GCCAGGGAAG 36 CCATTTAAGG TGATACAAAA AGATGGATTC ATGGTCCATG CCTCTCAGGT CTGTATCAGT 16993 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| CCATTTAAGG TGATACAAAA AGATGGATTC ATGGTCCATG CCTCTCAG.. .......... 84 TTTGTAATTC CTGCAAGTTT CTCTCTTTGT GTAAATCCAT GATTGATATT GATTGTGTTA 16933 .......... .......... .......... .......... .......... .......... 84 CTTGTTGTAA CATCTCTTAG GTTACCCTTG GTGACGTTGA GAAGGTTAAA AAAGATGAGA 16873 |||||||||| |||||||||| |||||||||| |||||||||| .......... .......... GTTACCCTTG GTGACGTTGA GAAGGTTAAA AAAGATGAGA 124 CTTTTGCCGT TTATGTGAAG ATTGGTGATG ATGAGAATGG GTTTATGATT GGAAATCTCT 16813 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTTGCCGT TTATGTGAAG ATTGGTGATG ATGAGAATGG GTTTATGATT GGAAATCTCT 184 CACAGAAGTT TCCTCAATTT TCTATTGATC TCTACTTAGG GCACGAGTTT GAGATTTCTC 16753 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACAGAAGTT TCCTCAATTT TCTATTGATC TCTACTTAGG GCACGAGTTT GAGATTTCTC 244 ACAACAGTAC AAGCAGTGTC TATCTTATTG GTTACAGGAC CTTTGATGCT TTTGACGAAC 16693 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAACAGTAC AAGCAGTGTC TATCTTATTG GTTACAGGAC CTTTGATGCT TTTGACGAAC 304 TATATCCTTT CTTTCATTGT CTTTTTCTAG TTAAATTCAT ACTCACTTGC AATTTTGAAG 16633 | T......... .......... .......... .......... .......... .......... 305 CTTTGTTTGT TTATCCTTGA TTTTGTTTTG CTCTGCTTTT TTCTGATGCT CACGGATGAG 16573 ||||||| .......... .......... .......... .......... .......... ...GGATGAG 312 GAGATTGATT CTGGTGAGTT TATAATTCTT TAGTTGATGA TTTGGTTTAC TCTATTATGA 16513 |||||||||| ||| GAGATTGATT CTG....... .......... .......... .......... .......... 325 AGCTTGAAGC TAATCTTTTG CTTTTTGGTT TGTGTAGATT CTGAGTTAGA TGAATATATG 16453 ||| |||||||||| |||||||||| .......... .......... .......... .......ATT CTGAGTTAGA TGAATATATG 348 GAACAACAAA TTGGTATGTT TACTTTTTTT ACTAAGCTAC TGCTTTGTGG TAGTCATATT 16393 |||||||||| ||| GAACAACAAA TTG....... .......... .......... .......... .......... 361 TACATGTGTA TCTCTTATGA AATCTAACTT TGACATTGTT GAAAAGCTGC TTTGCCTCAA 16333 |||| |||||||||| .......... .......... .......... .......... ......CTGC TTTGCCTCAA 375 AATGAGATCA ATCCTGAAGA AGATGATGAA TCCGACTCAG ATGAGGTAGA TTTTTCTTTT 16273 |||||||||| |||||||||| |||||||||| |||||||||| ||||| AATGAGATCA ATCCTGAAGA AGATGATGAA TCCGACTCAG ATGAG..... .......... 420 AAGTATTTAC TGATTAGCAA TGAATCAGTA TAAAAAAAAA GGTAATCTTG TTGTTTTTTT 16213 .......... .......... .......... .......... .......... .......... 420 CTGTATGTTT GGTTAGATGG GTTTGGACGA GGATGATGAC TCTTCAGATG AAGAAGATGT 16153 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......ATGG GTTTGGACGA GGATGATGAC TCTTCAGATG AAGAAGATGT 464 AGAGGCTGAA GCACCTTTAA AGGTGGCTCC TCCGAGCAAA AAGATGCCAA ATGGTGCATT 16093 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGGCTGAA GCACCTTTAA AGGTGGCTCC TCCGAGCAAA AAGATGCCAA ATGGTGCATT 524 TGAGATAGCT AAAGGTGGAA AGAAGAACAA GTCATCAGGT AACGAAAACT AAAGTTAAGA 16033 |||||||||| |||||||||| |||||||||| |||||||| TGAGATAGCT AAAGGTGGAA AGAAGAACAA GTCATCAG.. .......... .......... 562 TATATAAAGA TAACTTTGCA TATGAGATGA GAATTGTGGA TTATTGATTT GGTTTAAAAC 15973 .......... .......... .......... .......... .......... .......... 562 CATTTTGGTG CAGGAGGGAA GAAGAGATGC CCATTCCCTT GTGGTCCCTC TTGCAAAAAG 15913 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ...GAGGGAA GAAGAGATGC CCATTCCCTT GTGGTCCCTC TTGCAAAAAG 609 TAG 15910 ||| TAG 612 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 17075 15910 AGS-1 (17075 17005,16912 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15910) SCR (e 1.000 d 0.995 a 0.997,e 1.000 d 0.000 a 0.000,e 1.000 d 0.996 a 0.980,e 1.000 d 0.950 a 0.996,e 1.000 d 0.999 a 1.000,e 1.000 d 0.993 a 0.987,e 1.000) Exon 1 17075 17005 ( 71 n); score: 1.000 Intron 1 17004 16913 ( 92 n); Pd: 0.995 Pa: 0.997 Exon 2 16912 16692 ( 221 n); score: 1.000 Intron 2 16691 16580 ( 112 n); Pd: 0.000 Pa: 0.000 Exon 3 16579 16560 ( 20 n); score: 1.000 Intron 3 16559 16476 ( 84 n); Pd: 0.996 Pa: 0.980 Exon 4 16475 16440 ( 36 n); score: 1.000 Intron 4 16439 16347 ( 93 n); Pd: 0.950 Pa: 0.996 Exon 5 16346 16288 ( 59 n); score: 1.000 Intron 5 16287 16197 ( 91 n); Pd: 0.999 Pa: 1.000 Exon 6 16196 16055 ( 142 n); score: 1.000 Intron 6 16054 15960 ( 95 n); Pd: 0.993 Pa: 0.987 Exon 7 15959 15910 ( 50 n); score: 1.000 PGS (17075 17005,16912 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15910) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 17075 GTATCGAGATTAAGCCAGGGAAGCCATTTAAGGTGATACAAAAAGATGGATTCATGGTCC V S R L S Q G S H L R - Y K K M D S W S Y R D - A R E A I - G D T K R W I H G P I E I K P G K P F K V I Q K D G F M V . . : . . . . 17015 ATGCCTCTCAG : GTTACCCTTGGTGACGTTGAGAAGGTTAAAAAAGATGAGACTTTTGCCG M P L R : L P L V T L R R L K K M R L L P C L S : G Y P W - R - E G - K R - D F C R H A S Q : V T L G D V E K V K K D E T F A . . . . . . 16863 TTTATGTGAAGATTGGTGATGATGAGAATGGGTTTATGATTGGAAATCTCTCACAGAAGT F M - R L V M M R M G L - L E I S H R S L C E D W - - - E W V Y D W K S L T E V V Y V K I G D D E N G F M I G N L S Q K . . . . . . 16803 TTCCTCAATTTTCTATTGATCTCTACTTAGGGCACGAGTTTGAGATTTCTCACAACAGTA F L N F L L I S T - G T S L R F L T T V S S I F Y - S L L R A R V - D F S Q Q Y F P Q F S I D L Y L G H E F E I S H N S . . . . . . : 16743 CAAGCAGTGTCTATCTTATTGGTTACAGGACCTTTGATGCTTTTGACGAACT : GGATGAGG Q A V S I L L V T G P L M L L T N : W M R K Q C L S Y W L Q D L - C F - R T : G - G T S S V Y L I G Y R T F D A F D E L : D E . . : . . . : . 16571 AGATTGATTCTG : ATTCTGAGTTAGATGAATATATGGAACAACAAATTG : CTGCTTTGCCTC R L I L : I L S - M N I W N N K L : L L C L D - F - : F - V R - I Y G T T N C : C F A S E I D S : D S E L D E Y M E Q Q I : A A L P . . . . . : . 16334 AAAATGAGATCAATCCTGAAGAAGATGATGAATCCGACTCAGATGAG : ATGGGTTTGGACG K M R S I L K K M M N P T Q M R : W V W T K - D Q S - R R - - I R L R - : D G F G R Q N E I N P E E D D E S D S D E : M G L D . . . . . . 16183 AGGATGATGACTCTTCAGATGAAGAAGATGTAGAGGCTGAAGCACCTTTAAAGGTGGCTC R M M T L Q M K K M - R L K H L - R W L G - - L F R - R R C R G - S T F K G G S E D D D S S D E E D V E A E A P L K V A . . . . . . 16123 CTCCGAGCAAAAAGATGCCAAATGGTGCATTTGAGATAGCTAAAGGTGGAAAGAAGAACA L R A K R C Q M V H L R - L K V E R R T S E Q K D A K W C I - D S - R W K E E Q P P S K K M P N G A F E I A K G G K K N . : . . . . . 16063 AGTCATCAG : GAGGGAAGAAGAGATGCCCATTCCCTTGTGGTCCCTCTTGCAAAAAGTAG S H Q : E G R R D A H S L V V P L A K S V I R : R E E E M P I P L W S L L Q K V K S S : G G K K R C P F P C G P S C K K - Maximal non-overlapping open reading frames (>= 64 codons): >42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_PGL-1_AGS-1_PPS_1 (17073 17005,16912 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15910) (frame '0'; 594 bp, 198 residues) 1 IEIKPGKPFK VIQKDGFMVH ASQVTLGDVE KVKKDETFAV YVKIGDDENG FMIGNLSQKF 61 PQFSIDLYLG HEFEISHNST SSVYLIGYRT FDAFDELDEE IDSDSELDEY MEQQIAALPQ 121 NEINPEEDDE SDSDEMGLDE DDDSSDEEDV EAEAPLKVAP PSKKMPNGAF EIAKGGKKNK 181 SSGGKKRCPF PCGPSCKK-