GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:37:17 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 512 Minimum sequence length: 512 Maximum sequence length: 512 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 1 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 42492GAATTCAGTTTTGTTTCTTTTATTAGTATGGGAGCAGCTTCAGAAAGAGAATGCAGATTT, from 1 to 131683, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 CACAAACCCT AAACTATGTC CGGCGGTTTC TTCCGGGGTA CTTCCGCCGA GCAGGATACT 61 CGGTTTTCTA ACAAGCAAGC TAAATTGATG AAAAGCCAGA AATTTGCTCC GGAGTTAGAG 121 AATCTGGTGG ATATCACTAA AGTGAAGATG GATGTTATGA AACCTTGGAT TGCGACTAGA 181 GTCACTGAGC TTCTTGGTAT TGAAGATGAA GTGCTGATTA ACTTTATCTA TGGTCTGCTT 241 GACGGGAAGG TAGTGAATGG TAAGGAGATT CAGATAACGC TAACTGGATT CATGGAGAAA 301 AATACTGGCA AATTTATGAA AGAACTTTGG ACACTTCTTC TAAGCGCACA GAATAATCCA 361 AGTGGTGTTC CACAGCAGTT TTTAGATGCT AGAGCTGCTG AAACAAAGAA GAAACAGGAA 421 GAAGCAAATG AAATAATGAA GAAGAGGGAG GGAGATAAGA AGAACATTGA GCACGATATA 481 TTGAGGAAAA TAGTTATTCA GATAGCTTCT GC Predicted gene structure (within gDNA segment 101837 to 100121): Exon 1 101537 101502 ( 36 n); cDNA 1 36 ( 36 n); score: 1.000 Intron 1 101501 101383 ( 119 n); Pd: 0.992 (s: n/a), Pa: 0.981 (s: 1.00) Exon 2 101382 101293 ( 90 n); cDNA 37 126 ( 90 n); score: 1.000 Intron 2 101292 101131 ( 162 n); Pd: 0.980 (s: 1.00), Pa: 0.946 (s: 1.00) Exon 3 101130 101008 ( 123 n); cDNA 127 249 ( 123 n); score: 1.000 Intron 3 101007 100854 ( 154 n); Pd: 0.998 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 4 100853 100686 ( 168 n); cDNA 250 417 ( 168 n); score: 1.000 Intron 4 100685 100598 ( 88 n); Pd: 0.604 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 5 100597 100523 ( 75 n); cDNA 418 492 ( 75 n); score: 1.000 Intron 5 100522 100446 ( 77 n); Pd: 0.831 (s: 1.00), Pa: 0.000 (s: n/a) Exon 6 100445 100426 ( 20 n); cDNA 493 512 ( 20 n); score: 1.000 MATCH 42492GAATTCAGTTTTGTTTCTTTTATTAGTATGGGAGCAGCTTCAGAAAGAGAATGCAGATTT- gi+ 1.000 456 0.891 C PGS_42492GAATTCAGTTTTGTTTCTTTTATTAGTATGGGAGCAGCTTCAGAAAGAGAATGCAGATTT-_gi+ (101537 101502,101382 101293,101130 101008,100853 100686,100597 100523,100445 100426) Alignment (genomic DNA sequence = upper lines): CACAAACCCT AAACTATGTC CGGCGGTTTC TTCCGGGTTA GTTTTCCGAT CAATTTTCTC 101478 |||||||||| |||||||||| |||||||||| |||||| CACAAACCCT AAACTATGTC CGGCGGTTTC TTCCGG.... .......... .......... 36 AATTGATCTC TCATCTTTTA GATCCAGATC GATGCAATTC AAAAGTTTTG TTGAAGTTCC 101418 .......... .......... .......... .......... .......... .......... 36 CATGGTTTAA TAACTTCTTT GTTTTGAGCT TTTAGGGTAC TTCCGCCGAG CAGGATACTC 101358 ||||| |||||||||| |||||||||| .......... .......... .......... .....GGTAC TTCCGCCGAG CAGGATACTC 61 GGTTTTCTAA CAAGCAAGCT AAATTGATGA AAAGCCAGAA ATTTGCTCCG GAGTTAGAGA 101298 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTTTTCTAA CAAGCAAGCT AAATTGATGA AAAGCCAGAA ATTTGCTCCG GAGTTAGAGA 121 ATCTGGTGAG TTTTCTCTTT GTGTACTCGT CCATTTTAAC GATTTGTGAA TTCGTAGCTT 101238 ||||| ATCTG..... .......... .......... .......... .......... .......... 126 GAATTTACTT AATTGAGATT CGATTACCAG ATAATTCGAT ACATATGTTA CTATGAAATT 101178 .......... .......... .......... .......... .......... .......... 126 AAGTTTGGGG AGGGCTTAAT GATGTCTTCT ACTTGCCTTG GTGTTAGGTG GATATCACTA 101118 ||| |||||||||| .......... .......... .......... .......... .......GTG GATATCACTA 139 AAGTGAAGAT GGATGTTATG AAACCTTGGA TTGCGACTAG AGTCACTGAG CTTCTTGGTA 101058 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGTGAAGAT GGATGTTATG AAACCTTGGA TTGCGACTAG AGTCACTGAG CTTCTTGGTA 199 TTGAAGATGA AGTGCTGATT AACTTTATCT ATGGTCTGCT TGACGGGAAG GTAAGCTCAT 100998 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGAAGATGA AGTGCTGATT AACTTTATCT ATGGTCTGCT TGACGGGAAG .......... 249 TACTGAGATT TTATTTTTCT GAGCTTTTTA TTCGACTGTT AATAAGTGAC AGAGGGGTAG 100938 .......... .......... .......... .......... .......... .......... 249 TATGATTTCG TAGCCCATGC CATTGATACA AGTGATTTTT TTTTGTTGGA CTAATATGTA 100878 .......... .......... .......... .......... .......... .......... 249 ATTGAACGTT CACTATGTAT GTAGGTAGTG AATGGTAAGG AGATTCAGAT AACGCTAACT 100818 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GTAGTG AATGGTAAGG AGATTCAGAT AACGCTAACT 285 GGATTCATGG AGAAAAATAC TGGCAAATTT ATGAAAGAAC TTTGGACACT TCTTCTAAGC 100758 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATTCATGG AGAAAAATAC TGGCAAATTT ATGAAAGAAC TTTGGACACT TCTTCTAAGC 345 GCACAGAATA ATCCAAGTGG TGTTCCACAG CAGTTTTTAG ATGCTAGAGC TGCTGAAACA 100698 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACAGAATA ATCCAAGTGG TGTTCCACAG CAGTTTTTAG ATGCTAGAGC TGCTGAAACA 405 AAGAAGAAAC AGGTATTCTT GGGAACTCAT TCTGAAGTTA GTTTGGTGAA CGACTTGTCT 100638 |||||||||| || AAGAAGAAAC AG........ .......... .......... .......... .......... 417 CTGTCCTACT GTTTCATGAT TATTATCCTT TATGTGGCAG GAAGAAGCAA ATGAAATAAT 100578 |||||||||| |||||||||| .......... .......... .......... .......... GAAGAAGCAA ATGAAATAAT 437 GAAGAAGAGG GAGGGAGATA AGAAGAACAT TGAGCACGAT ATATTGAGGA AAATAGTAAG 100518 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| GAAGAAGAGG GAGGGAGATA AGAAGAACAT TGAGCACGAT ATATTGAGGA AAATA..... 492 ACGTTCTTAT TCATGCTAGA GTTTCAGCTT TAGAGATGCT CTTCTGGTTT TAACATTCGC 100458 .......... .......... .......... .......... .......... .......... 492 CTGTTCTTTC ATGTTATTCA GATAGCTTCT GC 100426 |||||||| |||||||||| || .......... ..GTTATTCA GATAGCTTCT GC 512 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 101537 100523 AGS-1 (101537 101502,101382 101293,101130 101008,100853 100686,100597 100523) SCR (e 1.000 d 0.992 a 0.981,e 1.000 d 0.980 a 0.946,e 1.000 d 0.998 a 0.999,e 1.000 d 0.604 a 1.000,e 1.000) Exon 1 101537 101502 ( 36 n); score: 1.000 Intron 1 101501 101383 ( 119 n); Pd: 0.992 Pa: 0.981 Exon 2 101382 101293 ( 90 n); score: 1.000 Intron 2 101292 101131 ( 162 n); Pd: 0.980 Pa: 0.946 Exon 3 101130 101008 ( 123 n); score: 1.000 Intron 3 101007 100854 ( 154 n); Pd: 0.998 Pa: 0.999 Exon 4 100853 100686 ( 168 n); score: 1.000 Intron 4 100685 100598 ( 88 n); Pd: 0.604 Pa: 1.000 Exon 5 100597 100523 ( 75 n); score: 1.000 PGS (101537 101502,101382 101293,101130 101008,100853 100686,100597 100523) gi+ 3-phase translation of AGS-1 (-strand): . . . . : . . 101537 CACAAACCCTAAACTATGTCCGGCGGTTTCTTCCGG : GGTACTTCCGCCGAGCAGGATACT H K P - T M S G G F F R : G T S A E Q D T T N P K L C P A V S S G : V L P P S R I L Q T L N Y V R R F L P : G Y F R R A G Y . . . . . . 101358 CGGTTTTCTAACAAGCAAGCTAAATTGATGAAAAGCCAGAAATTTGCTCCGGAGTTAGAG R F S N K Q A K L M K S Q K F A P E L E G F L T S K L N - - K A R N L L R S - R S V F - Q A S - I D E K P E I C S G V R . : . . . . . 101298 AATCTG : GTGGATATCACTAAAGTGAAGATGGATGTTATGAAACCTTGGATTGCGACTAGA N L : V D I T K V K M D V M K P W I A T R I W : W I S L K - R W M L - N L G L R L E E S : G G Y H - S E D G C Y E T L D C D - . . . . . . 101076 GTCACTGAGCTTCTTGGTATTGAAGATGAAGTGCTGATTAACTTTATCTATGGTCTGCTT V T E L L G I E D E V L I N F I Y G L L S L S F L V L K M K C - L T L S M V C L S H - A S W Y - R - S A D - L Y L W S A . : . . . . . 101016 GACGGGAAG : GTAGTGAATGGTAAGGAGATTCAGATAACGCTAACTGGATTCATGGAGAAA D G K : V V N G K E I Q I T L T G F M E K T G R : - - M V R R F R - R - L D S W R K - R E : G S E W - G D S D N A N W I H G E . . . . . . 100802 AATACTGGCAAATTTATGAAAGAACTTTGGACACTTCTTCTAAGCGCACAGAATAATCCA N T G K F M K E L W T L L L S A Q N N P I L A N L - K N F G H F F - A H R I I Q K Y W Q I Y E R T L D T S S K R T E - S . . . . . . : 100742 AGTGGTGTTCCACAGCAGTTTTTAGATGCTAGAGCTGCTGAAACAAAGAAGAAACAG : GAA S G V P Q Q F L D A R A A E T K K K Q : E V V F H S S F - M L E L L K Q R R N R : K K W C S T A V F R C - S C - N K E E T : G . . . . . . 100594 GAAGCAAATGAAATAATGAAGAAGAGGGAGGGAGATAAGAAGAACATTGAGCACGATATA E A N E I M K K R E G D K K N I E H D I K Q M K - - R R G R E I R R T L S T I Y R S K - N N E E E G G R - E E H - A R Y . . 100534 TTGAGGAAAATA L R K I - G K I E E N Maximal non-overlapping open reading frames (>= 64 codons): >42492GAATTCAGTTTTGTTTCTTTTATTAGTATGGGAGCAGCTTCAGAAAGAGAATGCAGATTT-_PGL-1_AGS-1_PPS_1 (101525 101502,101382 101293,101130 101008,100853 100686,100597 100523) (frame '1'; 480 bp, 160 residues) 1 TMSGGFFRGT SAEQDTRFSN KQAKLMKSQK FAPELENLVD ITKVKMDVMK PWIATRVTEL 61 LGIEDEVLIN FIYGLLDGKV VNGKEIQITL TGFMEKNTGK FMKELWTLLL SAQNNPSGVP 121 QQFLDARAAE TKKKQEEANE IMKKREGDKK NIEHDILRKI