GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:36:22 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs: 1
Total sequence length: 1603
Minimum sequence length: 1603
Maximum sequence length: 1603
Length distribution (number of sequences of specified length):
  <  100: 0
  <  200: 0
  <  300: 0
  <  400: 0
  <  500: 0
  <  600: 0
  <  700: 0
  <  800: 0
  <  900: 0
  < 1000: 0
  >=1000: 1
________________________________________________________________________________

Sequence 1: 30373AAGCTTGACTGACACTTTTGCCTTGGGGTTGACCTGATTATAATTTTTTGTTAAGAGGGA, from 1 to 91479, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 1

********************************************************************************

EST sequence 1 +strand (File: gi+)

     1 TGAATTCTAG GCGACAATGA GCAAACCCCA TAAGTTAAAA GCCACTCCAG GATCTCAAAG
    61 ACTTGTTCTG TTATGTATAG TCGCAGTTGC ATTTCTCCTC CTTTTCACTT CGGTGATCTC
   121 CACCGGCGGA TTGGCTTTAC CGCATCAGAC NACCCTAATT GGTTATTTTG TGAGGTCAAC
   181 TCGAAACAAG ACACAGCATA GTTTGTCGGA CAAGTACTTG TACTGGGGAA ACAGAATCGA
   241 TTGTCCTGGT AAGAACTGTG AGACCTGTGC CGGTTTGGGT CACCAAGAAT CTAGCCTTAG
   301 ATGTGCCCTT GAAGAAGCCA TGTTTCTGAA CAGGACTTTT GTAATGCCAT CTCGGATGTG
   361 CATCAATCCA ATACATAACA AGAAGGGTAT ACTTAATCGA TCCAACAATG AAACTAGAGA
   421 GGAAAGTTGG GAAGTGAGCT CTTGTGCAAT GGAATCATTG TATGATATTG ATCTCATCTC
   481 TGAGAAAATA CCTGTGATCT TGGATGACTC GGAACCATGG CACATAATGC TATCGACGAG
   541 TATGAAATTG AAAGAACGTG GGAGTGCGCA TGTATATGGG GCAAACAGGC ATGAGCTAAA
   601 TGACTCTAGC GACTTTACAA ATCTTTTGCT CATTAACCGA ACCGCAAGCC CCCTTGCATG
   661 GTTTGTTGAA TGCAAGGATC GAGGTAATCG TAGCGACGTC ATGCTTCCTT ATTCATTTCT
   721 CCNGACTATG GCAGCATCAA GATTGAGAGA TGCTGCAGAA AAGGTGAAGG AGTTGCACAT
   781
AATCTAGCTT TCAGATAAAA GCAAAACTTG GTGATTACGA TGCNATCCAT GTTCGTCGAG
   841 GTGACAAACT GAAAACAAGA AAAGACAGAT TTCGCGTGGA AAGAAGCCAG TTTCCACATT
   901 TAGATAGAGA CACACGGCCA GAATTCATCA TTGGCAGAAT TCAGAAACAA ATCCCACCAG
   961 GACGGACTCT TTTTATCGGT TCTAATGAAA GAACCCCTGA TTTCTTTTCA CCTCTAGCTA
  1021 TCAGATACAA AGTGGCGTAT TCATCGAATT TTAGTGAGAT TTTGGATCCG ATCATCGAGA
  1081 ACAATTACCA GTTGTTCATG GTGGAGAGGT TGATAATGAT GGGTGCAAAG ACATTCTTCA
  1141 AAACATTTAG AGAGTACGAA ACCGATCTCA CTTTGACTGA TGATCCGAAA AAGAACAAGA
  1201 ACTGGGAAAT ACCAGTTTAC ACCATGGATG AAGGCAAAGA AGCAGCAAGC TAAACTATCT
  1261 ATGATCTCAC TAGGCACTAG CAAAGTCTAC ACAACGATTC AGACAAGAGT CNCATTACTC
  1321 TTCAAATACT TGGAGTTAAA AGCCTAATCT TATCTGAAGA TTCTATTGCA TAGGAACTCT
  1381 GTGTATGTGT AACCAGAGTC TNTANCAGAG ACTAACAGAC TAGATTTGCT ACTTGGCTGA
  1441 CTCATTTTTG TGTAAATCAA TCTGTTTATC TTCACTTCAA TGTTCATTGC ACCAGAAATG
  1501 GTTTTTNAAA AACATTCGAG TTACATGGTA TGGTCGTTCG TCTTTTAAGC AAGTTTCTGC
  1561 AGCCACTTTC GATATCCTAA TCCAATATAA GTGAAATTTC ATG

Predicted gene structure (within gDNA segment 32140 to 29356):

 Exon   1  31804 31472 ( 333 n);  cDNA    1  333 ( 333 n);  score: 0.976
 Intron 1  31471 31310 ( 162 n);  Pd: 0.993 (s: 1.00), Pa: 0.989 (s: 1.00)
 Exon   2  31309 31217 (  93 n);  cDNA  334  426 (  93 n);  score: 1.000
 Intron 2  31216 31116 ( 101 n);  Pd: 0.999 (s: 1.00), Pa: 0.685 (s: 1.00)
 Exon   3  31115 30882 ( 234 n);  cDNA  427  660 ( 234 n);  score: 0.996
 Intron 3  30881 30756 ( 126 n);  Pd: 0.607 (s: 1.00), Pa: 0.992 (s: 1.00)
 Exon   4  30755 30622 ( 134 n);  cDNA  661  794 ( 134 n);  score: 0.993
 Intron 4  30621 30551 (  71 n);  Pd: 0.000 (s: 1.00), Pa: 0.987 (s: 0.98)
 Exon   5  30550 30321 ( 230 n);  cDNA  795 1024 ( 230 n);  score: 0.996
 Intron 5  30320 30244 (  77 n);  Pd: 0.996 (s: 1.00), Pa: 0.961 (s: 1.00)
 Exon   6  30243 29665 ( 579 n);  cDNA 1025 1603 ( 579 n);  score: 0.991

MATCH 30373AAGCTTGACTGACACTTTTGCCTTGGGGTTGACCTGATTATAATTTTTTGTTAAGAGGGA- gi+ 0.990 1603 1.000 C
PGS_30373AAGCTTGACTGACACTTTTGCCTTGGGGTTGACCTGATTATAATTTTTTGTTAAGAGGGA-_gi+ (31804 31472,31309 31217,31115 30882,30755
30622,30550 30321,30243 29665) Alignment (genomic DNA sequence = upper lines): TTCATTAATG GCGACAATGA GCAAACCCCA TAAGTTAAAA GCCACTCCAG GATCTCAAAG 31745 | ||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAATTCTAG GCGACAATGA GCAAACCCCA TAAGTTAAAA GCCACTCCAG GATCTCAAAG 60 ACTTGTTCTG TTATGTATAG TCGCAGTTGC ATTTCTCCTC CTTTTCACTT CGGTGATCTC 31685 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTTGTTCTG TTATGTATAG TCGCAGTTGC ATTTCTCCTC CTTTTCACTT CGGTGATCTC 120 CACCGGCGGA TTGGCTTTAC CGTATCGGAC AACCCTAATT GGTTATTTTG TGAGGTCAAC 31625 |||||||||| |||||||||| || ||| ||| ||||||||| |||||||||| |||||||||| CACCGGCGGA TTGGCTTTAC CGCATCAGAC NACCCTAATT GGTTATTTTG TGAGGTCAAC 180 TCGAAACAAG ACACAGCATA GTTTGTCGGA CAAGTACTTG TACTGGGGAA ACAGAATCGA 31565 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGAAACAAG ACACAGCATA GTTTGTCGGA CAAGTACTTG TACTGGGGAA ACAGAATCGA 240 TTGTCCTGGT AAGAACTGTG AGACCTGTGC CGGTTTGGGT CACCAAGAAT CTAGCCTTAG 31505 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTCCTGGT AAGAACTGTG AGACCTGTGC CGGTTTGGGT CACCAAGAAT CTAGCCTTAG 300 ATGTGCCCTT GAAGAAGCCA TGTTTCTGAA CAGGTAACTT TTGCAAAATC CACCAATATG 31445 |||||||||| |||||||||| |||||||||| ||| ATGTGCCCTT GAAGAAGCCA TGTTTCTGAA CAG....... .......... .......... 333 TCAAACTTTT GAGATTTCTG ATGAAGTTCT TGAAATGGCA ATTAGCCTAT CTGGTTCAAT 31385 .......... .......... .......... .......... .......... .......... 333 GAGATGGTTA GTTGAATAAA TGATGCAGCT CTCTTGGTAA AGATGGTCAC TGTAATTGCA 31325 .......... .......... .......... .......... .......... .......... 333 ATTTTCTTGT TGCAGGACTT TTGTAATGCC ATCTCGGATG TGCATCAATC CAATACATAA 31265 ||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .....GACTT TTGTAATGCC ATCTCGGATG TGCATCAATC CAATACATAA 378 CAAGAAGGGT ATACTTAATC GATCCAACAA TGAAACTAGA GAGGAAAGGT GAGAGTTTCT 31205 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| CAAGAAGGGT ATACTTAATC GATCCAACAA TGAAACTAGA GAGGAAAG.. .......... 
426 TTAAACTACT TCAAAAACTT AGGTTTTGTT TTGTTTCCCA CTTTTGAACA CAAGAGGTCT 31145 .......... .......... .......... .......... .......... .......... 426 CTCTCTCTCT CTCCTTTGTA TGTTCTTAGT TGGGAAGTGA GCTCTTGTGC AATGGAATCA 31085 | |||||||||| |||||||||| |||||||||| .......... .......... .........T TGGGAAGTGA GCTCTTGTGC AATGGAATCA 457 TTGTATGATA TTGATCTCAT CTCTGAGAAA ATACCTGTGA TCTTGGATGA CTCGGAAACA 31025 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || TTGTATGATA TTGATCTCAT CTCTGAGAAA ATACCTGTGA TCTTGGATGA CTCGGAACCA 517 TGGCACATAA TGCTATCGAC GAGTATGAAA TTGAAAGAAC GTGGGAGTGC GCATGTATAT 30965 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGCACATAA TGCTATCGAC GAGTATGAAA TTGAAAGAAC GTGGGAGTGC GCATGTATAT 577 GGGGCAAACA GGCATGAGCT AAATGACTCT AGCGACTTTA CAAATCTTTT GCTCATTAAC 30905 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGGCAAACA GGCATGAGCT AAATGACTCT AGCGACTTTA CAAATCTTTT GCTCATTAAC 637 CGAACCGCAA GCCCCCTTGC ATGGTAAGCA TTGCTGAGGA AATTGTGTTT CCAATAGATT 30845 |||||||||| |||||||||| ||| CGAACCGCAA GCCCCCTTGC ATG....... .......... .......... .......... 660 GGGAAGTTGC TGCTCCTTGA GCATTATTTT CTTATCTTGT TTGTGTTAGG CATCATCATT 30785 .......... .......... .......... .......... .......... .......... 660 ATCAGATCAT ATATCTATCT TCATTGTAGG TTTGTTGAAT GCAAGGATCG AGGTAATCGT 30725 | |||||||||| |||||||||| |||||||||| .......... .......... .........G TTTGTTGAAT GCAAGGATCG AGGTAATCGT 691 AGCGACGTCA TGCTTCCTTA TTCATTTCTC CAGACTATGG CAGCATCAAG ATTGAGAGAT 30665 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| AGCGACGTCA TGCTTCCTTA TTCATTTCTC CNGACTATGG CAGCATCAAG ATTGAGAGAT 751 GCTGCAGAAA AGGTGAAGGA GTTGCACATA ATCTAGCTTT CAGGAAAGAG TGACCAGATG 30605 |||||||||| |||||||||| |||||||||| |||||||||| ||| GCTGCAGAAA AGGTGAAGGA GTTGCACATA ATCTAGCTTT CAG....... .......... 794 GTAATGTACT GTTGACGCCT TATAGCTTAT TTATGACCAG TTTTGGTTGA ACAGATAAAA 30545 |||||| .......... .......... .......... .......... .......... 
....ATAAAA 800 GCAAAACTTG GTGATTACGA TGCAATCCAT GTTCGTCGAG GTGACAAACT GAAAACAAGA 30485 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| GCAAAACTTG GTGATTACGA TGCNATCCAT GTTCGTCGAG GTGACAAACT GAAAACAAGA 860 AAAGACAGAT TTCGCGTGGA AAGAAGCCAG TTTCCACATT TAGATAGAGA CACACGGCCA 30425 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGACAGAT TTCGCGTGGA AAGAAGCCAG TTTCCACATT TAGATAGAGA CACACGGCCA 920 GAATTCATCA TTGGCAGAAT TCAGAAACAA ATCCCACCAG GACGGACTCT TTTTATCGGT 30365 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATTCATCA TTGGCAGAAT TCAGAAACAA ATCCCACCAG GACGGACTCT TTTTATCGGT 980 TCTAATGAAA GAACCCCTGA TTTCTTTTCA CCTCTAGCTA TCAGGTAAAA CCATAATGTT 30305 |||||||||| |||||||||| |||||||||| |||||||||| |||| TCTAATGAAA GAACCCCTGA TTTCTTTTCA CCTCTAGCTA TCAG...... .......... 1024 TAGATTCAAG TTTTTAGTTA TATATGTAGA ATGTTCGTGT AAGACTCTTG TTACACTGCA 30245 .......... .......... .......... .......... .......... .......... 1024 GATACAAAGT GGCGTATTCA TCGAATTTTA GTGAGATTTT GGATCCGATC ATCGAGAACA 30185 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .ATACAAAGT GGCGTATTCA TCGAATTTTA GTGAGATTTT GGATCCGATC ATCGAGAACA 1083 ATTACCAGTT GTTCATGGTG GAGAGGTTGA TAATGATGGG TGCAAAGACA TTCTTCAAAA 30125 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTACCAGTT GTTCATGGTG GAGAGGTTGA TAATGATGGG TGCAAAGACA TTCTTCAAAA 1143 CATTTAGAGA GTACGAAACC GATCTCACTT TGACTGATGA TCCGAAAAAG AACAAGAACT 30065 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTTAGAGA GTACGAAACC GATCTCACTT TGACTGATGA TCCGAAAAAG AACAAGAACT 1203 GGGAAATACC AGTTTACACC ATGGATGAAG GCAAAGAAGC AGCAAGCTAA ACTATCTATG 30005 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAAATACC AGTTTACACC ATGGATGAAG GCAAAGAAGC AGCAAGCTAA ACTATCTATG 1263 ATCTCACTAG GCACTAGCAA AGTCTACACA ACGATTCAGA CAAGAGTCTC ATTACTCTTC 29945 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | |||||||||| ATCTCACTAG GCACTAGCAA AGTCTACACA ACGATTCAGA 
CAAGAGTCNC ATTACTCTTC 1323

AAATACTTGG AGTTAAAAGC CTAATCTTAT CTGAAGATTC TATTGCATAG GAACTCTGTG 29885
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AAATACTTGG AGTTAAAAGC CTAATCTTAT CTGAAGATTC TATTGCATAG GAACTCTGTG 1383

TATGTGTAAC CAGAGTCTCT AACAGAGACT AACAGACTAG ATTTGCTACT TGGCTGACTC 29825
|||||||||| |||||||| | | |||||||| |||||||||| |||||||||| ||||||||||
TATGTGTAAC CAGAGTCTNT ANCAGAGACT AACAGACTAG ATTTGCTACT TGGCTGACTC 1443

ATTTTTGTGT AAATCAATCT GTTTATCTTC ACTTCAATGT TCATTGCACC AGAAATGGTT 29765
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
ATTTTTGTGT AAATCAATCT GTTTATCTTC ACTTCAATGT TCATTGCACC AGAAATGGTT 1503

TTTTAAAAAC ATTCGAGTTA CATGGTATGG TCGTTCGTCT TTTAAGCAAG TTTCTGCAGC 29705
||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TTTNAAAAAC ATTCGAGTTA CATGGTATGG TCGTTCGTCT TTTAAGCAAG TTTCTGCAGC 1563

CACTTTCGAT ATCCTAATCC AATATAAGTG AAATTTCATA 29665
|||||||||| |||||||||| |||||||||| |||||||||
CACTTTCGAT ATCCTAATCC AATATAAGTG AAATTTCATG 1603

Total number of EST alignments reported: 1
________________________________________________________________________________

Predicted gene locations (1):

PGL 1 (- strand): 31804 29665
AGS-1 (31804 31472,31309 31217,31115 30882,30755 30622,30550 30321,30243 29665)
SCR (e 0.976 d 0.993 a 0.989,e 1.000 d 0.999 a 0.685,e 0.996 d 0.607 a 0.992,e 0.993 d 0.000 a 0.987,e 0.996 d 0.996 a 0.961,e 0.991)

 Exon   1  31804 31472 ( 333 n);  score: 0.976
 Intron 1  31471 31310 ( 162 n);  Pd: 0.993 Pa: 0.989
 Exon   2  31309 31217 (  93 n);  score: 1.000
 Intron 2  31216 31116 ( 101 n);  Pd: 0.999 Pa: 0.685
 Exon   3  31115 30882 ( 234 n);  score: 0.996
 Intron 3  30881 30756 ( 126 n);  Pd: 0.607 Pa: 0.992
 Exon   4  30755 30622 ( 134 n);  score: 0.993
 Intron 4  30621 30551 (  71 n);  Pd: 0.000 Pa: 0.987
 Exon   5  30550 30321 ( 230 n);  score: 0.996
 Intron 5  30320 30244 (  77 n);  Pd: 0.996 Pa: 0.961
 Exon   6  30243 29665 ( 579 n);  score: 0.991

PGS (31804 31472,31309 31217,31115 30882,30755 30622,30550 30321,30243 29665) gi+

3-phase
translation of AGS-1 (-strand): . . . . . . 31804 TTCATTAATGGCGACAATGAGCAAACCCCATAAGTTAAAAGCCACTCCAGGATCTCAAAG F I N G D N E Q T P - V K S H S R I S K S L M A T M S K P H K L K A T P G S Q R H - W R Q - A N P I S - K P L Q D L K . . . . . . 31744 ACTTGTTCTGTTATGTATAGTCGCAGTTGCATTTCTCCTCCTTTTCACTTCGGTGATCTC T C S V M Y S R S C I S P P F H F G D L L V L L C I V A V A F L L L F T S V I S D L F C Y V - S Q L H F S S F S L R - S . . . . . . 31684 CACCGGCGGATTGGCTTTACCGTATCGGACAACCCTAATTGGTTATTTTGTGAGGTCAAC H R R I G F T V S D N P N W L F C E V N T G G L A L P Y R T T L I G Y F V R S T P P A D W L Y R I G Q P - L V I L - G Q . . . . . . 31624 TCGAAACAAGACACAGCATAGTTTGTCGGACAAGTACTTGTACTGGGGAAACAGAATCGA S K Q D T A - F V G Q V L V L G K Q N R R N K T Q H S L S D K Y L Y W G N R I D L E T R H S I V C R T S T C T G E T E S . . . . . . 31564 TTGTCCTGGTAAGAACTGTGAGACCTGTGCCGGTTTGGGTCACCAAGAATCTAGCCTTAG L S W - E L - D L C R F G S P R I - P - C P G K N C E T C A G L G H Q E S S L R I V L V R T V R P V P V W V T K N L A L . . . . : . . 31504 ATGTGCCCTTGAAGAAGCCATGTTTCTGAACAG : GACTTTTGTAATGCCATCTCGGATGTG M C P - R S H V S E Q : D F C N A I S D V C A L E E A M F L N R : T F V M P S R M C D V P L K K P C F - T : G L L - C H L G C . . . . . . 31282 CATCAATCCAATACATAACAAGAAGGGTATACTTAATCGATCCAACAATGAAACTAGAGA H Q S N T - Q E G Y T - S I Q Q - N - R I N P I H N K K G I L N R S N N E T R E A S I Q Y I T R R V Y L I D P T M K L E . : . . . . . 31222 GGAAAG : TTGGGAAGTGAGCTCTTGTGCAATGGAATCATTGTATGATATTGATCTCATCTC G K : L G S E L L C N G I I V - Y - S H L E S : W E V S S C A M E S L Y D I D L I S R K : V G K - A L V Q W N H C M I L I S S . . . . . . 31061 TGAGAAAATACCTGTGATCTTGGATGACTCGGAAACATGGCACATAATGCTATCGACGAG - E N T C D L G - L G N M A H N A I D E E K I P V I L D D S E T W H I M L S T S L R K Y L - S W M T R K H G T - C Y R R . . . . . . 
31001 TATGAAATTGAAAGAACGTGGGAGTGCGCATGTATATGGGGCAAACAGGCATGAGCTAAA Y E I E R T W E C A C I W G K Q A - A K M K L K E R G S A H V Y G A N R H E L N V - N - K N V G V R M Y M G Q T G M S - . . . . . . : 30941 TGACTCTAGCGACTTTACAAATCTTTTGCTCATTAACCGAACCGCAAGCCCCCTTGCATG : - L - R L Y K S F A H - P N R K P P C M : D S S D F T N L L L I N R T A S P L A W : M T L A T L Q I F C S L T E P Q A P L H : . . . . . . 30881 GTTTGTTGAATGCAAGGATCGAGGTAATCGTAGCGACGTCATGCTTCCTTATTCATTTCT V C - M Q G S R - S - R R H A S L F I S F V E C K D R G N R S D V M L P Y S F L G L L N A R I E V I V A T S C F L I H F . . . . . . 30695 CCAGACTATGGCAGCATCAAGATTGAGAGATGCTGCAGAAAAGGTGAAGGAGTTGCACAT P D Y G S I K I E R C C R K G E G V A H Q T M A A S R L R D A A E K V K E L H I S R L W Q H Q D - E M L Q K R - R S C T . . : . . . . 30635 AATCTAGCTTTCAG : ATAAAAGCAAAACTTGGTGATTACGATGCAATCCATGTTCGTCGAG N L A F R : - K Q N L V I T M Q S M F V E I - L S : D K S K T W - L R C N P C S S R - S S F Q : I K A K L G D Y D A I H V R R . . . . . . 30504 GTGACAAACTGAAAACAAGAAAAGACAGATTTCGCGTGGAAAGAAGCCAGTTTCCACATT V T N - K Q E K T D F A W K E A S F H I - Q T E N K K R Q I S R G K K P V S T F G D K L K T R K D R F R V E R S Q F P H . . . . . . 30444 TAGATAGAGACACACGGCCAGAATTCATCATTGGCAGAATTCAGAAACAAATCCCACCAG - I E T H G Q N S S L A E F R N K S H Q R - R H T A R I H H W Q N S E T N P T R L D R D T R P E F I I G R I Q K Q I P P . . . . . . 30384 GACGGACTCTTTTTATCGGTTCTAATGAAAGAACCCCTGATTTCTTTTCACCTCTAGCTA D G L F L S V L M K E P L I S F H L - L T D S F Y R F - - K N P - F L F T S S Y G R T L F I G S N E R T P D F F S P L A . : . . . . . 30324 TCAG : ATACAAAGTGGCGTATTCATCGAATTTTAGTGAGATTTTGGATCCGATCATCGAGA S : D T K W R I H R I L V R F W I R S S R Q : I Q S G V F I E F - - D F G S D H R E I R : Y K V A Y S S N F S E I L D P I I E . . . . . . 
30187 ACAATTACCAGTTGTTCATGGTGGAGAGGTTGATAATGATGGGTGCAAAGACATTCTTCA T I T S C S W W R G - - - W V Q R H S S Q L P V V H G G E V D N D G C K D I L Q N N Y Q L F M V E R L I M M G A K T F F . . . . . . 30127 AAACATTTAGAGAGTACGAAACCGATCTCACTTTGACTGATGATCCGAAAAAGAACAAGA K H L E S T K P I S L - L M I R K R T R N I - R V R N R S H F D - - S E K E Q E K T F R E Y E T D L T L T D D P K K N K . . . . . . 30067 ACTGGGAAATACCAGTTTACACCATGGATGAAGGCAAAGAAGCAGCAAGCTAAACTATCT T G K Y Q F T P W M K A K K Q Q A K L S L G N T S L H H G - R Q R S S K L N Y L N W E I P V Y T M D E G K E A A S - T I . . . . . . 30007 ATGATCTCACTAGGCACTAGCAAAGTCTACACAACGATTCAGACAAGAGTCTCATTACTC M I S L G T S K V Y T T I Q T R V S L L - S H - A L A K S T Q R F R Q E S H Y S Y D L T R H - Q S L H N D S D K S L I T . . . . . . 29947 TTCAAATACTTGGAGTTAAAAGCCTAATCTTATCTGAAGATTCTATTGCATAGGAACTCT F K Y L E L K A - S Y L K I L L H R N S S N T W S - K P N L I - R F Y C I G T L L Q I L G V K S L I L S E D S I A - E L . . . . . . 29887 GTGTATGTGTAACCAGAGTCTCTAACAGAGACTAACAGACTAGATTTGCTACTTGGCTGA V Y V - P E S L T E T N R L D L L L G - C M C N Q S L - Q R L T D - I C Y L A D C V C V T R V S N R D - Q T R F A T W L . . . . . . 29827 CTCATTTTTGTGTAAATCAATCTGTTTATCTTCACTTCAATGTTCATTGCACCAGAAATG L I F V - I N L F I F T S M F I A P E M S F L C K S I C L S S L Q C S L H Q K W T H F C V N Q S V Y L H F N V H C T R N . . . . . . 29767 GTTTTTTAAAAACATTCGAGTTACATGGTATGGTCGTTCGTCTTTTAAGCAAGTTTCTGC V F - K H S S Y M V W S F V F - A S F C F F K N I R V T W Y G R S S F K Q V S A G F L K T F E L H G M V V R L L S K F L . . . . . 
29707 AGCCACTTTCGATATCCTAATCCAATATAAGTGAAATTTCATA
S H F R Y P N P I - V K F H
A T F D I L I Q Y K - N F I
Q P L S I S - S N I S E I S

Maximal non-overlapping open reading frames (>= 64 codons):

>30373AAGCTTGACTGACACTTTTGCCTTGGGGTTGACCTGATTATAATTTTTTGTTAAGAGGGA-_PGL-1_AGS-1_PPS_1 (31803 31472,31309 31217,31115 30882,30755 30629) (frame '2'; 783 bp, 261 residues)
     1 SLMATMSKPH KLKATPGSQR LVLLCIVAVA FLLLFTSVIS TGGLALPYRT TLIGYFVRST
    61 RNKTQHSLSD KYLYWGNRID CPGKNCETCA GLGHQESSLR CALEEAMFLN RTFVMPSRMC
   121 INPIHNKKGI LNRSNNETRE ESWEVSSCAM ESLYDIDLIS EKIPVILDDS ETWHIMLSTS
   181 MKLKERGSAH VYGANRHELN DSSDFTNLLL INRTASPLAW FVECKDRGNR SDVMLPYSFL
   241 QTMAASRLRD AAEKVKELHI I-

>30373AAGCTTGACTGACACTTTTGCCTTGGGGTTGACCTGATTATAATTTTTTGTTAAGAGGGA-_PGL-1_AGS-1_PPS_2 (30633 30622,30550 30321,30243 30015) (frame '0'; 468 bp, 156 residues)
     1 SSFQIKAKLG DYDAIHVRRG DKLKTRKDRF RVERSQFPHL DRDTRPEFII GRIQKQIPPG
    61 RTLFIGSNER TPDFFSPLAI RYKVAYSSNF SEILDPIIEN NYQLFMVERL IMMGAKTFFK
   121 TFREYETDLT LTDDPKKNKN WEIPVYTMDE GKEAAS-