GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:38:44 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs: 1
Total sequence length: 485
Minimum sequence length: 485
Maximum sequence length: 485

Length distribution (number of sequences of specified length):
< 100: 0
< 200: 0
< 300: 0
< 400: 0
< 500: 1
< 600: 0
< 700: 0
< 800: 0
< 900: 0
< 1000: 0
>=1000: 0

________________________________________________________________________________
Sequence 1: 49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT, from 1 to 110292, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 1
Matches indexed, elapsed seconds = 1
HitsTableSize = 4

********************************************************************************
EST sequence 1 +strand (File: gi+)

  1 GAGAGTTATT ACAACTTGAA AAACATGGCA TCCAAGGCTT TGATTCTNTT GGGTCTCTTC
 61 GCAATTNTNC TGGTGGTCTC CGAAGTTTCT NCCGCAAGAC AGTCGGGCAT GGTGAAGCCA
121 GAGAGTGAGG CAACTNTGCA ACCTGAAGGT TATCACGGAG GACATGGTGG TCACGGAGGG
181 GGAGGCCACT ACGGAGGAGG AGGCCACGGG CATGGAGGAC ACAACGGAGG AGGGGGCCAC
241 GGACTTNACG GATACGGAGG AGGACATGGA GGACACTACG GGCTTAACGG ACCTNTTCAG
301 ACGAAACCGG GTGTTTAAAA GTTAAAACTA TANAATAAAT TCACCACCAG TCCACCATGC
361 ATAATTGCAT CTCTATATAC ACTTATGNCT TATAAGTATG CATCAAAATA AACCATGGTN
421 AGTTTTAATG CAGTTCCCTC AGAAATGTGT GGGTAATGTT TTTAANATAN TTGNTATCCC
481 CAGTT

Predicted gene structure (within gDNA segment 81686 to 80037):

Exon 1 81386 81281 ( 106 n); cDNA 1 106 ( 106 n); score: 0.962
Intron 1 81280 80944 ( 337 n); Pd: 0.999 (s: 0.94), Pa: 1.000 (s: 0.98)
Exon 2 80943 80769 ( 175 n); cDNA 107 281 ( 175 n); score: 0.989
Intron 2 80768 80679 ( 90 n); Pd: 0.000 (s: 0.98), Pa: 0.000 (s: 0.96)
Exon 3 80678 80469 (
210 n); cDNA 282 485 ( 204 n); score: 0.914 MATCH 49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT- gi+ 0.951 491 1.012 C PGS_49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT-_gi+ (81386 81281,80943 80769,80678 80469) Alignment (genomic DNA sequence = upper lines): GAGAGTTATT ACAACTTGAA AAACATGGCA TCCAAGGCTT TGATTCTGTT GGGTCTCTTC 81327 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || |||||||||| GAGAGTTATT ACAACTTGAA AAACATGGCA TCCAAGGCTT TGATTCTNTT GGGTCTCTTC 60 GCAATTCTTC TGGTGGTCTC CGAAGTTTCT GCCGCAAGAC AGTCGGGTAT GTAAAATTAT 81267 |||||| | | |||||||||| |||||||||| ||||||||| |||||| GCAATTNTNC TGGTGGTCTC CGAAGTTTCT NCCGCAAGAC AGTCGG.... .......... 106 TTTTTACATT TCAGTCTAGC TAATTGAATT TGAACTAATT AACGTACCAT TTGATAAATT 81207 .......... .......... .......... .......... .......... .......... 106 TATCCAACTC TTTCGTACTT CCATTTTTTT TACTTTAGTT AAAAGTCAGA ACACACTTCA 81147 .......... .......... .......... .......... .......... .......... 106 CCTAACCTAA GTGATGTGGA ATCCTTGTAG TTTTGTGTAA ATATGTTTCG AACGTATATT 81087 .......... .......... .......... .......... .......... .......... 106 TTTAAAAACT ATAGTTCGAG TTTATGTACT TATTTATAAT TGAAAAAACA TTTTAAGGTC 81027 .......... .......... .......... .......... .......... .......... 106 GAAAACAAAG TTGAAGTCAG GAAGAAGAAT TCACATTTGA ACGTGATTAG AATGTCTTTG 80967 .......... .......... .......... .......... .......... .......... 106 AAATAACATG TGGTTTTGGG CAGGCATGGT GAAGCCAGAG AGTGAGGCAA CTGTGCAACC 80907 ||||||| |||||||||| |||||||||| || ||||||| .......... .......... 
...GCATGGT GAAGCCAGAG AGTGAGGCAA CTNTGCAACC 143 TGAAGGTTAT CACGGAGGAC ATGGTGGTCA CGGAGGGGGA GGCCACTACG GAGGAGGAGG 80847 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGGTTAT CACGGAGGAC ATGGTGGTCA CGGAGGGGGA GGCCACTACG GAGGAGGAGG 203 CCACGGGCAT GGAGGACACA ACGGAGGAGG GGGCCACGGA CTTGACGGAT ACGGAGGAGG 80787 |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| |||||||||| CCACGGGCAT GGAGGACACA ACGGAGGAGG GGGCCACGGA CTTNACGGAT ACGGAGGAGG 263 ACATGGAGGA CACTACGGAG GAGGAGGAGG ACACTACGGA GGAGGAGGAG GCCACGGTGG 80727 |||||||||| |||||||| ACATGGAGGA CACTACGG.. .......... .......... .......... .......... 281 TGGTGGACAC TACGGAGGTG GAGGACACCA TGGAGGAGGA GGCCACGGGC TTAACGAACC 80667 || |||||| ||| .......... .......... .......... .......... ........GC TTAACGGACC 293 TGTTCAGACG AAACCGGGTG TTTAAAAGTT AAAACTATAA AATAAATTCA CCACCAGTCC 80607 | |||||||| |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| TNTTCAGACG AAACCGGGTG TTTAAAAGTT AAAACTATAN AATAAATTCA CCACCAGTCC 353 ACCATGCATA ATTGCATCTC TATATACACT TATGTCTTAT AAGTATGCAT CAAAATAAAC 80547 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| |||||||||| ACCATGCATA ATTGCATCTC TATATACACT TATGNCTTAT AAGTATGCAT CAAAATAAAC 413 CATGGTGAGT TTGTAATGCA GTTCCTTCAG AAATGTGTGG AATAATGTTT TATAATAATA 80487 |||||| ||| || ||||||| ||||| |||| |||||||||| |||||||| | ||| ||| CATGGTNAGT TT-TAATGCA GTTCCCTCAG AAATGTGTGG -GTAATGTTT T-TAA-NATA 469 ATAGAATATC TCTCAGTT 80469 | | |||| | ||||| NTTG-NTATC -CCCAGTT 485 ******************************************************************************** EST sequence 2 +strand (File: gi+) 1 GAGAGTTATT ACAACTTGAA AAACATGGCA TCCAAGGCTT TGATTCTNTT GGGTCTCTTC 61 GCAATTNTNC TGGTGGTCTC CGAAGTTTCT NCCGCAAGAC AGTCGGGCAT GGTGAAGCCA 121 GAGAGTGAGG CAACTNTGCA ACCTGAAGGT TATCACGGAG GACATGGTGG TCACGGAGGG 181 GGAGGCCACT ACGGAGGAGG AGGCCACGGG CATGGAGGAC ACAACGGAGG AGGGGGCCAC 241 GGACTTNACG GATACGGAGG AGGACATGGA GGACACTACG GGCTTAACGG ACCTNTTCAG 301 ACGAAACCGG GTGTTTAAAA GTTAAAACTA TANAATAAAT TCACCACCAG 
TCCACCATGC 361 ATAATTGCAT CTCTATATAC ACTTATGNCT TATAAGTATG CATCAAAATA AACCATGGTN 421 AGTTTTAATG CAGTTCCCTC AGAAATGTGT GGGTAATGTT TTTAANATAN TTGNTATCCC 481 CAGTT Predicted gene structure (within gDNA segment 104421 to 100491): Exon 1 104059 103954 ( 106 n); cDNA 1 106 ( 106 n); score: 0.849 Intron 1 103953 103596 ( 358 n); Pd: 0.981 (s: 0.80), Pa: 1.000 (s: 0.92) Exon 2 103595 103446 ( 150 n); cDNA 107 262 ( 156 n); score: 0.807 Intron 2 103445 103332 ( 114 n); Pd: 0.001 (s: 0.98), Pa: 0.001 (s: n/a) Exon 3 103331 103323 ( 9 n); cDNA 263 271 ( 9 n); score: 0.889 Intron 3 103322 103260 ( 63 n); Pd: 0.001 (s: n/a), Pa: 0.001 (s: 0.82) Exon 4 103259 103209 ( 51 n); cDNA 272 322 ( 51 n); score: 0.824 Intron 4 103208 102569 ( 640 n); Pd: 0.000 (s: 0.82), Pa: 0.001 (s: n/a) Exon 5 102568 102559 ( 10 n); cDNA 323 332 ( 10 n); score: 0.800 Intron 5 102558 102267 ( 292 n); Pd: 0.901 (s: n/a), Pa: 0.999 (s: n/a) Exon 6 102266 102245 ( 22 n); cDNA 333 352 ( 20 n); score: 0.409 Intron 6 102244 101604 ( 641 n); Pd: 0.001 (s: n/a), Pa: 0.001 (s: 0.76) Exon 7 101603 101472 ( 132 n); cDNA 353 485 ( 133 n); score: 0.689 MATCH 49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT- gi+ 0.784 439 0.905 C PGS_49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT-_gi+ (104059 103954,103595 103446,103331 103323,103259 103209,102568 102559,102266 102245,101603 101472) Alignment (genomic DNA sequence = upper lines): GAGAGTTATT AGAACTTGCA AAAAATGGCT TCCAAGGCTT TGATTCTGTT AGGTCTCTTC 104000 |||||||||| | |||||| | ||| ||||| |||||||||| ||||||| || ||||||||| GAGAGTTATT ACAACTTGAA AAACATGGCA TCCAAGGCTT TGATTCTNTT GGGTCTCTTC 60 TCAGTTCTTC TCGTCGTCTC CGAAGTGTCT GCCGCAAGGC AATCGGGTAC GTAAAATATT 103940 || || | | | || ||||| |||||| ||| ||||||| | | |||| GCAATTNTNC TGGTGGTCTC CGAAGTTTCT NCCGCAAGAC AGTCGG.... .......... 106 ATACATTTCA GCCTGCTTAC AAAAAACGAA GATATGTTCG TGAGGCTAGC TATTCAAACT 103880 .......... .......... .......... .......... .......... .......... 
106 TGAACTAACG TACCATTTGA TAAATTTATA CAACTATTTC GCCAGTTCCA TTTTATGGAC 103820 .......... .......... .......... .......... .......... .......... 106 TTACAATATA GTAAAGTCAG AACACACTTC ACTTGAGTGA TGTGGAATCC TAGGAAGTTT 103760 .......... .......... .......... .......... .......... .......... 106 TGTGTAAATT ATGTTTCGAA CGTATATTTA AAAACTATAG TTTCGAGTTT ATATATGTAC 103700 .......... .......... .......... .......... .......... .......... 106 TTTAAAAAAA AAATTGAGGT GGAAAACTAA ATTGAAGTCA GGAAGAGGGC TTCACATTTT 103640 .......... .......... .......... .......... .......... .......... 106 CACGTGCTTA AGAATGTCTT TGAAATAACG TGTTTTTTGG GCAGGCATGG TGAAGCCAGA 103580 |||||| |||||||||| .......... .......... .......... .......... ....GCATGG TGAAGCCAGA 122 GAGTGAGGAA ACTGTGCAAC CTGAAGGTTA TGGCGGTGGC CACGGAGG-- AC--A--TGG 103526 |||||||| | ||| |||||| |||||||||| | ||| || || || || || | || GAGTGAGGCA ACTNTGCAAC CTGAAGGTTA TCACGGAGGA CATGGTGGTC ACGGAGGGGG 182 TGGTCACGGA GGGGGAGGAG GCCACGGACA TGGAGGACAC AACGGAGGAG GGGGCCACGG 103466 || ||| || ||||||| ||||||| || |||||||||| |||||||||| |||||||||| AGGCCACTAC GGAGGAGGAG GCCACGGGCA TGGAGGACAC AACGGAGGAG GGGGCCACGG 242 ACTTGACGGA TACGGAGGAG GTGGAGGACA CTATGGAGGA GGTGGAGGAC ACTACGGAGG 103406 |||| ||||| |||||||||| ACTTNACGGA TACGGAGGAG .......... .......... .......... .......... 262 AGGTGGAGGA CACTACGGAG GAGGTGGAGG ACACTACGGA GGAGGTGGAG GACACTACGG 103346 .......... .......... .......... .......... .......... .......... 262 AGGAGGTGGT GGAGGACACG GAGGTGGAGG ACACTACGGA GGTGGTGGAG GAGGATACGG 103286 |||| | ||| .......... ....GACATG GAG....... .......... .......... .......... 271 AGGTGGAGGA GGACACCACG GAGGAGGAGG CCACGGGCTA AACGAACCTG TTCAGACTAA 103226 || | ||||||| |||| |||| ||||||| || .......... .......... ......GACA CTACGGGCTT AACGGACCTN TTCAGACGAA 305 GCCGGGTGTT TAAAACTATA TAATATCTTC ACTACCATGC ATGATTGCAT ATATATATAT 103166 ||||||||| ||||| | ACCGGGTGTT TAAAAGT... .......... .......... .......... .......... 
322 ACGCTTATGT ATTATCTATA TGCCTATAAA TAAACCATGG TGAGTTTGTA ACGCAGTGCC 103106 .......... .......... .......... .......... .......... .......... 322 TTCAGAAATG TTCGGAATAA ATTTCCATAA TATTAGTATA ATGTCTCTCT GTTTGAATTA 103046 .......... .......... .......... .......... .......... .......... 322 TAAACTGCGC TGTTTGCATA ATAAAATCTC TTGTAGCTAG GTCATGTTAC TCTCTTTCAG 102986 .......... .......... .......... .......... .......... .......... 322 TTTTTCTTTG TAACAGTATT ATATCCTTAT TCATATTGTT AGGGAATATT TTTCTTAAAA 102926 .......... .......... .......... .......... .......... .......... 322 GATTACCAAA AGCCTGCAGG AATAAAAAAA AATGATTACT AAAAACCTAG AGACTATCGA 102866 .......... .......... .......... .......... .......... .......... 322 GGTTTTGCAT ACACCAACAC TTACGAAGTT CATTAAATAT ATGAAACCGA TACGAGAGAT 102806 .......... .......... .......... .......... .......... .......... 322 GGGATCATCA GAGAAGGGAG ATAAAGACAG TAGAAGCTTC TTAATTGTCG CAGTTATGTT 102746 .......... .......... .......... .......... .......... .......... 322 CTGATCTTTT CCTTATAGAA TTAGACTTTT AATATTTTCG TTTTTAGTTT TAACTTACGT 102686 .......... .......... .......... .......... .......... .......... 322 CGGTTGTAAG AGATCTATAT ATAATCATCT TCCAAACATT ACCCTTATTT TCAAGGCTTT 102626 .......... .......... .......... .......... .......... .......... 322 GCTTCTGTTG GGTCTCTTCG CAGTTCTTCT CGTCATCTCC GAAGTGTCTG CGGCAAGTAA 102566 ||| .......... .......... .......... .......... .......... .......TAA 325 AATTATTGTA TGCATTTCAA TCTACTTTAA AGAAACAAGA TATATGATAA AAATATTTTA 102506 || ||| AACTATA... .......... .......... .......... .......... .......... 332 TACATTTCAA TCATCCAACA CTAAAGTACC ATTTGATAAA TTTACATCAC TTAAGTGAAG 102446 .......... .......... .......... .......... .......... .......... 332 TGTGGAATTC TAGTACCATT TTAGAAGTTG TGTGCATATT ATGTTTAGAA CGTATATTTA 102386 .......... .......... .......... .......... .......... .......... 332 ATTTTGTAAA CCTTTTTTGA AAAAAGCATT AATATACTTT CAAATCTTTA TTGAACGCAC 102326 .......... .......... .......... .......... 
.......... .......... 332 ATTTAACATT TTGCAAGTAC ATGCTTAACA ATATCTTTGA AATAATGTTT TATTTTTAGG 102266 .......... .......... .......... .......... .......... .........N 333 TAGGCATGGT GACACCAGAG AGTGATGAAA CTGTGTAACC TGAAGGTTAT GGCGGTGGCC 102206 | || |||||| AATAAAT-TC ACCACCAG-T C......... .......... .......... .......... 352 ACGGAGGACA AGGTGGTCAC GACCATGGAG GACACAACGG AGGAGGGGGC CACAGACAGA 102146 .......... .......... .......... .......... .......... .......... 352 CCCGGTCCAT AAGGGCCATT GAAGCAATAG ATTTAGGCCT CAATATTTAA TGGCTTTTTA 102086 .......... .......... .......... .......... .......... .......... 352 CTTTAAATTT ATTTTGAGTC CTTTACCATA AATATATTTT TAGAGCCTTT TAAGATTTTG 102026 .......... .......... .......... .......... .......... .......... 352 TTTAGGCATT TAGATACAAA CTAATTAGTA TCTCTAATAG TGAAATTTGA TTTATTGTGA 101966 .......... .......... .......... .......... .......... .......... 352 AATGTTTTTT CAAAATTGGA CTTTTCAACA TTTTTTGGGA TTTATTTTGT TGTTTAAGTG 101906 .......... .......... .......... .......... .......... .......... 352 TATTTTTGAA ATTTGTTTTG TTGTTAAAGT GTATTTTTAG AGGTTTAATG TATATTTTAT 101846 .......... .......... .......... .......... .......... .......... 352 TATTATTTAT GATTTTTCAT AATGTATAAA AAGGACTGTT TTTAAGCTTT GCTTAAGGTA 101786 .......... .......... .......... .......... .......... .......... 352 TCAAGATATG TTTGATCAGC ACTGGCCACA GACTTGACGC ATATGGAGGA GTGGAGGACG 101726 .......... .......... .......... .......... .......... .......... 352 TGGACCATGT GGTGGTCACG GTGGTGGAGG TGGCTACGGA GGAGGATACG GAGTTGGAGG 101666 .......... .......... .......... .......... .......... .......... 352 ACACATGATT AACGAACCAG TTCGGACTAA ACCGGGTGTT TAAAACTATA TAATATATTC 101606 .......... .......... .......... .......... .......... .......... 
352 AGCACCATCC ATGATTGCAT CAATATATGC AATTATTTAT TATTTTTATG CATATAAAT- 101547 |||||| | || ||||||| | ||||| | | |||| | ||| |||| ||| |||| ..CACCATGC ATAATTGCAT CTCTATATAC ACTTATGNCT TATAAGTATG CATCAAAATA 410 AACCATGGTG AGTTTGTAAT GCAGTGCTTT CAAAAATGTT TGGAATAAAG TTTCACAATA 101487 ||||||||| ||||| |||| ||||| | | || |||||| ||| | ||| | ||| AACCATGGTN AGTTT-TAAT GCAGTTCCCT CAGAAATGTG TGGGTAATGT TTTTAANATA 469 C-TAATTTCC GCTGCT 101472 | | ||| | | | NTTGNTATCC CCAGTT 485 Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (2): PGL 1 (- strand): 81386 80469 AGS-1 (81386 81281,80943 80769,80678 80469) SCR (e 0.962 d 0.999 a 1.000,e 0.989 d 0.000 a 0.000,e 0.914) Exon 1 81386 81281 ( 106 n); score: 0.962 Intron 1 81280 80944 ( 337 n); Pd: 0.999 Pa: 1.000 Exon 2 80943 80769 ( 175 n); score: 0.989 Intron 2 80768 80679 ( 90 n); Pd: 0.000 Pa: 0.000 Exon 3 80678 80469 ( 210 n); score: 0.914 PGS (81386 81281,80943 80769,80678 80469) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 81386 GAGAGTTATTACAACTTGAAAAACATGGCATCCAAGGCTTTGATTCTGTTGGGTCTCTTC E S Y Y N L K N M A S K A L I L L G L F R V I T T - K T W H P R L - F C W V S S E L L Q L E K H G I Q G F D S V G S L . . . . . : . 81326 GCAATTCTTCTGGTGGTCTCCGAAGTTTCTGCCGCAAGACAGTCGG : GCATGGTGAAGCCA A I L L V V S E V S A A R Q S : G M V K P Q F F W W S P K F L P Q D S R : A W - S Q R N S S G G L R S F C R K T V G : H G E A . . . . . . 80929 GAGAGTGAGGCAACTGTGCAACCTGAAGGTTATCACGGAGGACATGGTGGTCACGGAGGG E S E A T V Q P E G Y H G G H G G H G G R V R Q L C N L K V I T E D M V V T E G R E - G N C A T - R L S R R T W W S R R . . . . . . 80869 GGAGGCCACTACGGAGGAGGAGGCCACGGGCATGGAGGACACAACGGAGGAGGGGGCCAC G G H Y G G G G H G H G G H N G G G G H E A T T E E E A T G M E D T T E E G A T G R P L R R R R P R A W R T Q R R R G P . . . . . : . 
80809 GGACTTGACGGATACGGAGGAGGACATGGAGGACACTACGG : GCTTAACGAACCTGTTCAG G L D G Y G G G H G G H Y G : L N E P V Q D L T D T E E D M E D T T : G L T N L F R R T - R I R R R T W R T L R : A - R T C S . . . . . . 80659 ACGAAACCGGGTGTTTAAAAGTTAAAACTATAAAATAAATTCACCACCAGTCCACCATGC T K P G V - K L K L - N K F T T S P P C R N R V F K S - N Y K I N S P P V H H A D E T G C L K V K T I K - I H H Q S T M . . . . . . 80599 ATAATTGCATCTCTATATACACTTATGTCTTATAAGTATGCATCAAAATAAACCATGGTG I I A S L Y T L M S Y K Y A S K - T M V - L H L Y I H L C L I S M H Q N K P W - H N C I S I Y T Y V L - V C I K I N H G . . . . . . 80539 AGTTTGTAATGCAGTTCCTTCAGAAATGTGTGGAATAATGTTTTATAATAATAATAGAAT S L - C S S F R N V W N N V L - - - - N V C N A V P S E M C G I M F Y N N N R I E F V M Q F L Q K C V E - C F I I I I E . . 80479 ATCTCTCAGTT I S Q S L S Y L S V Maximal non-overlapping open reading frames (>= 64 codons): >49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT-_PGL-1_AGS-1_PPS_1 (81386 81281,80943 80769,80678 80642) (frame '1'; 315 bp, 105 residues) 1 ESYYNLKNMA SKALILLGLF AILLVVSEVS AARQSGMVKP ESEATVQPEG YHGGHGGHGG 61 GGHYGGGGHG HGGHNGGGGH GLDGYGGGHG GHYGLNEPVQ TKPGV- PGL 2 (- strand): 104059 102245 AGS-1 (104059 103954,103595 103446,103331 103323,103259 103209,102568 102559,102266 102245) SCR (e 0.849 d 0.981 a 1.000,e 0.807 d 0.001 a 0.001,e 0.889 d 0.001 a 0.001,e 0.824 d 0.000 a 0.001,e 0.800 d 0.901 a 0.999,e 0.409) Exon 1 104059 103954 ( 106 n); score: 0.849 Intron 1 103953 103596 ( 358 n); Pd: 0.981 Pa: 1.000 Exon 2 103595 103446 ( 150 n); score: 0.807 Intron 2 103445 103332 ( 114 n); Pd: 0.001 Pa: 0.001 Exon 3 103331 103323 ( 9 n); score: 0.889 Intron 3 103322 103260 ( 63 n); Pd: 0.001 Pa: 0.001 Exon 4 103259 103209 ( 51 n); score: 0.824 Intron 4 103208 102569 ( 640 n); Pd: 0.000 Pa: 0.001 Exon 5 102568 102559 ( 10 n); score: 0.800 Intron 5 102558 102267 ( 292 n); Pd: 0.901 Pa: 0.999 Exon 6 102266 102245 ( 22 n); score: 0.409 PGS (104059 103954,103595 103446,103331 
103323,103259 103209,102568 102559,102266 102245) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 104059 GAGAGTTATTAGAACTTGCAAAAAATGGCTTCCAAGGCTTTGATTCTGTTAGGTCTCTTC E S Y - N L Q K M A S K A L I L L G L F R V I R T C K K W L P R L - F C - V S S E L L E L A K N G F Q G F D S V R S L . . . . . : . 103999 TCAGTTCTTCTCGTCGTCTCCGAAGTGTCTGCCGCAAGGCAATCGG : GCATGGTGAAGCCA S V L L V V S E V S A A R Q S : G M V K P Q F F S S S P K C L P Q G N R : A W - S Q L S S S R R L R S V C R K A I G : H G E A . . . . . . 103581 GAGAGTGAGGAAACTGTGCAACCTGAAGGTTATGGCGGTGGCCACGGAGGACATGGTGGT E S E E T V Q P E G Y G G G H G G H G G R V R K L C N L K V M A V A T E D M V V R E - G N C A T - R L W R W P R R T W W . . . . . . 103521 CACGGAGGGGGAGGAGGCCACGGACATGGAGGACACAACGGAGGAGGGGGCCACGGACTT H G G G G G H G H G G H N G G G G H G L T E G E E A T D M E D T T E E G A T D L S R R G R R P R T W R T Q R R R G P R T . . : . : . . . 103461 GACGGATACGGAGGAG : GACACGGAG : GAGGCCACGGGCTAAACGAACCTGTTCAGACTAAG D G Y G G : G H G : G G H G L N E P V Q T K T D T E E : D T E : E A T G - T N L F R L S - R I R R R : T R R : R P R A K R T C S D - . . : . : . . 103224 CCGGGTGTTTAAAACT : TAAAATTATT : GTAGGCATGGTGACACCAGAGA P G V - N : L K L L : - A W - H Q R R V F K T : - N Y : C R H G D T R A G C L K L : K I I : V G M V T P E Maximal non-overlapping open reading frames (>= 64 codons): >49209AAGCTTGAAGATATAGTGTACTCAGGTAATGCATGAGATCATTAATCCAACCTTATTCGT-_PGL-2_AGS-1_PPS_1 (104047 103954,103595 103446,103331 103323,103259 103213) (frame '1'; 297 bp, 99 residues) 1 NLQKMASKAL ILLGLFSVLL VVSEVSAARQ SGMVKPESEE TVQPEGYGGG HGGHGGHGGG 61 GGHGHGGHNG GGGHGLDGYG GGHGGGHGLN EPVQTKPGV-