GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:36:45 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 2388 Minimum sequence length: 2388 Maximum sequence length: 2388 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 0 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 1 ________________________________________________________________________________ Sequence 1: 37829GAATTCAACAACATCATGATAAACTTTTCAATCCACAACCAAGCCAAAATCACCATCATC, from 1 to 101944, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 ATGGTGAGTA CTCAACAACG CACGGACGAT GACTCTTCTC AACCGGTAAA AGCTTCTCTT 61 AAGAGCTATG GGATCACGGA GCCACTGTCT ATTGCTGGAC CTTCTGCTGC TGATGTTAAG 121 CGTAATTTGG AACTAGAGAA GTTTCTGGTT GATGAGGGGC TCTACGAGAG CAAGGAAGAA 181 ACTATGCGGA GAGAGGAAGT TGTGGTTCGC ATTGATCAGA TTGTAAAACA CTGGGTGAAA 241 CAGTTAACTC GTCAGAGGGG CTATACTGAT CAGATGGTGG AGGATGCAAA TGCTGTCATT 301 TTCACTTTTG GATCTTACCG CCTTGGAGTT CACGGACCTA TGGCTGATAT TGATACTTTG 361 TGTGTTGGCC CATCTTATGT TAACCGAGAG GAGGATTTCT TCATTTTCTT CCGTGATATA 421 TTGGCTGAAA TGGAAGAAGT GACTGAACTT CAACCTGTTA CTGATGCCCA TGTCCCAGTC 481 ATGAAATTTA AGTTCCAAGG AATATCAATT GATCTTCTGT ATGCTAGCAT ATCGCTTCTA 541 GTTATCCCAC AGGTAGATAT CTCCAACTCT TCTGTGCTGT GTGACGTGGA TGAACAAACT 601 GTCCGCAGTC TTAATGGTTG TAGGGTTGCT GATCAGATTC TTAAACTTGT TCCAAATTCC 661 GAGATGCCTG AAGTACTGGG TAAGAAGCGT GGGGTCTATT CAAATGTAGT TACTGGATTT 721 CTTGGTGGTG TAAACTGGGC ACTTCTGGTT GCACGCCTTT GCCAGTTTTA TCCAAATGCT 781 ATTCCTAGTA TGTTGGTTTC TCGATTTTTT AGAGTATATA CACAATGGCG CTGGCCGAAT 841 CCAGTCATGC TTTGTGCAAT AGAAGAAGAT GACCTTAGCT TTCCTGTTTG GGACCCACGA 901 AAAAATCATC GTGACCGCTA TCATCTTATG CCAATAATAA CTCCTGCATA CCCATGCATG 961 AATTCTAGTT ACAATGTCTC TCAAAGCACT CTTCGTGTTA TGACAGAGCA ATTCCAGTTT 1021 GGCAACACGA TCTGTCAGAT GCAGGAGATT GAGTTAAATA AACAACACTG GAGTTCCTTA 1081 TTTCAGCAAT ATATGTTCTT CGAGGCATAT AAAAACTACC TTCAGGTTGA TGTACTAGCT 1141 GCAGATGCCG AAGATTTATT GGCATGGAAA GGTTGGGTGG AGTCACGGTT CAGGCAACTG 1201 ACCTTGAAGA TAGAACGAGA CACAAATGGG ATGTTAATGT GCCACCCTCA ACCAAACGAG 1261 TATGTAGACA CTTCGAAGCA GTTTCGACAT TGTGCCTTTT TCATGGGCTT GCAGAGGGCA 1321 GATGGATTTG GTGGCCAAGA ATGTCAACAG TTTGATATAC GTGGAACAGT GGACGAATTC 1381 AGGCAAGAGG TAAACATGTA TATGTTTTGG AGACCTGGGA TGGATGTGCA TGTTTCTCAT 1441 GTTCGAAGAC GGCAGCTTCC ATCTTTTGTT TTTCCAAATG GATATAAAAG GTCTCGGCAA 1501 TCAAGGCACC AGAGTCAACA ATGCAGAGAA CCTGGTGATG AGGGCGTTGG TTCTTTATCC 1561 GACTCTGTTG AGAGATATGC GAAGAGAAAG AACGATGATG AAATTATGAA TTCCAGGCCA 1621 GAGAAACGTG AGAAGCGCGC ATCTTGTAGT CTACATACTC TGGATGCAGC TTCTCCTGAC 1681 AGCAGTGGTA TCACTACTAG TGGGACTCCT CAGATTGGCA TTGTTCCAGG TCCTAGAGCT 1741 GAATGCTTAG TAACTGGTGA TCTTGTTTGC AATGTTACAA GTCTTCCTAA CGTGGAAGTT 1801 GAGGCTGAAA AGTTTATCAG TAAAATCACG GAACTAAGAA AATTCTCTCA GTACGAGCAT 1861 ACCTCTGGTA GCGAGCAAAT CCTGGAAGTA GATAGTAGGG CTCTAGTTCA AAGTTATCAT 1921 GACCTGGCTG AGCCTGTAGC AAAACATGTG AGACCTGACC TTAGTGCTTT GCTAGCGTGT 1981 GAAGGTGGGC AGAATAAAGA AATAGGTCAT GATATGGGCT CTGAATCTAT TAATGACACT 2041 GACACGCAAC ATCTTCCAAG GCGACTAAAT GTAAATGAAG ATGTTGATGA AGTTGAGAGG 2101 GAAGCCAAGT TGGGAGAAAT TGCTGGTGGT GTTTTGTGGA ATGGACACTG TGGGCGGAAC 2161 CTTGACCATG AGGGTTTTGT GACTCCTGCA AATTTGGATT CAGCTGTGGA AAATAGAAAC 2221 TTGCATTCAG ACGGATTGAA AAAAAGTGGC TTGCCAGAAG AACTTCAGTC AAATTCTTTG 2281 CTCAGCGGGA CGGGGAAGCT GGACGATGGA GCTAGGTCAG AATCTTTGCA AAATGAAATG 2341 ATGAGGCATG TGTTTTTGCA ACCCATTATT GGTTTATGCA AATCATGA Predicted gene structure (within gDNA segment 45172 to 50094): Exon 1 45472 45612 ( 141 n); cDNA 1 141 ( 141 n); score: 1.000 Intron 1 45613 45708 ( 96 n); Pd: 0.988 (s: 1.00), Pa: 0.592 (s: 1.00) Exon 2 45709 45786 ( 78 n); cDNA 142 219 ( 78 n); score: 1.000 Intron 2 45787 46161 ( 375 n); Pd: 0.984 (s: 1.00), Pa: 0.941 (s: 1.00) Exon 3 46162 46269 ( 108 n); cDNA 220 327 ( 108 n); score: 1.000 Intron 3 46270 46463 ( 194 n); Pd: 0.921 (s: 1.00), Pa: 1.000 (s: 1.00) Exon 4 46464 46526 ( 63 n); cDNA 328 390 ( 63 n); score: 1.000 Intron 4 46527 46601 ( 75 n); Pd: 0.903 (s: 1.00), Pa: 0.282 (s: 1.00) Exon 5 46602 46763 ( 162 n); cDNA 391 552 ( 162 n); score: 1.000 Intron 5 46764 47101 ( 338 n); Pd: 0.961 (s: 1.00), Pa: 0.969 (s: 0.92) Exon 6 47102 47215 ( 114 n); cDNA 553 663 ( 111 n); score: 0.965 Intron 6 47216 47316 ( 101 n); Pd: 0.961 (s: 1.00), Pa: 0.001 (s: 0.98) Exon 7 47317 47362 ( 46 n); cDNA 664 708 ( 45 n); score: 0.978 Intron 7 47363 47442 ( 80 n); Pd: 0.000 (s: 0.98), Pa: 0.993 (s: 1.00) Exon 8 47443 47772 ( 330 n); cDNA 709 1038 ( 330 n); score: 1.000 Intron 8 47773 47844 ( 72 n); Pd: 0.843 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 9 47845 48015 ( 171 n); cDNA 1039 1209 ( 171 n); score: 1.000 Intron 9 48016 48150 ( 135 n); Pd: 0.715 (s: 1.00), Pa: 0.982 (s: 1.00) Exon 10 48151 49113 ( 963 n); cDNA 1210 2172 ( 963 n); score: 1.000 Intron 10 49114 49196 ( 83 n); Pd: 0.845 (s: 1.00), Pa: 0.872 (s: 1.00) Exon 11 49197 49292 ( 96 n); cDNA 2173 2268 ( 96 n); score: 0.969 Intron 11 49293 49669 ( 377 n); Pd: 0.976 (s: 0.94), Pa: 0.999 (s: 1.00) Exon 12 49670 49789 ( 120 n); cDNA 2269 2388 ( 120 n); score: 1.000 MATCH 37829GAATTCAACAACATCATGATAAACTTTTCAATCCACAACCAAGCCAAAATCACCATCATC+ gi+ 0.997 2346 0.982 C PGS_37829GAATTCAACAACATCATGATAAACTTTTCAATCCACAACCAAGCCAAAATCACCATCATC+_gi+ (45472 45612,45709 45786,46162 46269,46464 46526,46602 46763,47102 47215,47317 47362,47443 47772,47845 48015,48151 49113,49197 49292,49670 49789) Alignment (genomic DNA sequence = upper lines): ATGGTGAGTA CTCAACAACG CACGGACGAT GACTCTTCTC AACCGGTAAA AGCTTCTCTT 45531 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGTGAGTA CTCAACAACG CACGGACGAT GACTCTTCTC AACCGGTAAA AGCTTCTCTT 60 AAGAGCTATG GGATCACGGA GCCACTGTCT ATTGCTGGAC CTTCTGCTGC TGATGTTAAG 45591 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAGCTATG GGATCACGGA GCCACTGTCT ATTGCTGGAC CTTCTGCTGC TGATGTTAAG 120 CGTAATTTGG AACTAGAGAA GGTTAGCGTT GTGTTTGTTT GTTTTTTTGT TCCAATGAAA 45651 |||||||||| |||||||||| | CGTAATTTGG AACTAGAGAA G......... .......... .......... .......... 141 AATTGCTTTC AGGAACAATG TGATATTGAC TTGCGTTTTG GCTGATTTAT TGTGAAGTTT 45711 ||| .......... .......... .......... .......... .......... .......TTT 144 CTGGTTGATG AGGGGCTCTA CGAGAGCAAG GAAGAAACTA TGCGGAGAGA GGAAGTTGTG 45771 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGGTTGATG AGGGGCTCTA CGAGAGCAAG GAAGAAACTA TGCGGAGAGA GGAAGTTGTG 204 GTTCGCATTG ATCAGGTGAT TAAGAGCGTC TGGTTTATTG CTTTCTCATG ATGATAATGC 45831 |||||||||| ||||| GTTCGCATTG ATCAG..... .......... .......... .......... .......... 219 TTAGACTGGT TCTCACATTT TCTTTTTATA GTGTTGTAGA CCAACGATTT CTATGCCGTA 45891 .......... .......... .......... .......... .......... .......... 219 ATTATAAGAT GATTGTAGCA ATATGTAGTC TAGTCCTTCT ATTAATCCTA TTGGCTCATT 45951 .......... .......... .......... .......... .......... .......... 219 TGCCATGTGA ACACAAGTGA CTTATAGGCG CTAATCATTT GTTTGATAGT TTACTCTGCT 46011 .......... .......... .......... .......... .......... .......... 219 TTACTGAAAT GTAGATTTTG GGTTAGCTGC TTTCTCTTCG ATGTCAAAGA GGCTCACAGT 46071 .......... .......... .......... .......... .......... .......... 219 GGTGTTTACT GAAATGCATA ATGGTCAAGT TTTGTTTACA GTACTAACTA TCTAATCTAA 46131 .......... .......... .......... .......... .......... .......... 219 CACATCTTCT TCCTTAAACT TCTTGTATAG ATTGTAAAAC ACTGGGTGAA ACAGTTAACT 46191 |||||||||| |||||||||| |||||||||| .......... .......... .......... ATTGTAAAAC ACTGGGTGAA ACAGTTAACT 249 CGTCAGAGGG GCTATACTGA TCAGATGGTG GAGGATGCAA ATGCTGTCAT TTTCACTTTT 46251 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTCAGAGGG GCTATACTGA TCAGATGGTG GAGGATGCAA ATGCTGTCAT TTTCACTTTT 309 GGATCTTACC GCCTTGGAGT AAGTTCTTTT TAAGTTTAAA GTTAAAAGAT CAACTGCATC 46311 |||||||||| |||||||| GGATCTTACC GCCTTGGA.. .......... .......... .......... .......... 327 TAATCAGTTT GGGGAAAGGT TTTACTTGGA TCATATATTT TCTATTGAGC CTTGTGAGTA 46371 .......... .......... .......... .......... .......... .......... 327 TCTTGAATTA TCACTGAAAT GAAGAGAAGT AGCATTGTAT TCGTCTCTAC ATTACTAGTT 46431 .......... .......... .......... .......... .......... .......... 327 GAGGGTGAAA TTTTTATTTT GTTGGTTTGC AGGTTCACGG ACCTATGGCT GATATTGATA 46491 |||||||| |||||||||| |||||||||| .......... .......... .......... ..GTTCACGG ACCTATGGCT GATATTGATA 355 CTTTGTGTGT TGGCCCATCT TATGTTAACC GAGAGGTAAC TATTGAGTTT CGTCTTACTG 46551 |||||||||| |||||||||| |||||||||| ||||| CTTTGTGTGT TGGCCCATCT TATGTTAACC GAGAG..... .......... .......... 390 ATGTAAAGGT CTGGTGGGTT AGTTTAGTTG CTAACAGGTT ATTTTTACAG GAGGATTTCT 46611 |||||||||| .......... .......... .......... .......... .......... GAGGATTTCT 400 TCATTTTCTT CCGTGATATA TTGGCTGAAA TGGAAGAAGT GACTGAACTT CAACCTGTTA 46671 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATTTTCTT CCGTGATATA TTGGCTGAAA TGGAAGAAGT GACTGAACTT CAACCTGTTA 460 CTGATGCCCA TGTCCCAGTC ATGAAATTTA AGTTCCAAGG AATATCAATT GATCTTCTGT 46731 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGATGCCCA TGTCCCAGTC ATGAAATTTA AGTTCCAAGG AATATCAATT GATCTTCTGT 520 ATGCTAGCAT ATCGCTTCTA GTTATCCCAC AGGTAACATC AATATAGCCT TTTTCAGTTT 46791 |||||||||| |||||||||| |||||||||| || ATGCTAGCAT ATCGCTTCTA GTTATCCCAC AG........ .......... .......... 552 CTGTTGAAAG TTAGGGGGTG GTTCATGATA CCTGTATGTT TATCTTGGAT TTCATTTGGT 46851 .......... .......... .......... .......... .......... .......... 552 GGGTTTGGTG CGTGTATTGA TTGCGTGTTG ATGGTCACAT AGTATTATTA TACACTTCCT 46911 .......... .......... .......... .......... .......... .......... 552 CCAAGGGTTG AGGTGTCAAA GAAACTTCTT ATAAGAAATT GTTAGCCTTA GTGAACATGG 46971 .......... .......... .......... .......... .......... .......... 552 AGATGTTGAA CTGCCGTGTT AGCACCATCA GGCTTGATTT ATATATGAGT ACTGAGATGA 47031 .......... .......... .......... .......... .......... .......... 552 GTTCTCGACA AAGTTTATAA GTTCTTGCTG AGGCAGATCC CCTTTTTTTC TGGAATCAAT 47091 .......... .......... .......... .......... .......... .......... 552 TTATTAACAG GATCTGGATA TCTCCAACTC TTCTGTGCTG TGTGACGTGG ATGAACAAAC 47151 | | |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... G---TAGATA TCTCCAACTC TTCTGTGCTG TGTGACGTGG ATGAACAAAC 599 TGTCCGCAGT CTTAATGGTT GTAGGGTTGC TGATCAGATT CTTAAACTTG TTCCAAATTC 47211 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTCCGCAGT CTTAATGGTT GTAGGGTTGC TGATCAGATT CTTAAACTTG TTCCAAATTC 659 CGAGGTAAAG TCGTTGAGTG CCAGTTGTTT ATGTTATAAA CTAAGGAATC ATGTAAGCCA 47271 |||| CGAG...... .......... .......... .......... .......... .......... 663 TCTCTGATGT GATGTGCAAA TGTAGCACTT CCGGACAACA TTAAGATGCC TGAAGTACTG 47331 ||||| |||||||||| .......... .......... .......... .......... .....ATGCC TGAAGTACTG 678 GGCTAAGAAG CGTGGGGTCT ATTCAAATGT AAGGTGGTGT CTTTCTGAAT ATTACATCTG 47391 || ||||||| |||||||||| |||||||||| | GG-TAAGAAG CGTGGGGTCT ATTCAAATGT A......... .......... .......... 708 TTGACAATAT TCTTAGGTTA AAGCTCTTGT AATCTCTGTT ATATTGAGCA GGTTACTGGA 47451 ||||||||| .......... .......... .......... .......... .......... .GTTACTGGA 717 TTTCTTGGTG GTGTAAACTG GGCACTTCTG GTTGCACGCC TTTGCCAGTT TTATCCAAAT 47511 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCTTGGTG GTGTAAACTG GGCACTTCTG GTTGCACGCC TTTGCCAGTT TTATCCAAAT 777 GCTATTCCTA GTATGTTGGT TTCTCGATTT TTTAGAGTAT ATACACAATG GCGCTGGCCG 47571 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTATTCCTA GTATGTTGGT TTCTCGATTT TTTAGAGTAT ATACACAATG GCGCTGGCCG 837 AATCCAGTCA TGCTTTGTGC AATAGAAGAA GATGACCTTA GCTTTCCTGT TTGGGACCCA 47631 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCCAGTCA TGCTTTGTGC AATAGAAGAA GATGACCTTA GCTTTCCTGT TTGGGACCCA 897 CGAAAAAATC ATCGTGACCG CTATCATCTT ATGCCAATAA TAACTCCTGC ATACCCATGC 47691 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAAAAAATC ATCGTGACCG CTATCATCTT ATGCCAATAA TAACTCCTGC ATACCCATGC 957 ATGAATTCTA GTTACAATGT CTCTCAAAGC ACTCTTCGTG TTATGACAGA GCAATTCCAG 47751 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGAATTCTA GTTACAATGT CTCTCAAAGC ACTCTTCGTG TTATGACAGA GCAATTCCAG 1017 TTTGGCAACA CGATCTGTCA GGTTAGGAAG CATAACACCC GATTTTGTTT ATGAAAAGTT 47811 |||||||||| |||||||||| | TTTGGCAACA CGATCTGTCA G......... .......... .......... .......... 1038 TACCCAGGAT CCTGTTGCTA ACTAATTTTT TATATGCAGG AGATTGAGTT AAATAAACAA 47871 ||||||| |||||||||| |||||||||| .......... .......... .......... ...ATGCAGG AGATTGAGTT AAATAAACAA 1065 CACTGGAGTT CCTTATTTCA GCAATATATG TTCTTCGAGG CATATAAAAA CTACCTTCAG 47931 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTGGAGTT CCTTATTTCA GCAATATATG TTCTTCGAGG CATATAAAAA CTACCTTCAG 1125 GTTGATGTAC TAGCTGCAGA TGCCGAAGAT TTATTGGCAT GGAAAGGTTG GGTGGAGTCA 47991 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTGATGTAC TAGCTGCAGA TGCCGAAGAT TTATTGGCAT GGAAAGGTTG GGTGGAGTCA 1185 CGGTTCAGGC AACTGACCTT GAAGGTAGAT AGCCTTATGT TCGTTTAATG TTGTGGAGTA 48051 |||||||||| |||||||||| |||| CGGTTCAGGC AACTGACCTT GAAG...... .......... .......... .......... 1209 CACCTTCCGT GTCCTCAGTG TGTTCTCGAA TTTTTTACTG GGAGAATAAA CAATGATTTA 48111 .......... .......... .......... .......... .......... .......... 1209 AGGATTCTTC GTCTTCTCTT TTTATATCCA ATAATGCAGA TAGAACGAGA CACAAATGGG 48171 | |||||||||| |||||||||| .......... .......... .......... .........A TAGAACGAGA CACAAATGGG 1230 ATGTTAATGT GCCACCCTCA ACCAAACGAG TATGTAGACA CTTCGAAGCA GTTTCGACAT 48231 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGTTAATGT GCCACCCTCA ACCAAACGAG TATGTAGACA CTTCGAAGCA GTTTCGACAT 1290 TGTGCCTTTT TCATGGGCTT GCAGAGGGCA GATGGATTTG GTGGCCAAGA ATGTCAACAG 48291 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGCCTTTT TCATGGGCTT GCAGAGGGCA GATGGATTTG GTGGCCAAGA ATGTCAACAG 1350 TTTGATATAC GTGGAACAGT GGACGAATTC AGGCAAGAGG TAAACATGTA TATGTTTTGG 48351 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGATATAC GTGGAACAGT GGACGAATTC AGGCAAGAGG TAAACATGTA TATGTTTTGG 1410 AGACCTGGGA TGGATGTGCA TGTTTCTCAT GTTCGAAGAC GGCAGCTTCC ATCTTTTGTT 48411 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGACCTGGGA TGGATGTGCA TGTTTCTCAT GTTCGAAGAC GGCAGCTTCC ATCTTTTGTT 1470 TTTCCAAATG GATATAAAAG GTCTCGGCAA TCAAGGCACC AGAGTCAACA ATGCAGAGAA 48471 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCCAAATG GATATAAAAG GTCTCGGCAA TCAAGGCACC AGAGTCAACA ATGCAGAGAA 1530 CCTGGTGATG AGGGCGTTGG TTCTTTATCC GACTCTGTTG AGAGATATGC GAAGAGAAAG 48531 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTGGTGATG AGGGCGTTGG TTCTTTATCC GACTCTGTTG AGAGATATGC GAAGAGAAAG 1590 AACGATGATG AAATTATGAA TTCCAGGCCA GAGAAACGTG AGAAGCGCGC ATCTTGTAGT 48591 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACGATGATG AAATTATGAA TTCCAGGCCA GAGAAACGTG AGAAGCGCGC ATCTTGTAGT 1650 CTACATACTC TGGATGCAGC TTCTCCTGAC AGCAGTGGTA TCACTACTAG TGGGACTCCT 48651 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTACATACTC TGGATGCAGC TTCTCCTGAC AGCAGTGGTA TCACTACTAG TGGGACTCCT 1710 CAGATTGGCA TTGTTCCAGG TCCTAGAGCT GAATGCTTAG TAACTGGTGA TCTTGTTTGC 48711 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGATTGGCA TTGTTCCAGG TCCTAGAGCT GAATGCTTAG TAACTGGTGA TCTTGTTTGC 1770 AATGTTACAA GTCTTCCTAA CGTGGAAGTT GAGGCTGAAA AGTTTATCAG TAAAATCACG 48771 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGTTACAA GTCTTCCTAA CGTGGAAGTT GAGGCTGAAA AGTTTATCAG TAAAATCACG 1830 GAACTAAGAA AATTCTCTCA GTACGAGCAT ACCTCTGGTA GCGAGCAAAT CCTGGAAGTA 48831 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAACTAAGAA AATTCTCTCA GTACGAGCAT ACCTCTGGTA GCGAGCAAAT CCTGGAAGTA 1890 GATAGTAGGG CTCTAGTTCA AAGTTATCAT GACCTGGCTG AGCCTGTAGC AAAACATGTG 48891 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATAGTAGGG CTCTAGTTCA AAGTTATCAT GACCTGGCTG AGCCTGTAGC AAAACATGTG 1950 AGACCTGACC TTAGTGCTTT GCTAGCGTGT GAAGGTGGGC AGAATAAAGA AATAGGTCAT 48951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGACCTGACC TTAGTGCTTT GCTAGCGTGT GAAGGTGGGC AGAATAAAGA AATAGGTCAT 2010 GATATGGGCT CTGAATCTAT TAATGACACT GACACGCAAC ATCTTCCAAG GCGACTAAAT 49011 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATATGGGCT CTGAATCTAT TAATGACACT GACACGCAAC ATCTTCCAAG GCGACTAAAT 2070 GTAAATGAAG ATGTTGATGA AGTTGAGAGG GAAGCCAAGT TGGGAGAAAT TGCTGGTGGT 49071 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAAATGAAG ATGTTGATGA AGTTGAGAGG GAAGCCAAGT TGGGAGAAAT TGCTGGTGGT 2130 GTTTTGTGGA ATGGACACTG TGGGCGGAAC CTTGACCATG AGGTGAGTTT TTGCTGTTCC 49131 |||||||||| |||||||||| |||||||||| |||||||||| || GTTTTGTGGA ATGGACACTG TGGGCGGAAC CTTGACCATG AG........ .......... 2172 ATGACTATGG ATACTGGGGG CGGAACCTTA ATAGTACTGG GTATAGTGAT CATGGGTTGT 49191 .......... .......... .......... .......... .......... .......... 2172 TACAGGGTTT TGTGACTCCT GCAAATTTGG ATTCAGCTGT GGAAAATAGA AACTTGCATT 49251 ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .....GGTTT TGTGACTCCT GCAAATTTGG ATTCAGCTGT GGAAAATAGA AACTTGCATT 2227 CAGACGGATT GTTCAAAAGT GGCTTGCCAG AAGAACTTCA GGTTTTTATT CTAACTCATG 49311 |||||||||| | |||||| |||||||||| |||||||||| | CAGACGGATT GAAAAAAAGT GGCTTGCCAG AAGAACTTCA G......... .......... 2268 TTATTTGGCA TTAAATTCAT GCGTCTGTAG TTTTTTTTGG CGTTTGGTTA CTGGTTTGAG 49371 .......... .......... .......... .......... .......... .......... 2268 ATTTATTAAT TATGATGCCA ACGATTTATA CTTAAACATT TGTGACTTGT GAGTGACACA 49431 .......... .......... .......... .......... .......... .......... 2268 CCCACTGCAT GACATGTTGA AGGGTGCTCT TGTGCTTAAT TTTTAGAATC AAGAAATACG 49491 .......... .......... .......... .......... .......... .......... 2268 AGGTAACAAA CTTAAATTAG AATGTTGTGC TGCTTAAGTG ATCAATTTTA TGCTTTTTTT 49551 .......... .......... .......... .......... .......... .......... 2268 CTGCCATGAT TTTATTCTGT ATTATTGCAT ACTTGTACTG TCTTTTGAAA CTAGGTTCTT 49611 .......... .......... .......... .......... .......... .......... 2268 TTCTCTTCCA AGATCATTTC TGACTTCTCT CCTTCTATAT TGTATTTTTG TGTCTTAGTC 49671 || .......... .......... .......... .......... .......... ........TC 2270 AAATTCTTTG CTCAGCGGGA CGGGGAAGCT GGACGATGGA GCTAGGTCAG AATCTTTGCA 49731 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATTCTTTG CTCAGCGGGA CGGGGAAGCT GGACGATGGA GCTAGGTCAG AATCTTTGCA 2330 AAATGAAATG ATGAGGCATG TGTTTTTGCA ACCCATTATT GGTTTATGCA AATCATGA 49789 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AAATGAAATG ATGAGGCATG TGTTTTTGCA ACCCATTATT GGTTTATGCA AATCATGA 2388 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (+ strand): 45472 49789 AGS-1 (45472 45612,45709 45786,46162 46269,46464 46526,46602 46763,47102 47215,47317 47362,47443 47772,47845 48015,48151 49113,49197 49292,49670 49789) SCR (e 1.000 d 0.988 a 0.592,e 1.000 d 0.984 a 0.941,e 1.000 d 0.921 a 1.000,e 1.000 d 0.903 a 0.282,e 1.000 d 0.961 a 0.969,e 0.965 d 0.961 a 0.001,e 0.978 d 0.000 a 0.993,e 1.000 d 0.843 a 0.000,e 1.000 d 0.715 a 0.982,e 1.000 d 0.845 a 0.872,e 0.969 d 0.976 a 0.999,e 1.000) Exon 1 45472 45612 ( 141 n); score: 1.000 Intron 1 45613 45708 ( 96 n); Pd: 0.988 Pa: 0.592 Exon 2 45709 45786 ( 78 n); score: 1.000 Intron 2 45787 46161 ( 375 n); Pd: 0.984 Pa: 0.941 Exon 3 46162 46269 ( 108 n); score: 1.000 Intron 3 46270 46463 ( 194 n); Pd: 0.921 Pa: 1.000 Exon 4 46464 46526 ( 63 n); score: 1.000 Intron 4 46527 46601 ( 75 n); Pd: 0.903 Pa: 0.282 Exon 5 46602 46763 ( 162 n); score: 1.000 Intron 5 46764 47101 ( 338 n); Pd: 0.961 Pa: 0.969 Exon 6 47102 47215 ( 114 n); score: 0.965 Intron 6 47216 47316 ( 101 n); Pd: 0.961 Pa: 0.001 Exon 7 47317 47362 ( 46 n); score: 0.978 Intron 7 47363 47442 ( 80 n); Pd: 0.000 Pa: 0.993 Exon 8 47443 47772 ( 330 n); score: 1.000 Intron 8 47773 47844 ( 72 n); Pd: 0.843 Pa: 0.000 Exon 9 47845 48015 ( 171 n); score: 1.000 Intron 9 48016 48150 ( 135 n); Pd: 0.715 Pa: 0.982 Exon 10 48151 49113 ( 963 n); score: 1.000 Intron 10 49114 49196 ( 83 n); Pd: 0.845 Pa: 0.872 Exon 11 49197 49292 ( 96 n); score: 0.969 Intron 11 49293 49669 ( 377 n); Pd: 0.976 Pa: 0.999 Exon 12 49670 49789 ( 120 n); score: 1.000 PGS (45472 45612,45709 45786,46162 46269,46464 46526,46602 46763,47102 47215,47317 47362,47443 47772,47845 48015,48151 49113,49197 49292,49670 49789) gi+ 3-phase translation of AGS-1 (+strand): . . . . . . 45472 ATGGTGAGTACTCAACAACGCACGGACGATGACTCTTCTCAACCGGTAAAAGCTTCTCTT M V S T Q Q R T D D D S S Q P V K A S L W - V L N N A R T M T L L N R - K L L L G E Y S T T H G R - L F S T G K S F S . . . . . . 45532 AAGAGCTATGGGATCACGGAGCCACTGTCTATTGCTGGACCTTCTGCTGCTGATGTTAAG K S Y G I T E P L S I A G P S A A D V K R A M G S R S H C L L L D L L L L M L S - E L W D H G A T V Y C W T F C C - C - . . . : . . . 45592 CGTAATTTGGAACTAGAGAAG : TTTCTGGTTGATGAGGGGCTCTACGAGAGCAAGGAAGAA R N L E L E K : F L V D E G L Y E S K E E V I W N - R S : F W L M R G S T R A R K K A - F G T R E : V S G - - G A L R E Q G R . . . . : . . 45748 ACTATGCGGAGAGAGGAAGTTGTGGTTCGCATTGATCAG : ATTGTAAAACACTGGGTGAAA T M R R E E V V V R I D Q : I V K H W V K L C G E R K L W F A L I R : L - N T G - N N Y A E R G S C G S H - S : D C K T L G E . . . . . . 46183 CAGTTAACTCGTCAGAGGGGCTATACTGATCAGATGGTGGAGGATGCAAATGCTGTCATT Q L T R Q R G Y T D Q M V E D A N A V I S - L V R G A I L I R W W R M Q M L S F T V N S S E G L Y - S D G G G C K C C H . . . : . . . 46243 TTCACTTTTGGATCTTACCGCCTTGGA : GTTCACGGACCTATGGCTGATATTGATACTTTG F T F G S Y R L G : V H G P M A D I D T L S L L D L T A L E : F T D L W L I L I L C F H F W I L P P W : S S R T Y G - Y - Y F . . . : . . . 46497 TGTGTTGGCCCATCTTATGTTAACCGAGAG : GAGGATTTCTTCATTTTCTTCCGTGATATA C V G P S Y V N R E : E D F F I F F R D I V L A H L M L T E R : R I S S F S S V I Y V C W P I L C - P R : G G F L H F L P - Y . . . . . . 46632 TTGGCTGAAATGGAAGAAGTGACTGAACTTCAACCTGTTACTGATGCCCATGTCCCAGTC L A E M E E V T E L Q P V T D A H V P V W L K W K K - L N F N L L L M P M S Q S I G - N G R S D - T S T C Y - C P C P S . . . . . . 46692 ATGAAATTTAAGTTCCAAGGAATATCAATTGATCTTCTGTATGCTAGCATATCGCTTCTA M K F K F Q G I S I D L L Y A S I S L L - N L S S K E Y Q L I F C M L A Y R F - H E I - V P R N I N - S S V C - H I A S . . : . . . . 46752 GTTATCCCACAG : GATCTGGATATCTCCAACTCTTCTGTGCTGTGTGACGTGGATGAACAA V I P Q : D L D I S N S S V L C D V D E Q L S H R : I W I S P T L L C C V T W M N K S Y P T : G S G Y L Q L F C A V - R G - T . . . . . . 47150 ACTGTCCGCAGTCTTAATGGTTGTAGGGTTGCTGATCAGATTCTTAAACTTGTTCCAAAT T V R S L N G C R V A D Q I L K L V P N L S A V L M V V G L L I R F L N L F Q I N C P Q S - W L - G C - S D S - T C S K . : . . . . . : 47210 TCCGAG : ATGCCTGAAGTACTGGGCTAAGAAGCGTGGGGTCTATTCAAATGTA : GTTACTGG S E : M P E V L G - E A W G L F K C : S Y W P R : C L K Y W A K K R G V Y S N V : V T G F R : D A - S T G L R S V G S I Q M - : L L . . . . . . 47451 ATTTCTTGGTGGTGTAAACTGGGCACTTCTGGTTGCACGCCTTTGCCAGTTTTATCCAAA I S W W C K L G T S G C T P L P V L S K F L G G V N W A L L V A R L C Q F Y P N D F L V V - T G H F W L H A F A S F I Q . . . . . . 47511 TGCTATTCCTAGTATGTTGGTTTCTCGATTTTTTAGAGTATATACACAATGGCGCTGGCC C Y S - Y V G F S I F - S I Y T M A L A A I P S M L V S R F F R V Y T Q W R W P M L F L V C W F L D F L E Y I H N G A G . . . . . . 47571 GAATCCAGTCATGCTTTGTGCAATAGAAGAAGATGACCTTAGCTTTCCTGTTTGGGACCC E S S H A L C N R R R - P - L S C L G P N P V M L C A I E E D D L S F P V W D P R I Q S C F V Q - K K M T L A F L F G T . . . . . . 47631 ACGAAAAAATCATCGTGACCGCTATCATCTTATGCCAATAATAACTCCTGCATACCCATG T K K S S - P L S S Y A N N N S C I P M R K N H R D R Y H L M P I I T P A Y P C H E K I I V T A I I L C Q - - L L H T H . . . . . . 47691 CATGAATTCTAGTTACAATGTCTCTCAAAGCACTCTTCGTGTTATGACAGAGCAATTCCA H E F - L Q C L S K H S S C Y D R A I P M N S S Y N V S Q S T L R V M T E Q F Q A - I L V T M S L K A L F V L - Q S N S . . . : . . . 47751 GTTTGGCAACACGATCTGTCAG : ATGCAGGAGATTGAGTTAAATAAACAACACTGGAGTTC V W Q H D L S : D A G D - V K - T T L E F F G N T I C Q : M Q E I E L N K Q H W S S S L A T R S V R : C R R L S - I N N T G V . . . . . . 47883 CTTATTTCAGCAATATATGTTCTTCGAGGCATATAAAAACTACCTTCAGGTTGATGTACT L I S A I Y V L R G I - K L P S G - C T L F Q Q Y M F F E A Y K N Y L Q V D V L P Y F S N I C S S R H I K T T F R L M Y . . . . . . 47943 AGCTGCAGATGCCGAAGATTTATTGGCATGGAAAGGTTGGGTGGAGTCACGGTTCAGGCA S C R C R R F I G M E R L G G V T V Q A A A D A E D L L A W K G W V E S R F R Q - L Q M P K I Y W H G K V G W S H G S G . . : . . . . 48003 ACTGACCTTGAAG : ATAGAACGAGACACAAATGGGATGTTAATGTGCCACCCTCAACCAAA T D L E : D R T R H K W D V N V P P S T K L T L K : I E R D T N G M L M C H P Q P N N - P - R : - N E T Q M G C - C A T L N Q . . . . . . 48198 CGAGTATGTAGACACTTCGAAGCAGTTTCGACATTGTGCCTTTTTCATGGGCTTGCAGAG R V C R H F E A V S T L C L F H G L A E E Y V D T S K Q F R H C A F F M G L Q R T S M - T L R S S F D I V P F S W A C R . . . . . . 48258 GGCAGATGGATTTGGTGGCCAAGAATGTCAACAGTTTGATATACGTGGAACAGTGGACGA G R W I W W P R M S T V - Y T W N S G R A D G F G G Q E C Q Q F D I R G T V D E G Q M D L V A K N V N S L I Y V E Q W T . . . . . . 48318 ATTCAGGCAAGAGGTAAACATGTATATGTTTTGGAGACCTGGGATGGATGTGCATGTTTC I Q A R G K H V Y V L E T W D G C A C F F R Q E V N M Y M F W R P G M D V H V S N S G K R - T C I C F G D L G W M C M F . . . . . . 48378 TCATGTTCGAAGACGGCAGCTTCCATCTTTTGTTTTTCCAAATGGATATAAAAGGTCTCG S C S K T A A S I F C F S K W I - K V S H V R R R Q L P S F V F P N G Y K R S R L M F E D G S F H L L F F Q M D I K G L . . . . . . 48438 GCAATCAAGGCACCAGAGTCAACAATGCAGAGAACCTGGTGATGAGGGCGTTGGTTCTTT A I K A P E S T M Q R T W - - G R W F F Q S R H Q S Q Q C R E P G D E G V G S L G N Q G T R V N N A E N L V M R A L V L . . . . . . 48498 ATCCGACTCTGTTGAGAGATATGCGAAGAGAAAGAACGATGATGAAATTATGAATTCCAG I R L C - E I C E E K E R - - N Y E F Q S D S V E R Y A K R K N D D E I M N S R Y P T L L R D M R R E R T M M K L - I P . . . . . . 48558 GCCAGAGAAACGTGAGAAGCGCGCATCTTGTAGTCTACATACTCTGGATGCAGCTTCTCC A R E T - E A R I L - S T Y S G C S F S P E K R E K R A S C S L H T L D A A S P G Q R N V R S A H L V V Y I L W M Q L L . . . . . . 48618 TGACAGCAGTGGTATCACTACTAGTGGGACTCCTCAGATTGGCATTGTTCCAGGTCCTAG - Q Q W Y H Y - W D S S D W H C S R S - D S S G I T T S G T P Q I G I V P G P R L T A V V S L L V G L L R L A L F Q V L . . . . . . 48678 AGCTGAATGCTTAGTAACTGGTGATCTTGTTTGCAATGTTACAAGTCTTCCTAACGTGGA S - M L S N W - S C L Q C Y K S S - R G A E C L V T G D L V C N V T S L P N V E E L N A - - L V I L F A M L Q V F L T W . . . . . . 48738 AGTTGAGGCTGAAAAGTTTATCAGTAAAATCACGGAACTAAGAAAATTCTCTCAGTACGA S - G - K V Y Q - N H G T K K I L S V R V E A E K F I S K I T E L R K F S Q Y E K L R L K S L S V K S R N - E N S L S T . . . . . . 48798 GCATACCTCTGGTAGCGAGCAAATCCTGGAAGTAGATAGTAGGGCTCTAGTTCAAAGTTA A Y L W - R A N P G S R - - G S S S K L H T S G S E Q I L E V D S R A L V Q S Y S I P L V A S K S W K - I V G L - F K V . . . . . . 48858 TCATGACCTGGCTGAGCCTGTAGCAAAACATGTGAGACCTGACCTTAGTGCTTTGCTAGC S - P G - A C S K T C E T - P - C F A S H D L A E P V A K H V R P D L S A L L A I M T W L S L - Q N M - D L T L V L C - . . . . . . 48918 GTGTGAAGGTGGGCAGAATAAAGAAATAGGTCATGATATGGGCTCTGAATCTATTAATGA V - R W A E - R N R S - Y G L - I Y - - C E G G Q N K E I G H D M G S E S I N D R V K V G R I K K - V M I W A L N L L M . . . . . . 48978 CACTGACACGCAACATCTTCCAAGGCGACTAAATGTAAATGAAGATGTTGATGAAGTTGA H - H A T S S K A T K C K - R C - - S - T D T Q H L P R R L N V N E D V D E V E T L T R N I F Q G D - M - M K M L M K L . . . . . . 49038 GAGGGAAGCCAAGTTGGGAGAAATTGCTGGTGGTGTTTTGTGGAATGGACACTGTGGGCG E G S Q V G R N C W W C F V E W T L W A R E A K L G E I A G G V L W N G H C G R R G K P S W E K L L V V F C G M D T V G . . : . . . . 49098 GAACCTTGACCATGAG : GGTTTTGTGACTCCTGCAAATTTGGATTCAGCTGTGGAAAATAG E P - P - : G F C D S C K F G F S C G K - N L D H E : G F V T P A N L D S A V E N R G T L T M R : V L - L L Q I W I Q L W K I . . . . . . : 49241 AAACTTGCATTCAGACGGATTGTTCAAAAGTGGCTTGCCAGAAGAACTTCAG : TCAAATTC K L A F R R I V Q K W L A R R T S : V K F N L H S D G L F K S G L P E E L Q : S N S E T C I Q T D C S K V A C Q K N F S : Q I . . . . . . 49678 TTTGCTCAGCGGGACGGGGAAGCTGGACGATGGAGCTAGGTCAGAATCTTTGCAAAATGA F A Q R D G E A G R W S - V R I F A K - L L S G T G K L D D G A R S E S L Q N E L C S A G R G S W T M E L G Q N L C K M . . . . . . 49738 AATGATGAGGCATGTGTTTTTGCAACCCATTATTGGTTTATGCAAATCATGA N D E A C V F A T H Y W F M Q I M M M R H V F L Q P I I G L C K S - K - - G M C F C N P L L V Y A N H Maximal non-overlapping open reading frames (>= 64 codons): >37829GAATTCAACAACATCATGATAAACTTTTCAATCCACAACCAAGCCAAAATCACCATCATC+_PGL-1_AGS-1_PPS_1 (46753 46763,47102 47215,47317 47362,47443 47772,47845 48015,48151 49113,49197 49292,49670 49789) (frame '2'; 1848 bp, 616 residues) 1 LSHRIWISPT LLCCVTWMNK LSAVLMVVGL LIRFLNLFQI PRCLKYWAKK RGVYSNVVTG 61 FLGGVNWALL VARLCQFYPN AIPSMLVSRF FRVYTQWRWP NPVMLCAIEE DDLSFPVWDP 121 RKNHRDRYHL MPIITPAYPC MNSSYNVSQS TLRVMTEQFQ FGNTICQMQE IELNKQHWSS 181 LFQQYMFFEA YKNYLQVDVL AADAEDLLAW KGWVESRFRQ LTLKIERDTN GMLMCHPQPN 241 EYVDTSKQFR HCAFFMGLQR ADGFGGQECQ QFDIRGTVDE FRQEVNMYMF WRPGMDVHVS 301 HVRRRQLPSF VFPNGYKRSR QSRHQSQQCR EPGDEGVGSL SDSVERYAKR KNDDEIMNSR 361 PEKREKRASC SLHTLDAASP DSSGITTSGT PQIGIVPGPR AECLVTGDLV CNVTSLPNVE 421 VEAEKFISKI TELRKFSQYE HTSGSEQILE VDSRALVQSY HDLAEPVAKH VRPDLSALLA 481 CEGGQNKEIG HDMGSESIND TDTQHLPRRL NVNEDVDEVE REAKLGEIAG GVLWNGHCGR 541 NLDHEGFVTP ANLDSAVENR NLHSDGLFKS GLPEELQSNS LLSGTGKLDD GARSESLQNE 601 MMRHVFLQPI IGLCKS- >37829GAATTCAACAACATCATGATAAACTTTTCAATCCACAACCAAGCCAAAATCACCATCATC+_PGL-1_AGS-1_PPS_2 (46190 46269,46464 46526,46602 46653) (frame '2'; 192 bp, 64 residues) 1 LVRGAILIRW WRMQMLSFSL LDLTALEFTD LWLILILCVL AHLMLTERRI SSFSSVIYWL 61 KWKK-