GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:36:31 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 1079 Minimum sequence length: 1079 Maximum sequence length: 1079 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 0 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 1 ________________________________________________________________________________ Sequence 1: 33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA, from 1 to 87918, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 2 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 0 ******************************************************************************** EST sequence 2 +strand (File: gi+) 1 TAAGAGGTAT TTGTCAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 61 TTTTCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 121 GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 181 GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGATTATCAG 241 TTGACGAAGT TTACGTTTGG TTCTCATGTT CTTGAGCTCT ACTGTCTCCA ATCAGCTTCA 301 ACTGATTTTG ATTTAACAGG GCAGCTGGTT TGGCCTGGTG CGATGCTTAT GAATGGTTAT 361 CTCTCAGAAA ATGCTGACAT TCTCCAGGGA TGTTCAGTTT TGGAGTTGGG ATCTGGCGTT 421 GGTATAACTG GAGTCCTATG TAGCAAATTT TGCCGTAAAG TTATTTTTAC TGACCACAAC 481 GATGAAGTGC TCAAGATACT GAAGAAAAAC ATTGACCTTC ATGGACATTC AAGTGGTCCC 541 AAACCCTCAG CTGAATTAGA GGCTGCAAAG CTTGAATGGG GAAATAGTGA TCAGCTTGGT 601 CAAATTTTAA AGAAACACAA TGATGGCTTT GATCTTATTC TTGGAGCTGA GATCTGCTTT 661 CAGCAATCAA GTGTGCCATT GCTATTTGAC AGTGTTGAGC AGCTTCTGCG GATCAGGGGA 721 CAAGGAAACT GCAAATTCAT ACTAGCATAC GTATCACGGG CTAGACAGAT GGATTCTGCA 781 ATCTTGAGAG AAGGCGCTCA GCACGGGATG CTGATGAATG AAGTTTCTGG GACTCGGTGT 841 ACCGTAGGAA ACTTGGAAGG GGTCATATAT GAAATCACAC TTCAAAAGAA GAGAGGAATT 901 GTGTTCGAGT AACTTAGTTC CTTTGATACC TCAGAATTTT GTAACATTAT TTTTATGTTA 961 TTTAGATGCA TACATTTTTG GTGAAACGTT TACTAAAGTT ACAGTTCAAA AAGTATACAA 1021 TGACATTTGT GGATGCTTTG AAGTGAATCC ACATTGTTAG CTGAAAAACT GTTGCAACT Predicted gene structure (within gDNA segment 61077 to 66044): Exon 1 61437 61669 ( 233 n); cDNA 1 233 ( 233 n); score: 0.979 Intron 1 61670 61781 ( 112 n); Pd: 0.844 (s: 1.00), Pa: 0.991 (s: 1.00) Exon 2 61782 61849 ( 68 n); cDNA 234 301 ( 68 n); score: 1.000 Intron 2 61850 63408 (1559 n); Pd: 0.999 (s: 1.00), Pa: 0.334 (s: n/a) Exon 3 63409 63427 ( 19 n); cDNA 302 320 ( 19 n); score: 0.579 MATCH 33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA+ gi+ 0.983 301 0.279 C PGS_33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA+_gi+ (61437 61669,61782 61849,63409 63427) Alignment (genomic DNA sequence = upper lines): GAGGTGGTAT TTGTTAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 61496 | | ||||| |||| ||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAGAGGTAT TTGTCAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 60 TTTCCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 61556 ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 120 GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 61616 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 180 GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGAGTAATTT 61676 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGA....... 233 TGTTACACAA ACAAATGTTT GTTTTATTTG GTTTTTAAAC TTAGATCAAG TGTCTGCTTT 61736 .......... .......... .......... .......... .......... .......... 233 TGGTTTTCTC CTTATACATG TTTGTTTTGT CTTTGTCGGA TGCAGTTATC AGTTGACGAA 61796 ||||| |||||||||| .......... .......... .......... .......... .....TTATC AGTTGACGAA 248 GTTTACGTTT GGTTCTCATG TTCTTGAGCT CTACTGTCTC CAATCAGCTT CAAGTAAGTT 61856 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| GTTTACGTTT GGTTCTCATG TTCTTGAGCT CTACTGTCTC CAATCAGCTT CAA....... 301 AATTTCACCT TTATCTTGTA TCAATCTTGC ATTGTATTTT TGTTTTTACA TTATTGTTTT 61916 .......... .......... .......... .......... .......... .......... 301 CAAGATTTTA ATTGTATTTT TCAATTATAT TTTGACCCCG ATAAAAAATT ATGGGTCCGC 61976 .......... .......... .......... .......... .......... .......... 301 CACTGACTCC AATGCCTAAA ACCACAAGTG ACGATTTATT CGAATTGAGT TGCAATGTTG 62036 .......... .......... .......... .......... .......... .......... 301 CATATCAAAA CTTGAATTTT AAGCTTTTTA AATACTGGAA AACAAATCAT GCAAACTGAA 62096 .......... .......... .......... .......... .......... .......... 301 AATAATTAAT GAAAGAATTA AGTAATCGTT ACCACGAAAT TTAGATGCGA TTATTTATGT 62156 .......... .......... .......... .......... .......... .......... 301 TTCACGTTAT TATTGAATAA GTGATCAACT GATAAACAAC GAAGCATGCA TGCATCAGCT 62216 .......... .......... .......... .......... .......... .......... 301 AAGAATGATA GAAGTCAGAC ATACACTATA CTGAATTACT GATACATGCA TTAGTTTATA 62276 .......... .......... .......... .......... .......... .......... 301 GGACGAATTA AAACACCTTT GACAATGAGC CCTCCCCTGT TATCGCCGGA GTCGACCTCA 62336 .......... .......... .......... .......... .......... .......... 301 GACATGCGGA ACCCAATCAG TCCGACATTC TCCGGTGTCG TAATGAATTC ACCGGCGTGT 62396 .......... .......... .......... .......... .......... .......... 301 ATAGTTACCC ATCCTTTCCC AATATGCTGC CTCATGTCAA CAGCTCGCTC TTGCGTTCCT 62456 .......... .......... .......... .......... .......... .......... 301 GGACCTCCTG GCCTACTGTT GATGTAAAAC AGGTTCATTT TCACCGCTGC ATTCCATTTA 62516 .......... .......... .......... .......... .......... .......... 301 AAAGCTGAGT CTTTTAAATT AACAACGAAC AGAACCTCGT AATGTGTCCA TGGTGCCATT 62576 .......... .......... .......... .......... .......... .......... 301 TCCGTCGTGT CCAAACTTCC ACTTACGTCA AACCAATAAA CTCCTAGAAG CTCAGCAACT 62636 .......... .......... .......... .......... .......... .......... 301 TCGACGAATG TATTGCTGGT AAAATCAACA AGTCTTATTT AAGTGAGGGT TAACTACATG 62696 .......... .......... .......... .......... .......... .......... 301 GCTTTTTGAG ACATACATGA TTCGGCCATA AGCCAACTTA ACTCTTTTCT ATGAAGTTAT 62756 .......... .......... .......... .......... .......... .......... 301 TTGTTTTTAA AAGTTATATT TGATCATGTA TTTGTAAATT GAAGCATTTC TTTCTGCCTA 62816 .......... .......... .......... .......... .......... .......... 301 AAGGGATGAT GAAAGCATAT CATGTGTTCG TTTGATTCAG ATTTTTCGGT ACGGTTTGGT 62876 .......... .......... .......... .......... .......... .......... 301 TTGGATCGGT TAAACAAGAT CAGACAATAA CACGGTCCCA CCACTTGTCC CATTATAACT 62936 .......... .......... .......... .......... .......... .......... 301 GTTTGTAGGA AATATATATA AATCCTTACC TTGATATGTT GTGATCCAGA TTTACCCATT 62996 .......... .......... .......... .......... .......... .......... 301 TCCAATGCTC TTCGCTATGA GACCATTCGA TGTTAAGATC CCTTGCGCAG ATCATGCGAT 63056 .......... .......... .......... .......... .......... .......... 301 TCTGTCCATA AATAACCGAC CTCAGTCCAC GAAACATGAT ACATATATAT ATATATATGA 63116 .......... .......... .......... .......... .......... .......... 301 CCAAAAAAAA TCAAACACAA AGTAGGAGGC ATATTTGATC GAATCTGATG ACCACAACCT 63176 .......... .......... .......... .......... .......... .......... 301 TCTTAAGCTC ATTCGTTTGA TGTTCCTTGA GCCTTTTCTC CTCTTCCTAG CAAACGAAGG 63236 .......... .......... .......... .......... .......... .......... 301 ATTTCTAACA TTTAAAATCA TTGTCTAAAG GCAAAACGTA CAAGAGATCA ATTTCTATTT 63296 .......... .......... .......... .......... .......... .......... 301 GAATGCAATT AAGTACAAGT ATACAAGGTT AAAGAATATG AATTTACCGC CTGTGCAATC 63356 .......... .......... .......... .......... .......... .......... 301 AACTGCTTGT TTTTCTCCTC TTCCATTTGT AACTTCTCCA CCATTACTGC AGCTAGTTCG 63416 || || .......... .......... .......... .......... .......... ..CTGATTTT 309 GTTTCAGCAC G 63427 | || | || | GATTTAACAG G 320 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 TAAGAGGTAT TTGTCAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 61 TTTTCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 121 GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 181 GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGATTATCAG 241 TTGACGAAGT TTACGTTTGG TTCTCATGTT CTTGAGCTCT ACTGTCTCCA ATCAGCTTCA 301 ACTGATTTTG ATTTAACAGG GCAGCTGGTT TGGCCTGGTG CGATGCTTAT GAATGGTTAT 361 CTCTCAGAAA ATGCTGACAT TCTCCAGGGA TGTTCAGTTT TGGAGTTGGG ATCTGGCGTT 421 GGTATAACTG GAGTCCTATG TAGCAAATTT TGCCGTAAAG TTATTTTTAC TGACCACAAC 481 GATGAAGTGC TCAAGATACT GAAGAAAAAC ATTGACCTTC ATGGACATTC AAGTGGTCCC 541 AAACCCTCAG CTGAATTAGA GGCTGCAAAG CTTGAATGGG GAAATAGTGA TCAGCTTGGT 601 CAAATTTTAA AGAAACACAA TGATGGCTTT GATCTTATTC TTGGAGCTGA GATCTGCTTT 661 CAGCAATCAA GTGTGCCATT GCTATTTGAC AGTGTTGAGC AGCTTCTGCG GATCAGGGGA 721 CAAGGAAACT GCAAATTCAT ACTAGCATAC GTATCACGGG CTAGACAGAT GGATTCTGCA 781 ATCTTGAGAG AAGGCGCTCA GCACGGGATG CTGATGAATG AAGTTTCTGG GACTCGGTGT 841 ACCGTAGGAA ACTTGGAAGG GGTCATATAT GAAATCACAC TTCAAAAGAA GAGAGGAATT 901 GTGTTCGAGT AACTTAGTTC CTTTGATACC TCAGAATTTT GTAACATTAT TTTTATGTTA 961 TTTAGATGCA TACATTTTTG GTGAAACGTT TACTAAAGTT ACAGTTCAAA AAGTATACAA 1021 TGACATTTGT GGATGCTTTG AAGTGAATCC ACATTGTTAG CTGAAAAACT GTTGCAACT Predicted gene structure (within gDNA segment 67935 to 71038): Exon 1 68247 68479 ( 233 n); cDNA 1 233 ( 233 n); score: 0.991 Intron 1 68480 68591 ( 112 n); Pd: 0.844 (s: 1.00), Pa: 0.990 (s: 1.00) Exon 2 68592 68659 ( 68 n); cDNA 234 301 ( 68 n); score: 1.000 Intron 2 68660 69172 ( 513 n); Pd: 0.992 (s: 1.00), Pa: 0.983 (s: 1.00) Exon 3 69173 69292 ( 120 n); cDNA 302 421 ( 120 n); score: 1.000 Intron 3 69293 69638 ( 346 n); Pd: 0.932 (s: 1.00), Pa: 0.843 (s: 1.00) Exon 4 69639 69712 ( 74 n); cDNA 422 495 ( 74 n); score: 1.000 Intron 4 69713 69793 ( 81 n); Pd: 0.000 (s: 1.00), Pa: 0.981 (s: 1.00) Exon 5 69794 69851 ( 58 n); cDNA 496 553 ( 58 n); score: 1.000 Intron 5 69852 69931 ( 80 n); Pd: 0.990 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 6 69932 70034 ( 103 n); cDNA 554 656 ( 103 n); score: 0.990 Intron 6 70035 70212 ( 178 n); Pd: 0.000 (s: 0.98), Pa: 0.001 (s: 1.00) Exon 7 70213 70323 ( 111 n); cDNA 657 767 ( 111 n); score: 1.000 Intron 7 70324 70421 ( 98 n); Pd: 0.985 (s: 1.00), Pa: 0.890 (s: 1.00) Exon 8 70422 70733 ( 312 n); cDNA 768 1079 ( 312 n); score: 1.000 MATCH 33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA+ gi+ 0.997 1079 1.000 C PGS_33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA+_gi+ (68247 68479,68592 68659,69173 69292,69639 69712,69794 69851,69932 70034,70213 70323,70422 70733) Alignment (genomic DNA sequence = upper lines): GAGGAGGTAT TTGTCAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 68306 | ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAGAGGTAT TTGTCAATCA ATACCTTTCT TCGTCGATTC CCTTAGCTGA TGTGTTGTTA 60 TTTTCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 68366 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCGAATT TGATTTGTTG GAACACACTT CGAAGCTTGG GATCTGAAAT ATCTGATTCT 120 GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 68426 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACTGTATAC TGTAGTCAAA GATGGGCAGT GAAAGTGATA AAGGTAGAGA AGCAATTGTT 180 GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGAGTAATTT 68486 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| GAAGAAGAAG AAGAAGAGAT AGTCTGCTTG GAGTCTTTCT TCATCAACGA TGA....... 233 TGTTACACAA ACAAATGTTT GTTTTATTTG GTTTTTAAAC TTAGATCAAG TGTCTGCTTT 68546 .......... .......... .......... .......... .......... .......... 233 TGGTTTTCTC AATATACATG TTTGTTTTGT CTTTGTCGGA TGCAGTTATC AGTTGACGAA 68606 ||||| |||||||||| .......... .......... .......... .......... .....TTATC AGTTGACGAA 248 GTTTACGTTT GGTTCTCATG TTCTTGAGCT CTACTGTCTC CAATCAGCTT CAAGTGAGTT 68666 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| GTTTACGTTT GGTTCTCATG TTCTTGAGCT CTACTGTCTC CAATCAGCTT CAA....... 301 TCATCTTTAT TTTGATTTTT GTATCAATGT TGCATTGTAT TTTTGTTCTT TAGATAATAA 68726 .......... .......... .......... .......... .......... .......... 301 AGGAGCAAAC TTTATTGGTT AAGTTTTGAG GTTTGTGTTA TAAGTAAGCT GATTAACAAA 68786 .......... .......... .......... .......... .......... .......... 301 GATTTTATTG GCCATGGAGA ACTTTTGTTG TTTCTATACT CTTCGAATCT TCATATAGTT 68846 .......... .......... .......... .......... .......... .......... 301 ATCATTAGTC AATCTTGTGT CACTGAGCTA TATTCTAGGC TGGTGTATTC TTTGAACTTT 68906 .......... .......... .......... .......... .......... .......... 301 TATTGCCATT GGAAGACTAA ATATGAGATT CAGATACCCT TTTTGTGAAT TTCTTCATCA 68966 .......... .......... .......... .......... .......... .......... 301 ATTATTTTTG ACTATGAACT GTTTATATGC TCTTGAAATA GTCAATCTGG AGTTACTTAT 69026 .......... .......... .......... .......... .......... .......... 301 CTGTATTCTA GGCTGGTGTA TTCTTTTAAC TTCTAATGCC ATTGAAAGAC TAAATAAGAA 69086 .......... .......... .......... .......... .......... .......... 301 CAATGAGTTC AGAGATATAT TTTTTTTTTT ATCTAAACCC TTGTAGTGTT TATCATAAAG 69146 .......... .......... .......... .......... .......... .......... 301 CTATGTCTAA AATGTTTCTT GGACAGCTGA TTTTGATTTA ACAGGGCAGC TGGTTTGGCC 69206 |||| |||||||||| |||||||||| |||||||||| .......... .......... ......CTGA TTTTGATTTA ACAGGGCAGC TGGTTTGGCC 335 TGGTGCGATG CTTATGAATG GTTATCTCTC AGAAAATGCT GACATTCTCC AGGGATGTTC 69266 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTGCGATG CTTATGAATG GTTATCTCTC AGAAAATGCT GACATTCTCC AGGGATGTTC 395 AGTTTTGGAG TTGGGATCTG GCGTTGGTAA GTGCTGTTTA TTTCCCCCTA CCATTTTCCA 69326 |||||||||| |||||||||| |||||| AGTTTTGGAG TTGGGATCTG GCGTTG.... .......... .......... .......... 421 TTTCGATTGG ACCACTCTCA CACTCTGTAT GTATCTTGTT TATCCTAGTA TGTAATGGCT 69386 .......... .......... .......... .......... .......... .......... 421 AATCACTACT TGTGCATGCG TCTTCAAATT TCAGTTAGAG ATGTTATGTA TTGTAACATC 69446 .......... .......... .......... .......... .......... .......... 421 ATCACTCCCT GTTAGATATG CTGTATATAC AAATATGGTC TTTGCATACT CTGCTTGGCT 69506 .......... .......... .......... .......... .......... .......... 421 GACTTCACCA TCATTCACAT GTATAAACTC TTATATTTGT GCTCTTTCTT TTGTCTTTGA 69566 .......... .......... .......... .......... .......... .......... 421 GAACATACTG TTTTTATTTT TTTCCGGTTT TCGTTCCTGT GTAAAACTGA TCTGTAATCT 69626 .......... .......... .......... .......... .......... .......... 421 TTTATAACAC AGGTATAACT GGAGTCCTAT GTAGCAAATT TTGCCGTAAA GTTATTTTTA 69686 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..GTATAACT GGAGTCCTAT GTAGCAAATT TTGCCGTAAA GTTATTTTTA 469 CTGACCACAA CGATGAAGTG CTCAAGGCAA GATTACGTTA CTGAAGATTT GATATGGGAA 69746 |||||||||| |||||||||| |||||| CTGACCACAA CGATGAAGTG CTCAAG.... .......... .......... .......... 495 TTCAACATTG GTAGAATATA ATGAGGTCTT TTAAATGAAC TTTTCAGATA CTGAAGAAAA 69806 ||| |||||||||| .......... .......... .......... .......... .......ATA CTGAAGAAAA 508 ACATTGACCT TCATGGACAT TCAAGTGGTC CCAAACCCTC AGCTGGTGAG AAAGAAAATC 69866 |||||||||| |||||||||| |||||||||| |||||||||| ||||| ACATTGACCT TCATGGACAT TCAAGTGGTC CCAAACCCTC AGCTG..... .......... 553 AACTAAGCTA TTGACTCTCA TTGTTCATAT TATTGATGTG TTCTGTTTCT CATTTCTACC 69926 .......... .......... .......... .......... .......... .......... 553 TACAGAATTA GAGGCTGCAA AGCTTGAATG GGGAAATAGT GATCAGCTTG GTCAAATTTT 69986 ||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .....AATTA GAGGCTGCAA AGCTTGAATG GGGAAATAGT GATCAGCTTG GTCAAATTTT 608 AAAGAAACAC AATGATGGCT TTGATCTTAT TCTTGGAGCT GAGATCTATA TCCTTATGTT 70046 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| AAAGAAACAC AATGATGGCT TTGATCTTAT TCTTGGAGCT GAGATCTG.. .......... 656 CGTTAACGAA TTCTTTATCT ATCCGCTAGG ACAGTGGTTG ATCTTAGTTT TATCCGTAGT 70106 .......... .......... .......... .......... .......... .......... 656 TGAAGTTTTG GTCCGGATCT TCTGTTCAGT TTTCATCAGT TAAACCGTCA TGGGCACTCC 70166 .......... .......... .......... .......... .......... .......... 656 TATAATTTAC ACCGGAATGA AAAAGAAAAC CTTGACATTA CGTAAGCTTT CAGCAATCAA 70226 |||| |||||||||| .......... .......... .......... .......... ......CTTT CAGCAATCAA 670 GTGTGCCATT GCTATTTGAC AGTGTTGAGC AGCTTCTGCG GATCAGGGGA CAAGGAAACT 70286 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTGCCATT GCTATTTGAC AGTGTTGAGC AGCTTCTGCG GATCAGGGGA CAAGGAAACT 730 GCAAATTCAT ACTAGCATAC GTATCACGGG CTAGACAGTA AGTTTGTCAG CTAAAACTTT 70346 |||||||||| |||||||||| |||||||||| ||||||| GCAAATTCAT ACTAGCATAC GTATCACGGG CTAGACA... .......... .......... 767 TGCATTTTTC TCATATAAGT ATAACTGCAA GTTTCATACG AGTCAGAGAA TAAACTATCT 70406 .......... .......... .......... .......... .......... .......... 767 GAAACGTTAT TTCAGGATGG ATTCTGCAAT CTTGAGAGAA GGCGCTCAGC ACGGGATGCT 70466 ||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .....GATGG ATTCTGCAAT CTTGAGAGAA GGCGCTCAGC ACGGGATGCT 812 GATGAATGAA GTTTCTGGGA CTCGGTGTAC CGTAGGAAAC TTGGAAGGGG TCATATATGA 70526 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGAATGAA GTTTCTGGGA CTCGGTGTAC CGTAGGAAAC TTGGAAGGGG TCATATATGA 872 AATCACACTT CAAAAGAAGA GAGGAATTGT GTTCGAGTAA CTTAGTTCCT TTGATACCTC 70586 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATCACACTT CAAAAGAAGA GAGGAATTGT GTTCGAGTAA CTTAGTTCCT TTGATACCTC 932 AGAATTTTGT AACATTATTT TTATGTTATT TAGATGCATA CATTTTTGGT GAAACGTTTA 70646 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAATTTTGT AACATTATTT TTATGTTATT TAGATGCATA CATTTTTGGT GAAACGTTTA 992 CTAAAGTTAC AGTTCAAAAA GTATACAATG ACATTTGTGG ATGCTTTGAA GTGAATCCAC 70706 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTAAAGTTAC AGTTCAAAAA GTATACAATG ACATTTGTGG ATGCTTTGAA GTGAATCCAC 1052 ATTGTTAGCT GAAAAACTGT TGCAACT 70733 |||||||||| |||||||||| ||||||| ATTGTTAGCT GAAAAACTGT TGCAACT 1079 Total number of EST alignments reported: 2 ________________________________________________________________________________ Predicted gene locations (2): PGL 1 (+ strand): 61437 61849 AGS-1 (61437 61669,61782 61849) SCR (e 0.979 d 0.844 a 0.991,e 1.000) Exon 1 61437 61669 ( 233 n); score: 0.979 Intron 1 61670 61781 ( 112 n); Pd: 0.844 Pa: 0.991 Exon 2 61782 61849 ( 68 n); score: 1.000 PGS (61437 61669,61782 61849) gi+ 3-phase translation of AGS-1 (+strand): . . . . . . 61437 GAGGTGGTATTTGTTAATCAATACCTTTCTTCGTCGATTCCCTTAGCTGATGTGTTGTTA E V V F V N Q Y L S S S I P L A D V L L R W Y L L I N T F L R R F P - L M C C Y G G I C - S I P F F V D S L S - C V V . . . . . . 61497 TTTCCGAATTTGATTTGTTGGAACACACTTCGAAGCTTGGGATCTGAAATATCTGATTCT F P N L I C W N T L R S L G S E I S D S F R I - F V G T H F E A W D L K Y L I L I S E F D L L E H T S K L G I - N I - F . . . . . . 61557 GACTGTATACTGTAGTCAAAGATGGGCAGTGAAAGTGATAAAGGTAGAGAAGCAATTGTT D C I L - S K M G S E S D K G R E A I V T V Y C S Q R W A V K V I K V E K Q L L - L Y T V V K D G Q - K - - R - R S N C . . . . . . : 61617 GAAGAAGAAGAAGAAGAGATAGTCTGCTTGGAGTCTTTCTTCATCAACGATGA : TTATCAG E E E E E E I V C L E S F F I N D D : Y Q K K K K K R - S A W S L S S S T M : I I S - R R R R R D S L L G V F L H Q R - : L S . . . . . . 61789 TTGACGAAGTTTACGTTTGGTTCTCATGTTCTTGAGCTCTACTGTCTCCAATCAGCTTCA L T K F T F G S H V L E L Y C L Q S A S - R S L R L V L M F L S S T V S N Q L Q V D E V Y V W F S C S - A L L S P I S F . 61849 A Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 68247 70733 AGS-1 (68247 68479,68592 68659,69173 69292,69639 69712,69794 69851,69932 70034,70213 70323,70422 70733) SCR (e 0.991 d 0.844 a 0.990,e 1.000 d 0.992 a 0.983,e 1.000 d 0.932 a 0.843,e 1.000 d 0.000 a 0.981,e 1.000 d 0.990 a 0.994,e 0.990 d 0.000 a 0.001,e 1.000 d 0.985 a 0.890,e 1.000) Exon 1 68247 68479 ( 233 n); score: 0.991 Intron 1 68480 68591 ( 112 n); Pd: 0.844 Pa: 0.990 Exon 2 68592 68659 ( 68 n); score: 1.000 Intron 2 68660 69172 ( 513 n); Pd: 0.992 Pa: 0.983 Exon 3 69173 69292 ( 120 n); score: 1.000 Intron 3 69293 69638 ( 346 n); Pd: 0.932 Pa: 0.843 Exon 4 69639 69712 ( 74 n); score: 1.000 Intron 4 69713 69793 ( 81 n); Pd: 0.000 Pa: 0.981 Exon 5 69794 69851 ( 58 n); score: 1.000 Intron 5 69852 69931 ( 80 n); Pd: 0.990 Pa: 0.994 Exon 6 69932 70034 ( 103 n); score: 0.990 Intron 6 70035 70212 ( 178 n); Pd: 0.000 Pa: 0.001 Exon 7 70213 70323 ( 111 n); score: 1.000 Intron 7 70324 70421 ( 98 n); Pd: 0.985 Pa: 0.890 Exon 8 70422 70733 ( 312 n); score: 1.000 PGS (68247 68479,68592 68659,69173 69292,69639 69712,69794 69851,69932 70034,70213 70323,70422 70733) gi+ 3-phase translation of AGS-1 (+strand): . . . . . . 68247 GAGGAGGTATTTGTCAATCAATACCTTTCTTCGTCGATTCCCTTAGCTGATGTGTTGTTA E E V F V N Q Y L S S S I P L A D V L L R R Y L S I N T F L R R F P - L M C C Y G G I C Q S I P F F V D S L S - C V V . . . . . . 68307 TTTTCGAATTTGATTTGTTGGAACACACTTCGAAGCTTGGGATCTGAAATATCTGATTCT F S N L I C W N T L R S L G S E I S D S F R I - F V G T H F E A W D L K Y L I L I F E F D L L E H T S K L G I - N I - F . . . . . . 68367 GACTGTATACTGTAGTCAAAGATGGGCAGTGAAAGTGATAAAGGTAGAGAAGCAATTGTT D C I L - S K M G S E S D K G R E A I V T V Y C S Q R W A V K V I K V E K Q L L - L Y T V V K D G Q - K - - R - R S N C . . . . . . : 68427 GAAGAAGAAGAAGAAGAGATAGTCTGCTTGGAGTCTTTCTTCATCAACGATGA : TTATCAG E E E E E E I V C L E S F F I N D D : Y Q K K K K K R - S A W S L S S S T M : I I S - R R R R R D S L L G V F L H Q R - : L S . . . . . . 68599 TTGACGAAGTTTACGTTTGGTTCTCATGTTCTTGAGCTCTACTGTCTCCAATCAGCTTCA L T K F T F G S H V L E L Y C L Q S A S - R S L R L V L M F L S S T V S N Q L Q V D E V Y V W F S C S - A L L S P I S F . : . . . . . 68659 A : CTGATTTTGATTTAACAGGGCAGCTGGTTTGGCCTGGTGCGATGCTTATGAATGGTTAT : T D F D L T G Q L V W P G A M L M N G Y : L I L I - Q G S W F G L V R C L - M V I N : - F - F N R A A G L A W C D A Y E W L . . . . . . 69232 CTCTCAGAAAATGCTGACATTCTCCAGGGATGTTCAGTTTTGGAGTTGGGATCTGGCGTT L S E N A D I L Q G C S V L E L G S G V S Q K M L T F S R D V Q F W S W D L A L S L R K C - H S P G M F S F G V G I W R . : . . . . . 69292 G : GTATAACTGGAGTCCTATGTAGCAAATTTTGCCGTAAAGTTATTTTTACTGACCACAAC : G I T G V L C S K F C R K V I F T D H N : V - L E S Y V A N F A V K L F L L T T T W : Y N W S P M - Q I L P - S Y F Y - P Q . . : . . . . 69698 GATGAAGTGCTCAAG : ATACTGAAGAAAAACATTGACCTTCATGGACATTCAAGTGGTCCC D E V L K : I L K K N I D L H G H S S G P M K C S R : Y - R K T L T F M D I Q V V P R - S A Q : D T E E K H - P S W T F K W S . . : . . . . 69839 AAACCCTCAGCTG : AATTAGAGGCTGCAAAGCTTGAATGGGGAAATAGTGATCAGCTTGGT K P S A : E L E A A K L E W G N S D Q L G N P Q L : N - R L Q S L N G E I V I S L V Q T L S - : I R G C K A - M G K - - S A W . . . . . . : 69979 CAAATTTTAAAGAAACACAATGATGGCTTTGATCTTATTCTTGGAGCTGAGATCTA : CTTT Q I L K K H N D G F D L I L G A E I Y : F K F - R N T M M A L I L F L E L R S : T F S N F K E T Q - W L - S Y S W S - D L : L . . . . . . 70217 CAGCAATCAAGTGTGCCATTGCTATTTGACAGTGTTGAGCAGCTTCTGCGGATCAGGGGA Q Q S S V P L L F D S V E Q L L R I R G S N Q V C H C Y L T V L S S F C G S G D S A I K C A I A I - Q C - A A S A D Q G . . . . . : . 70277 CAAGGAAACTGCAAATTCATACTAGCATACGTATCACGGGCTAGACA : GATGGATTCTGCA Q G N C K F I L A Y V S R A R Q : M D S A K E T A N S Y - H T Y H G L D : R W I L Q T R K L Q I H T S I R I T G - T : D G F C . . . . . . 70435 ATCTTGAGAGAAGGCGCTCAGCACGGGATGCTGATGAATGAAGTTTCTGGGACTCGGTGT I L R E G A Q H G M L M N E V S G T R C S - E K A L S T G C - - M K F L G L G V N L E R R R S A R D A D E - S F W D S V . . . . . . 70495 ACCGTAGGAAACTTGGAAGGGGTCATATATGAAATCACACTTCAAAAGAAGAGAGGAATT T V G N L E G V I Y E I T L Q K K R G I P - E T W K G S Y M K S H F K R R E E L Y R R K L G R G H I - N H T S K E E R N . . . . . . 70555 GTGTTCGAGTAACTTAGTTCCTTTGATACCTCAGAATTTTGTAACATTATTTTTATGTTA V F E - L S S F D T S E F C N I I F M L C S S N L V P L I P Q N F V T L F L C Y C V R V T - F L - Y L R I L - H Y F Y V . . . . . . 70615 TTTAGATGCATACATTTTTGGTGAAACGTTTACTAAAGTTACAGTTCAAAAAGTATACAA F R C I H F W - N V Y - S Y S S K S I Q L D A Y I F G E T F T K V T V Q K V Y N I - M H T F L V K R L L K L Q F K K Y T . . . . . . 70675 TGACATTTGTGGATGCTTTGAAGTGAATCCACATTGTTAGCTGAAAAACTGTTGCAACT - H L W M L - S E S T L L A E K L L Q D I C G C F E V N P H C - L K N C C N M T F V D A L K - I H I V S - K T V A T Maximal non-overlapping open reading frames (>= 64 codons): >33026gAATTCTTATTGGATTGGACTTCAACTCAATCTTATTTGGACGATTTAAAATGATCTTAA+_PGL-2_AGS-1_PPS_1 (68382 68479,68592 68659,69173 69292,69639 69712,69794 69851,69932 70034,70213 70323,70422 70566) (frame '1'; 774 bp, 258 residues) 1 SKMGSESDKG REAIVEEEEE EIVCLESFFI NDDYQLTKFT FGSHVLELYC LQSASTDFDL 61 TGQLVWPGAM LMNGYLSENA DILQGCSVLE LGSGVGITGV LCSKFCRKVI FTDHNDEVLK 121 ILKKNIDLHG HSSGPKPSAE LEAAKLEWGN SDQLGQILKK HNDGFDLILG AEIYFQQSSV 181 PLLFDSVEQL LRIRGQGNCK FILAYVSRAR QMDSAILREG AQHGMLMNEV SGTRCTVGNL 241 EGVIYEITLQ KKRGIVFE-