GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:38:17 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs: 1
Total sequence length: 623
Minimum sequence length: 623
Maximum sequence length: 623

Length distribution (number of sequences of specified length):
<  100: 0
<  200: 0
<  300: 0
<  400: 0
<  500: 0
<  600: 0
<  700: 1
<  800: 0
<  900: 0
< 1000: 0
>=1000: 0

________________________________________________________________________________

Sequence 1: 48852GAATTCACAACACTTCATACCATCTGATTTCAAGACAAGATCTCTTTGGGAAAACTACGT, from 1 to 93969, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 1

EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

********************************************************************************

EST sequence 1 -strand (File: gi-)

  1 AACTGCTCCT GGTTTGCGGT TGGTAGATAA ATGGGCGGTA CTGATTGGTG GAGGGAGGAA
 61 GTGAAGGAAG TGTTAGAGTA TGCTAGTGGA TCCGAGACTC GGTTGACGAG AATAATGCTG
121 GATAATATGG TTGTGCCCTT AGAGAATGGA GATGTCGATG TCACAATGCT TAAAGATGCG
181 GTAGAGCTAA TCAACGGGAG ATTCGAGACA GAGGCATCAG GCAATGTTAC ACTTGAGACA
241 GTACACAAGA TCGGACAGAG TGGAGTTACA TTCATTAGCA GTGGAGCTCT TACGCATTCG
301 GTGAAAGCAC TAGACATATC TCTGAAGATC GATACAGAGT TGGCTCTCGA GGTGGGAAGA
361 AGGACCAAAC GAGCATAGAG AACATCGTCT TCCTCGTATA CTTACTTTCT TTTTATTTTC
421 CGGGAAACAT GTAATCAATA AGGTGAGAGG CAAACGAATG AATCTGCCTA TAAAATGACT
481 GTTTCTTCTA CAATGTAATA ATAGTAAAAC TCTTGAGTAC AATAGAGTGG AGACATGGAT
541 GAATATCTGA ATAAGCTTTC TCTCAAACTT TCTGCGTTTT TTATGTACAT AGAATTTGCA
601 GAAACGTTGG GAAGCATTCT CTA

Predicted gene structure (within gDNA segment 76077 to 74323):

Exon 1 75772 75736 ( 37 n); cDNA 1 37 ( 37 n); score: 1.000
Intron 1 75735 75604 ( 132 n); Pd: 0.958 (s: n/a), Pa: 0.940 (s: n/a)
Exon 2 75603 75584 ( 20 n); cDNA 38 57 ( 20 n); score: 1.000
Intron 2 75583 75344 ( 240 n); Pd: 0.000 (s: n/a), Pa: 0.001 (s: 1.00)
Exon 3 75343 75188 ( 156 n); cDNA 58 213 ( 156 n); score: 1.000
Intron 3 75187 75116 ( 72 n); Pd: 0.789 (s: 1.00), Pa: 0.997 (s: 1.00)
Exon 4 75115 75048 ( 68 n); cDNA 214 281 ( 68 n); score: 1.000
Intron 4 75047 74965 ( 83 n); Pd: 1.000 (s: 1.00), Pa: 0.846 (s: 1.00)
Exon 5 74964 74623 ( 342 n); cDNA 282 623 ( 342 n); score: 1.000

MATCH 48852GAATTCACAACACTTCATACCATCTGATTTCAAGACAAGATCTCTTTGGGAAAACTACGT- gi- 1.000 566 0.909 C
PGS_48852GAATTCACAACACTTCATACCATCTGATTTCAAGACAAGATCTCTTTGGGAAAACTACGT-_gi- (75772 75736,75603 75584,75343 75188,75115 75048,74964 74623)

Alignment (genomic DNA sequence = upper lines):

AACTGCTCCT GGTTTGCGGT TGGTAGATAA ATGGGCGGTA CGTTCCACAC TACTGTTATA  75713
|||||||||| |||||||||| |||||||||| |||||||
AACTGCTCCT GGTTTGCGGT TGGTAGATAA ATGGGCG... .......... ..........  37

CCTATTTCAA ACCCAACCTT TGAGAGTTGA GACACATTTG TTGCTATATT TGTCTCATCT  75653

.......... .......... .......... .......... .......... ..........  37

TTGGAATATT AGGATTATGA TACACCTCAA GTTTTCTTGC TTAGCTTAGG TACTGATTGG  75593
                                                     | ||||||||||
.......... .......... .......... .......... .........G TACTGATTGG  48

TGGAGGGAGG AATCATAGGA TGGGGTTATT TGATATGGTG ATGATAAAAG ATAACCATAT  75533
|||||||||
TGGAGGGAG. .......... .......... .......... .......... ..........  57

CTCAGCTGCA GGCGGTATAG TAAACGCCGT CAAATCCGTA GACGAATATC TAAAGCAAAA  75473

.......... .......... .......... .......... .......... ..........  57

GAATCTCGAA ATGGATGTAG AGGTAACTAA TCTAAAGCTT TCAAACCGTT TTGTGGCTGC  75413

.......... .......... .......... .......... .......... ..........  57

AAGTTAAATG GCAAACAAGT TGTAATCACT ATATATCTTT ACTTAAAGGT GGAGACGAGG  75353

.......... .......... .......... .......... .......... ..........  57

ACGCTTGAGG AAGTGAAGGA AGTGTTAGAG TATGCTAGTG GATCCGAGAC TCGGTTGACG  75293
         | |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
.........G AAGTGAAGGA AGTGTTAGAG TATGCTAGTG GATCCGAGAC TCGGTTGACG  108

AGAATAATGC TGGATAATAT GGTTGTGCCC TTAGAGAATG GAGATGTCGA TGTCACAATG  75233
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AGAATAATGC TGGATAATAT GGTTGTGCCC TTAGAGAATG GAGATGTCGA TGTCACAATG  168

CTTAAAGATG CGGTAGAGCT AATCAACGGG AGATTCGAGA CAGAGGTATG TGTGAGTTAG  75173
|||||||||| |||||||||| |||||||||| |||||||||| |||||
CTTAAAGATG CGGTAGAGCT AATCAACGGG AGATTCGAGA CAGAG..... ..........  213

AAAGAGAGAT AACAGATTTC TTTTAGTGAA ACTCAGACTC TAAATCTCAT GATGTAGGCA  75113
                                                              |||
.......... .......... .......... .......... .......... .......GCA  216

TCAGGCAATG TTACACTTGA GACAGTACAC AAGATCGGAC AGAGTGGAGT TACATTCATT  75053
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TCAGGCAATG TTACACTTGA GACAGTACAC AAGATCGGAC AGAGTGGAGT TACATTCATT  276

AGCAGGTAAG AATACTTTCG AAATCTGTTT TTGATACAAA TGTTTTGGTT TTTGGTCAGC  74993
|||||
AGCAG..... .......... .......... .......... .......... ..........  281

TGAAAACTGA AAGATCATAT GAATGCAGTG GAGCTCTTAC GCATTCGGTG AAAGCACTAG  74933
                              || |||||||||| |||||||||| ||||||||||
.......... .......... ........TG GAGCTCTTAC GCATTCGGTG AAAGCACTAG  313

ACATATCTCT GAAGATCGAT ACAGAGTTGG CTCTCGAGGT GGGAAGAAGG ACCAAACGAG  74873
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
ACATATCTCT GAAGATCGAT ACAGAGTTGG CTCTCGAGGT GGGAAGAAGG ACCAAACGAG  373

CATAGAGAAC ATCGTCTTCC TCGTATACTT ACTTTCTTTT TATTTTCCGG GAAACATGTA  74813
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
CATAGAGAAC ATCGTCTTCC TCGTATACTT ACTTTCTTTT TATTTTCCGG GAAACATGTA  433

ATCAATAAGG TGAGAGGCAA ACGAATGAAT CTGCCTATAA AATGACTGTT TCTTCTACAA  74753
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
ATCAATAAGG TGAGAGGCAA ACGAATGAAT CTGCCTATAA AATGACTGTT TCTTCTACAA  493

TGTAATAATA GTAAAACTCT TGAGTACAAT AGAGTGGAGA CATGGATGAA TATCTGAATA  74693
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TGTAATAATA GTAAAACTCT TGAGTACAAT AGAGTGGAGA CATGGATGAA TATCTGAATA  553

AGCTTTCTCT CAAACTTTCT GCGTTTTTTA TGTACATAGA ATTTGCAGAA ACGTTGGGAA  74633
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AGCTTTCTCT CAAACTTTCT GCGTTTTTTA TGTACATAGA ATTTGCAGAA ACGTTGGGAA  613

GCATTCTCTA  74623
||||||||||
GCATTCTCTA  623

Total number of EST alignments reported: 1

________________________________________________________________________________

Predicted gene locations (1):

PGL 1 (- strand): 75772 74623
AGS-1 (75772 75736,75603 75584,75343 75188,75115 75048,74964 74623)
SCR (e 1.000 d 0.958 a 0.940,e 1.000 d 0.000 a 0.001,e 1.000 d 0.789 a 0.997,e 1.000 d 1.000 a 0.846,e 1.000)

Exon 1 75772 75736 ( 37 n); score: 1.000
Intron 1 75735 75604 ( 132 n); Pd: 0.958 Pa: 0.940
Exon 2 75603 75584 ( 20 n); score: 1.000
Intron 2 75583 75344 ( 240 n); Pd: 0.000 Pa: 0.001
Exon 3 75343 75188 ( 156 n); score: 1.000
Intron 3 75187 75116 ( 72 n); Pd: 0.789 Pa: 0.997
Exon 4 75115 75048 ( 68 n); score: 1.000
Intron 4 75047 74965 ( 83 n); Pd: 1.000 Pa: 0.846
Exon 5 74964 74623 ( 342 n); score: 1.000

PGS (75772 75736,75603 75584,75343 75188,75115 75048,74964 74623) gi-

3-phase translation of AGS-1 (-strand):

       .         .         .         .       :    .         .       :
75772  AACTGCTCCTGGTTTGCGGTTGGTAGATAAATGGGCG : GTACTGATTGGTGGAGGGAG : GAA
        N  C  S  W  F  A  V  G  R  -  M  G   : G  T  D  W  W  R  E  :  E
         T  A  P  G  L  R  L  V  D  K  W  A  :  V  L  I  G  G  G  R :   K
          L  L  L  V  C  G  W  -  I  N  G  R :   Y  -  L  V  E  G   : G

       .         .         .         .         .         .
75340  GTGAAGGAAGTGTTAGAGTATGCTAGTGGATCCGAGACTCGGTTGACGAGAATAATGCTG
        V  K  E  V  L  E  Y  A  S  G  S  E  T  R  L  T  R  I  M  L
         -  R  K  C  -  S  M  L  V  D  P  R  L  G  -  R  E  -  C  W
       S  E  G  S  V  R  V  C  -  W  I  R  D  S  V  D  E  N  N  A

       .         .         .         .         .         .
75280  GATAATATGGTTGTGCCCTTAGAGAATGGAGATGTCGATGTCACAATGCTTAAAGATGCG
        D  N  M  V  V  P  L  E  N  G  D  V  D  V  T  M  L  K  D  A
         I  I  W  L  C  P  -  R  M  E  M  S  M  S  Q  C  L  K  M  R
       G  -  Y  G  C  A  L  R  E  W  R  C  R  C  H  N  A  -  R  C

       .         .         .         .   :        .         .
75220  GTAGAGCTAATCAACGGGAGATTCGAGACAGAG : GCATCAGGCAATGTTACACTTGAGACA
        V  E  L  I  N  G  R  F  E  T  E  :  A  S  G  N  V  T  L  E  T
         -  S  -  S  T  G  D  S  R  Q  R :   H  Q  A  M  L  H  L  R  Q
       G  R  A  N  Q  R  E  I  R  D  R   : G  I  R  Q  C  Y  T  -  D

       .         .         .         .         . :          .
75088  GTACACAAGATCGGACAGAGTGGAGTTACATTCATTAGCAG : TGGAGCTCTTACGCATTCG
        V  H  K  I  G  Q  S  G  V  T  F  I  S  S :   G  A  L  T  H  S
         Y  T  R  S  D  R  V  E  L  H  S  L  A   : V  E  L  L  R  I  R
       S  T  Q  D  R  T  E  W  S  Y  I  H  -  Q  :  W  S  S  Y  A  F

       .         .         .         .         .         .
74945  GTGAAAGCACTAGACATATCTCTGAAGATCGATACAGAGTTGGCTCTCGAGGTGGGAAGA
        V  K  A  L  D  I  S  L  K  I  D  T  E  L  A  L  E  V  G  R
         -  K  H  -  T  Y  L  -  R  S  I  Q  S  W  L  S  R  W  E  E
       G  E  S  T  R  H  I  S  E  D  R  Y  R  V  G  S  R  G  G  K

       .         .         .         .         .         .
74885  AGGACCAAACGAGCATAGAGAACATCGTCTTCCTCGTATACTTACTTTCTTTTTATTTTC
        R  T  K  R  A  -  R  T  S  S  S  S  Y  T  Y  F  L  F  I  F
         G  P  N  E  H  R  E  H  R  L  P  R  I  L  T  F  F  L  F  S
       K  D  Q  T  S  I  E  N  I  V  F  L  V  Y  L  L  S  F  Y  F

       .         .         .         .         .         .
74825  CGGGAAACATGTAATCAATAAGGTGAGAGGCAAACGAATGAATCTGCCTATAAAATGACT
        R  E  T  C  N  Q  -  G  E  R  Q  T  N  E  S  A  Y  K  M  T
         G  K  H  V  I  N  K  V  R  G  K  R  M  N  L  P  I  K  -  L
       P  G  N  M  -  S  I  R  -  E  A  N  E  -  I  C  L  -  N  D

       .         .         .         .         .         .
74765  GTTTCTTCTACAATGTAATAATAGTAAAACTCTTGAGTACAATAGAGTGGAGACATGGAT
        V  S  S  T  M  -  -  -  -  N  S  -  V  Q  -  S  G  D  M  D
         F  L  L  Q  C  N  N  S  K  T  L  E  Y  N  R  V  E  T  W  M
       C  F  F  Y  N  V  I  I  V  K  L  L  S  T  I  E  W  R  H  G

       .         .         .         .         .         .
74705  GAATATCTGAATAAGCTTTCTCTCAAACTTTCTGCGTTTTTTATGTACATAGAATTTGCA
        E  Y  L  N  K  L  S  L  K  L  S  A  F  F  M  Y  I  E  F  A
         N  I  -  I  S  F  L  S  N  F  L  R  F  L  C  T  -  N  L  Q
       -  I  S  E  -  A  F  S  Q  T  F  C  V  F  Y  V  H  R  I  C

       .         .         .
74645  GAAACGTTGGGAAGCATTCTCTA
        E  T  L  G  S  I  L
         K  R  W  E  A  F  S
       R  N  V  G  K  H  S  L

Maximal non-overlapping open reading frames (>= 64 codons):

>48852GAATTCACAACACTTCATACCATCTGATTTCAAGACAAGATCTCTTTGGGAAAACTACGT-_PGL-1_AGS-1_PPS_1 (75742 75736,75603 75584,75343 75188,75115 75048,74964 74868) (frame '1'; 345 bp, 115 residues)
  1 MGGTDWWREE VKEVLEYASG SETRLTRIML DNMVVPLENG DVDVTMLKDA VELINGRFET
 61 EASGNVTLET VHKIGQSGVT FISSGALTHS VKALDISLKI DTELALEVGR RTKRA-