GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:35:29 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 254 Minimum sequence length: 254 Maximum sequence length: 254 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 1 < 400: 0 < 500: 0 < 600: 0 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 7001AAGCTTGAAGAACATGAGCCCCCATCTATACTTAAGCTGTCATTGAGACAACAAGTTCTC, from 1 to 98877, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 0 ******************************************************************************** EST sequence 1 -strand (File: gi-) 1 TGCTTCCACT GCTATTGCTA AAATGAATAA TTTTCCTTTC TACGACAAGG AGATGAGAAT 61 ACAATATGCC CGAGGCACCA GTCCGCCAAA TAATATTCTC TTTGTCCAAA ACCTTCCTCA 121 CGAGACAACT CCAATGGTGC TTCAGATGTT GTTCTGCCAG TACCAAGGAT TTAAGGAAGT 181 TAGAATGATT GAAGCCAAAC CGGGAATCGC CTTTGTGGAG TTTGCTGATG AGATGCAGTC 241 GACGGTCGCA ATGC Predicted gene structure (within gDNA segment 31323 to 29656): Exon 1 31018 30964 ( 55 n); cDNA 1 55 ( 55 n); score: 1.000 Intron 1 30963 30687 ( 277 n); Pd: 0.970 (s: 1.00), Pa: 0.914 (s: n/a) Exon 2 30686 30675 ( 12 n); cDNA 56 67 ( 12 n); score: 1.000 Intron 2 30674 30144 ( 531 n); Pd: 0.000 (s: n/a), Pa: 0.000 (s: 0.98) Exon 3 30143 29956 ( 188 n); cDNA 68 254 ( 187 n); score: 0.995 MATCH 7001AAGCTTGAAGAACATGAGCCCCCATCTATACTTAAGCTGTCATTGAGACAACAAGTTCTC- gi- 0.996 243 0.957 C PGS_7001AAGCTTGAAGAACATGAGCCCCCATCTATACTTAAGCTGTCATTGAGACAACAAGTTCTC-_gi- (31018 30964,30686 30675,30143 29956) Alignment (genomic DNA sequence = upper lines): TGCTTCCACT GCTATTGCTA AAATGAATAA TTTTCCTTTC TACGACAAGG AGATGGTACC 30959 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| TGCTTCCACT GCTATTGCTA AAATGAATAA TTTTCCTTTC TACGACAAGG AGATG..... 55 AATTCCTTCC ACTCCAGTTA CTTTCTTGAT AACTTCTTCT TACTCGATGT GTTTTGGTTT 30899 .......... .......... .......... .......... .......... .......... 55 GCCTATTTGC CCCTAGAGTT AGTGGAACAC CTAATTAGGG TTTGAATCTA CTGATTAGAG 30839 .......... .......... .......... .......... .......... .......... 55 AGCTTTTCTT AGTGAACCAC ATGGATTCTA TTTGAAAGAC CCTTTATATG TTTAGTGGTT 30779 .......... .......... .......... .......... .......... .......... 55 GCTTTTAGGT TATTGACCAT GTTTTAGGTT GTGGAAATAC TTATCAATCA CTTTGCTATA 30719 .......... .......... .......... .......... .......... .......... 55 TTATTAATTC CTAACTTTAA CAATGTGGAC AGAGAATACA ATATGCCAAA ACAAAATCAG 30659 |||||||| |||| .......... .......... .......... ..AGAATACA ATAT...... .......... 67 ATGTTGTTGC CAAGGCCGAT GGTACATTTG TTCCTCGCGA GAAGAGAAAG AGACATGAGG 30599 .......... .......... .......... .......... .......... .......... 67 AGAAAGGTGA CACTTTTTTT TTACTTCATT GCTTCATTGG CTCACTTTGA CCATTACTGC 30539 .......... .......... .......... .......... .......... .......... 67 ATATGTGTAC ATGAGATTTA TTGCTCTGTT TCTCATTCCC ACATTCTGCA TTTGATCATT 30479 .......... .......... .......... .......... .......... .......... 67 AGTAGTCACA TCACATTCAC ATTGCTTCTC CACAACCTAC AAACAGTTTG TTTTCGCATC 30419 .......... .......... .......... .......... .......... .......... 67 TAAATATCGT CTTTGGAAGT GCAGGAGGCG GCAAGAAAAA GAAAGACCAG CACCATGATT 30359 .......... .......... .......... .......... .......... .......... 67 CTACACAGAT GGGCATGCCC ATGAACTCAG CATATCCAGG TGTCTATGGA GCTGCACCTC 30299 .......... .......... .......... .......... .......... .......... 67 CTGTGAGTTA TTTCAACATC TTCTTAACTC TAATCTTGAT TCATACGTTT GGGGGACGCT 30239 .......... .......... .......... .......... .......... .......... 67 CTTAATTCTT TTCCTGTATT CTGATGATCG TTCTTATTGC CACGCTTTTC ACAGCTATCG 30179 .......... .......... .......... .......... .......... .......... 67 CAAGTACCAT ACCCTGGTGG TATGAAACCC AATATGCCCG AGGCACCAGC TCCGCCAAAT 30119 ||||| ||||||||| |||||||||| .......... .......... .......... .....GCCCG AGGCACCAG- TCCGCCAAAT 91 AATATTCTCT TTGTCCAAAA CCTTCCTCAC GAGACAACTC CAATGGTGCT TCAGATGTTG 30059 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATATTCTCT TTGTCCAAAA CCTTCCTCAC GAGACAACTC CAATGGTGCT TCAGATGTTG 151 TTCTGCCAGT ACCAAGGATT TAAGGAAGTT AGAATGATTG AAGCCAAACC GGGAATCGCC 29999 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGCCAGT ACCAAGGATT TAAGGAAGTT AGAATGATTG AAGCCAAACC GGGAATCGCC 211 TTTGTGGAGT TTGCTGATGA GATGCAGTCG ACGGTCGCAA TGC 29956 |||||||||| |||||||||| |||||||||| |||||||||| ||| TTTGTGGAGT TTGCTGATGA GATGCAGTCG ACGGTCGCAA TGC 254 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 31018 29956 AGS-1 (31018 30964,30686 30675,30143 29956) SCR (e 1.000 d 0.970 a 0.914,e 1.000 d 0.000 a 0.000,e 0.995) Exon 1 31018 30964 ( 55 n); score: 1.000 Intron 1 30963 30687 ( 277 n); Pd: 0.970 Pa: 0.914 Exon 2 30686 30675 ( 12 n); score: 1.000 Intron 2 30674 30144 ( 531 n); Pd: 0.000 Pa: 0.000 Exon 3 30143 29956 ( 188 n); score: 0.995 PGS (31018 30964,30686 30675,30143 29956) gi- 3-phase translation of AGS-1 (-strand): . . . . . . : 31018 TGCTTCCACTGCTATTGCTAAAATGAATAATTTTCCTTTCTACGACAAGGAGATG : AGAAT C F H C Y C - N E - F S F L R Q G D : E N A S T A I A K M N N F P F Y D K E M : R I L P L L L L K - I I F L S T T R R - : E . : . . . . . 30681 ACAATAT : GCCCGAGGCACCAGCTCCGCCAAATAATATTCTCTTTGTCCAAAACCTTCCTC T I : C P R H Q L R Q I I F S L S K T F L Q Y : A R G T S S A K - Y S L C P K P S S Y N M : P E A P A P P N N I L F V Q N L P . . . . . . 30090 ACGAGACAACTCCAATGGTGCTTCAGATGTTGTTCTGCCAGTACCAAGGATTTAAGGAAG T R Q L Q W C F R C C S A S T K D L R K R D N S N G A S D V V L P V P R I - G S H E T T P M V L Q M L F C Q Y Q G F K E . . . . . . 30030 TTAGAATGATTGAAGCCAAACCGGGAATCGCCTTTGTGGAGTTTGCTGATGAGATGCAGT L E - L K P N R E S P L W S L L M R C S - N D - S Q T G N R L C G V C - - D A V V R M I E A K P G I A F V E F A D E M Q . . 29970 CGACGGTCGCAATGC R R S Q C D G R N S T V A M Maximal non-overlapping open reading frames (>= 64 codons): >7001AAGCTTGAAGAACATGAGCCCCCATCTATACTTAAGCTGTCATTGAGACAACAAGTTCTC-_PGL-1_AGS-1_PPS_1 (30685 30675,30143 29957) (frame '0'; 198 bp, 66 residues) 1 EYNMPEAPAP PNNILFVQNL PHETTPMVLQ MLFCQYQGFK EVRMIEAKPG IAFVEFADEM 61 QSTVAM