GeneSeqer. Version of October 12, 2001.
Date run: Wed Feb 26 16:37:22 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs: 1
Total sequence length: 627
Minimum sequence length: 627
Maximum sequence length: 627

Length distribution (number of sequences of specified length):
< 100: 0
< 200: 0
< 300: 0
< 400: 0
< 500: 0
< 600: 0
< 700: 1
< 800: 0
< 900: 0
< 1000: 0
>=1000: 0
________________________________________________________________________________

Sequence 1: 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA, from 1 to 118136, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 1
EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

********************************************************************************
EST sequence 1 -strand (File: gi-)

1 TNAATGATGA AGAATGGGGT TTATGATGGA AATCTCTCAC AGAAGTTCCC TNAATTTTCT
61 ATTGATCTCT ACTTAGGGCA CGAGTTTGAG ATTTCTCACA ACAGTACAAG CAGTGTCNAT
121 CTTATTGGTT ACAGGACCTT TGATGCTTTT GACGAACTGG ATGAGGAGAT TGATTCTGAT
181 TCTGAGTTAG ATGAATATAT GGAACAACAA ATTGCTGCTT TGCCTCAAAA TGAGATCAAT
241 CCTGAAGAAG ATGATGAATC CGACTCAGAT GAGATGGGTT TGGACGAGGA TGATGACTCT
301 TCAGATGAAG AAGATGTAGA GGCTGAAGCA CCTTTAAAGG TGGCTCCTCC GAGCAAAAAG
361 ATGCCAAATG GTGCATTTGA GATAGCTAAA GGTGGAAAGA AGAACAAGTC ATCAGGAGGG
421 AAGAAGAGAT GCCCATTCCC TTGTGGTCCC TCTTGCAAAA AGTAGAAGAT ATTTGCACAC
481 CAAGTCACAT TTTTCCAATA GAAATTTTTA CTTGACTGTA TTGGTGAATC GTTGAGTGAC
541 TTATGAGGCT TTTGGCATCT TAAAATTTTT GGATTATTAT AATATATGTT ATGTTGTCAT
601 TTTGGAGTGT TAATAGGGTT TAAATGA

Predicted gene structure (within gDNA segment 17258 to 15449):

Exon 1 16851 16692 ( 160 n); cDNA 1 158 ( 158 n); score: 0.931
Intron 1 16691 16580 ( 112 n); Pd: 0.000 (s: 0.98), Pa: 0.000 (s: n/a)
Exon 2 16579 16560 ( 20 n); cDNA 159 178 ( 20 n); score: 1.000
Intron 2 16559 16476 ( 84 n); Pd: 0.996 (s: n/a), Pa: 0.980 (s: n/a)
Exon 3 16475 16440 ( 36 n); cDNA 179 214 ( 36 n); score: 1.000
Intron 3 16439 16347 ( 93 n); Pd: 0.950 (s: n/a), Pa: 0.996 (s: 1.00)
Exon 4 16346 16288 ( 59 n); cDNA 215 273 ( 59 n); score: 1.000
Intron 4 16287 16197 ( 91 n); Pd: 0.999 (s: 1.00), Pa: 1.000 (s: 1.00)
Exon 5 16196 16055 ( 142 n); cDNA 274 415 ( 142 n); score: 1.000
Intron 5 16054 15960 ( 95 n); Pd: 0.993 (s: 1.00), Pa: 0.987 (s: 1.00)
Exon 6 15959 15749 ( 211 n); cDNA 416 627 ( 212 n); score: 0.991

MATCH 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA- gi- 0.977 572 0.912 C
PGS_42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_gi- (16851 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15749)

Alignment (genomic DNA sequence = upper lines):

TTGGTGATGA TGAGAATGGG TTTATGATTG GAAATCTCTC ACAGAAGTTT CCTCAATTTT 16792
|   ||||||  || |  ||| ||||||| || |||||||||| |||||||||  ||| ||||||
TNAATGATGA AGA-ATGGGG TTTATGA-TG GAAATCTCTC ACAGAAGTTC CCTNAATTTT 58

CTATTGATCT CTACTTAGGG CACGAGTTTG AGATTTCTCA CAACAGTACA AGCAGTGTCT 16732
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||
CTATTGATCT CTACTTAGGG CACGAGTTTG AGATTTCTCA CAACAGTACA AGCAGTGTCN 118

ATCTTATTGG TTACAGGACC TTTGATGCTT TTGACGAACT ATATCCTTTC TTTCATTGTC 16672
|||||||||| |||||||||| |||||||||| ||||||||||
ATCTTATTGG TTACAGGACC TTTGATGCTT TTGACGAACT .......... .......... 158

TTTTTCTAGT TAAATTCATA CTCACTTGCA ATTTTGAAGC TTTGTTTGTT TATCCTTGAT 16612
.......... .......... .......... .......... .......... .......... 158

TTTGTTTTGC TCTGCTTTTT TCTGATGCTC ACGGATGAGG AGATTGATTC TGGTGAGTTT 16552
                                   |||||||| |||||||||| ||
.......... .......... .......... ..GGATGAGG AGATTGATTC TG........ 178

ATAATTCTTT AGTTGATGAT TTGGTTTACT CTATTATGAA GCTTGAAGCT AATCTTTTGC 16492
.......... .......... .......... .......... .......... .......... 178

TTTTTGGTTT GTGTAGATTC TGAGTTAGAT GAATATATGG AACAACAAAT TGGTATGTTT 16432
                 |||| |||||||||| |||||||||| |||||||||| ||
.......... ......ATTC TGAGTTAGAT GAATATATGG AACAACAAAT TG........ 214

ACTTTTTTTA CTAAGCTACT GCTTTGTGGT AGTCATATTT ACATGTGTAT CTCTTATGAA 16372
.......... .......... .......... .......... .......... .......... 214

ATCTAACTTT GACATTGTTG AAAAGCTGCT TTGCCTCAAA ATGAGATCAA TCCTGAAGAA 16312
                           ||||| |||||||||| |||||||||| ||||||||||
.......... .......... .....CTGCT TTGCCTCAAA ATGAGATCAA TCCTGAAGAA 249

GATGATGAAT CCGACTCAGA TGAGGTAGAT TTTTCTTTTA AGTATTTACT GATTAGCAAT 16252
|||||||||| |||||||||| ||||
GATGATGAAT CCGACTCAGA TGAG...... .......... .......... .......... 273

GAATCAGTAT AAAAAAAAAG GTAATCTTGT TGTTTTTTTC TGTATGTTTG GTTAGATGGG 16192
                                                            |||||
.......... .......... .......... .......... .......... .....ATGGG 278

TTTGGACGAG GATGATGACT CTTCAGATGA AGAAGATGTA GAGGCTGAAG CACCTTTAAA 16132
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TTTGGACGAG GATGATGACT CTTCAGATGA AGAAGATGTA GAGGCTGAAG CACCTTTAAA 338

GGTGGCTCCT CCGAGCAAAA AGATGCCAAA TGGTGCATTT GAGATAGCTA AAGGTGGAAA 16072
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
GGTGGCTCCT CCGAGCAAAA AGATGCCAAA TGGTGCATTT GAGATAGCTA AAGGTGGAAA 398

GAAGAACAAG TCATCAGGTA ACGAAAACTA AAGTTAAGAT ATATAAAGAT AACTTTGCAT 16012
|||||||||| |||||||
GAAGAACAAG TCATCAG... .......... .......... .......... .......... 415

ATGAGATGAG AATTGTGGAT TATTGATTTG GTTTAAAACC ATTTTGGTGC AGGAGGGAAG 15952
                                                         ||||||||
.......... .......... .......... .......... .......... ..GAGGGAAG 423

AAGAGATGCC CATTCCCTTG TGGTCCCTCT TGCAAAAAGT AGAAGATATT TGCACACCAA 15892
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AAGAGATGCC CATTCCCTTG TGGTCCCTCT TGCAAAAAGT AGAAGATATT TGCACACCAA 483

GTCACATTTT TCCAATAGAA ATTTTTACTT GACTGTATTG GTGAATCGTT GAGTGACTTA 15832
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
GTCACATTTT TCCAATAGAA ATTTTTACTT GACTGTATTG GTGAATCGTT GAGTGACTTA 543

TGAGGCTTTT GGCATCTTAA AATTTTTGGA TTATTATAAT ATATGTTATG TTGTCATTTT 15772
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TGAGGCTTTT GGCATCTTAA AATTTTTGGA TTATTATAAT ATATGTTATG TTGTCATTTT 603

GGAGTGTTAA T-GGGTTTAA ATGA 15749
|||||||||| | |||||||| ||||
GGAGTGTTAA TAGGGTTTAA ATGA 627

Total number of EST alignments reported: 1
________________________________________________________________________________

Predicted gene locations (1):

PGL 1 (- strand): 16851 15749
AGS-1 (16851 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15749)
SCR (e 0.931 d 0.000 a 0.000,e 1.000 d 0.996 a 0.980,e 1.000 d 0.950 a 0.996,e 1.000 d 0.999 a 1.000,e 1.000 d 0.993 a 0.987,e 0.991)

Exon 1 16851 16692 ( 160 n); score: 0.931
Intron 1 16691 16580 ( 112 n); Pd: 0.000 Pa: 0.000
Exon 2 16579 16560 ( 20 n); score: 1.000
Intron 2 16559 16476 ( 84 n); Pd: 0.996 Pa: 0.980
Exon 3 16475 16440 ( 36 n); score: 1.000
Intron 3 16439 16347 ( 93 n); Pd: 0.950 Pa: 0.996
Exon 4 16346 16288 ( 59 n); score: 1.000
Intron 4 16287 16197 ( 91 n); Pd: 0.999 Pa: 1.000
Exon 5 16196 16055 ( 142 n); score: 1.000
Intron 5 16054 15960 ( 95 n); Pd: 0.993 Pa: 0.987
Exon 6 15959 15749 ( 211 n); score: 0.991

PGS (16851 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15749) gi-

3-phase translation of AGS-1 (-strand):

. . . . . .
16851 TTGGTGATGATGAGAATGGGTTTATGATTGGAAATCTCTCACAGAAGTTTCCTCAATTTT
L V M M R M G L - L E I S H R S F L N F
W - - - E W V Y D W K S L T E V S S I F
G D D E N G F M I G N L S Q K F P Q F

. . . . . .
16791 CTATTGATCTCTACTTAGGGCACGAGTTTGAGATTTCTCACAACAGTACAAGCAGTGTCT
L L I S T - G T S L R F L T T V Q A V S
Y - S L L R A R V - D F S Q Q Y K Q C L
S I D L Y L G H E F E I S H N S T S S V

. . . . : . . :
16731 ATCTTATTGGTTACAGGACCTTTGATGCTTTTGACGAACT : GGATGAGGAGATTGATTCTG :
I L L V T G P L M L L T N : W M R R L I L :
S Y W L Q D L - C F - R T : G - G D - F - :
Y L I G Y R T F D A F D E L : D E E I D S :

. . . . : . .
16559 ATTCTGAGTTAGATGAATATATGGAACAACAAATTG : CTGCTTTGCCTCAAAATGAGATCA
I L S - M N I W N N K L : L L C L K M R S
F - V R - I Y G T T N C : C F A S K - D Q
D S E L D E Y M E Q Q I : A A L P Q N E I

. . . . : . .
16322 ATCCTGAAGAAGATGATGAATCCGACTCAGATGAG : ATGGGTTTGGACGAGGATGATGACT
I L K K M M N P T Q M R : W V W T R M M T
S - R R - - I R L R - : D G F G R G - - L
N P E E D D E S D S D E : M G L D E D D D

. . . . . .
16171 CTTCAGATGAAGAAGATGTAGAGGCTGAAGCACCTTTAAAGGTGGCTCCTCCGAGCAAAA
L Q M K K M - R L K H L - R W L L R A K
F R - R R C R G - S T F K G G S S E Q K
S S D E E D V E A E A P L K V A P P S K

. . . . . . :
16111 AGATGCCAAATGGTGCATTTGAGATAGCTAAAGGTGGAAAGAAGAACAAGTCATCAG : GAG
R C Q M V H L R - L K V E R R T S H Q : E
D A K W C I - D S - R W K E E Q V I R : R
K M P N G A F E I A K G G K K N K S S : G

. . . . . .
15956 GGAAGAAGAGATGCCCATTCCCTTGTGGTCCCTCTTGCAAAAAGTAGAAGATATTTGCAC
G R R D A H S L V V P L A K S R R Y L H
E E E M P I P L W S L L Q K V E D I C T
G K K R C P F P C G P S C K K - K I F A

. . . . . .
15896 ACCAAGTCACATTTTTCCAATAGAAATTTTTACTTGACTGTATTGGTGAATCGTTGAGTG
T K S H F S N R N F Y L T V L V N R - V
P S H I F P I E I F T - L Y W - I V E -
H Q V T F F Q - K F L L D C I G E S L S

. . . . . .
15836 ACTTATGAGGCTTTTGGCATCTTAAAATTTTTGGATTATTATAATATATGTTATGTTGTC
T Y E A F G I L K F L D Y Y N I C Y V V
L M R L L A S - N F W I I I I Y V M L S
D L - G F W H L K I F G L L - Y M L C C

. . .
15776 ATTTTGGAGTGTTAATGGGTTTAAATGA
I L E C - W V - M
F W S V N G F K -
H F G V L M G L N

Maximal non-overlapping open reading frames (>= 64 codons):

>42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_PGL-1_AGS-1_PPS_1 (16849 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15910) (frame '0'; 462 bp, 154 residues)
1 GDDENGFMIG NLSQKFPQFS IDLYLGHEFE ISHNSTSSVY LIGYRTFDAF DELDEEIDSD
61 SELDEYMEQQ IAALPQNEIN PEEDDESDSD EMGLDEDDDS SDEEDVEAEA PLKVAPPSKK
121 MPNGAFEIAK GGKKNKSSGG KKRCPFPCGP SCKK-
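
Note on the coordinate lists: the exon and intron lengths reported above follow directly from the AGS-1 coordinate pairs. The Python sketch below recomputes them as a consistency check. It is only an illustration under stated assumptions: the helper name segment_lengths is hypothetical and not part of GeneSeqer, and the input is assumed to be a parenthesised, comma-separated list of "start end" pairs exactly as printed in the AGS/PGS lines.

```python
# Sketch: derive exon and intron lengths from an AGS/PGS coordinate string.
# `segment_lengths` is a hypothetical helper, not a GeneSeqer function.

def segment_lengths(ags: str):
    """Return (exon_lengths, intron_lengths) for a string such as
    '(16851 16692,16579 16560)'.

    Absolute differences are used, so the same code applies to a
    minus-strand structure (descending coordinates) or a plus-strand one.
    """
    # Split "(a b,c d,...)" into integer (start, end) pairs.
    pairs = [tuple(int(x) for x in part.split())
             for part in ags.strip("() ").split(",")]
    # Exon length is inclusive of both end coordinates.
    exons = [abs(end - start) + 1 for start, end in pairs]
    # Intron length is the gap between one exon's end and the next exon's start.
    introns = [abs(nxt[0] - cur[1]) - 1 for cur, nxt in zip(pairs, pairs[1:])]
    return exons, introns


AGS_1 = "(16851 16692,16579 16560,16475 16440,16346 16288,16196 16055,15959 15749)"
exons, introns = segment_lengths(AGS_1)
print(exons)    # [160, 20, 36, 59, 142, 211]
print(introns)  # [112, 84, 93, 91, 95]
```

These values agree with the "( ... n)" columns of the exon/intron listing for PGL 1.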