GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:38:08 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 536 Minimum sequence length: 536 Maximum sequence length: 536 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 1 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 48845AAGCTTAGGTTACTTGATGTGTCTAACAACGATTTTTATGGAATACCGCCAAAGTTTCGG, from 1 to 89078, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 ******************************************************************************** EST sequence 1 -strand (File: gi-) 1 AGAGAAACGT GGGCTAGTCA TNTACAGAAG TCCCGATTGT ATTTGAAGAG ATTAAGCGGT 61 GAAGCTTCGC AGAGTAATGA CTCTGAGTCT ACCAAAAGAT ATGAAAACAT TCAAGCTCTG 121 GTTTCCTCAG GACAATTACA TCCACAGACA TTAGCTGCAT TGTTTGGTCA ACCAATAGAC 181 AATCATCACT CTGCTAGTTT TGGAGTTTGG ATTCCTAATG ACAATCTTGG TAGATCTCAA 241 AATGAACACT TCTCGGTTGA TGTATCATCC GCTTCTAACC GTCCTGTTTC TGTGGCGGTT 301 CATGGTCTAT CTTCCTCAGA CAAGGGTATG GATCAAATGT CAATGAAGAA TCTTGGATCT 361 TGGAAAGATC TTCAAGACAA AGATAGTGAC CAATATGGAT ACGCTTAAGA AGAGAAGGCT 421 CTGCAACTGT TTCTTTTAAA TCTTCAGGAA TGGATTAATG TCAGAAGAAA ACTTGTGTGT 481 CTGCAATGTT AGCTTAGGAA AGTAATTATG TAACCAATAT TTTGAATAAG TATCTC Predicted gene structure (within gDNA segment 25733 to 24386): Exon 1 25380 25354 ( 27 n); cDNA 1 27 ( 27 n); score: 0.815 Intron 1 25353 25274 ( 80 n); Pd: 0.672 (s: n/a), Pa: 0.903 (s: 0.98) Exon 2 25273 24982 ( 292 n); cDNA 28 319 ( 292 n); score: 0.997 Intron 2 24981 24939 ( 43 n); Pd: 0.000 (s: 1.00), Pa: 0.001 (s: 1.00) Exon 3 24938 24722 ( 217 n); cDNA 320 536 ( 217 n); score: 0.986 MATCH 48845AAGCTTAGGTTACTTGATGTGTCTAACAACGATTTTTATGGAATACCGCCAAAGTTTCGG- gi- 0.992 509 0.950 C PGS_48845AAGCTTAGGTTACTTGATGTGTCTAACAACGATTTTTATGGAATACCGCCAAAGTTTCGG-_gi- (25380 25354,25273 24982,24938 24722) Alignment (genomic DNA sequence = upper lines): AGAGAAAACG TGGCTAGTCA TTTACAGGTT TCTTTGATTG ATTGATTGAA TCTTTCTTCT 25321 ||||||| ||||||||| | ||||| AGAGAAACGT GGGCTAGTCA TNTACAG... .......... .......... .......... 27 TATTAGGGTC TCTATAAGGG TTATGACTGA ATCAATACCA TTTGCAGAAG TTCCGATTGT 25261 ||| | |||||||| .......... .......... .......... .......... .......AAG TCCCGATTGT 40 ATTTGAAGAG ATTAAGCGGT GAAGCTTCGC AGAGTAATGA CTCTGAGTCT ACCAAAAGAT 25201 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGAAGAG ATTAAGCGGT GAAGCTTCGC AGAGTAATGA CTCTGAGTCT ACCAAAAGAT 100 ATGAAAACAT TCAAGCTCTG GTTTCCTCAG GACAATTACA TCCACAGACA TTAGCTGCAT 25141 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGAAAACAT TCAAGCTCTG GTTTCCTCAG GACAATTACA TCCACAGACA TTAGCTGCAT 160 TGTTTGGTCA ACCAATAGAC AATCATCACT CTGCTAGTTT TGGAGTTTGG ATTCCTAATG 25081 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTTGGTCA ACCAATAGAC AATCATCACT CTGCTAGTTT TGGAGTTTGG ATTCCTAATG 220 ACAATCTTGG TAGATCTCAA AATGAACACT TCTCGGTTGA TGTATCATCC GCTTCTAACC 25021 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAATCTTGG TAGATCTCAA AATGAACACT TCTCGGTTGA TGTATCATCC GCTTCTAACC 280 GTCCTGTTTC TGTGGCGGTT CATGGTCTAT CTTCCTCAGC AAATTTCAGA CAGAGAGGTG 24961 |||||||||| |||||||||| |||||||||| ||||||||| GTCCTGTTTC TGTGGCGGTT CATGGTCTAT CTTCCTCAG. .......... .......... 319 ACGTTAACAA CAACAGAATC AGACAAGGGT ATGGATCAAA TGTCAATGAA GAATCTTGGA 24901 |||||||| |||||||||| |||||||||| |||||||||| .......... .......... ..ACAAGGGT ATGGATCAAA TGTCAATGAA GAATCTTGGA 357 TCTTGGAAAG ATCTTCAAGA CAAAGATAGT GACCAATATG GATACGCTTA AGAAGAGAAG 24841 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTGGAAAG ATCTTCAAGA CAAAGATAGT GACCAATATG GATACGCTTA AGAAGAGAAG 417 GCTCTGCAAC TGTTTCTTTT AAATCTTCAG GAATGGATTA ATGTCAGAAG AAAACTTGTG 24781 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTCTGCAAC TGTTTCTTTT AAATCTTCAG GAATGGATTA ATGTCAGAAG AAAACTTGTG 477 TGTCTGCAAT GTTAGCTTAG GAAAGTAATT ATGTAACCAA TATTTTGAAT TGTTATCTC 24722 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||| TGTCTGCAAT GTTAGCTTAG GAAAGTAATT ATGTAACCAA TATTTTGAAT AAGTATCTC 536 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 25380 24722 AGS-1 (25380 25354,25273 24982,24938 24722) SCR (e 0.815 d 0.672 a 0.903,e 0.997 d 0.000 a 0.001,e 0.986) Exon 1 25380 25354 ( 27 n); score: 0.815 Intron 1 25353 25274 ( 80 n); Pd: 0.672 Pa: 0.903 Exon 2 25273 24982 ( 292 n); score: 0.997 Intron 2 24981 24939 ( 43 n); Pd: 0.000 Pa: 0.001 Exon 3 24938 24722 ( 217 n); score: 0.986 PGS (25380 25354,25273 24982,24938 24722) gi- 3-phase translation of AGS-1 (-strand): . . . : . . . 25380 AGAGAAAACGTGGCTAGTCATTTACAG : AAGTTCCGATTGTATTTGAAGAGATTAAGCGGT R E N V A S H L Q : K F R L Y L K R L S G E K T W L V I Y R : S S D C I - R D - A V R K R G - S F T : E V P I V F E E I K R . . . . . . 25240 GAAGCTTCGCAGAGTAATGACTCTGAGTCTACCAAAAGATATGAAAACATTCAAGCTCTG E A S Q S N D S E S T K R Y E N I Q A L K L R R V M T L S L P K D M K T F K L W - S F A E - - L - V Y Q K I - K H S S S . . . . . . 25180 GTTTCCTCAGGACAATTACATCCACAGACATTAGCTGCATTGTTTGGTCAACCAATAGAC V S S G Q L H P Q T L A A L F G Q P I D F P Q D N Y I H R H - L H C L V N Q - T G F L R T I T S T D I S C I V W S T N R . . . . . . 25120 AATCATCACTCTGCTAGTTTTGGAGTTTGGATTCCTAATGACAATCTTGGTAGATCTCAA N H H S A S F G V W I P N D N L G R S Q I I T L L V L E F G F L M T I L V D L K Q S S L C - F W S L D S - - Q S W - I S . . . . . . 25060 AATGAACACTTCTCGGTTGATGTATCATCCGCTTCTAACCGTCCTGTTTCTGTGGCGGTT N E H F S V D V S S A S N R P V S V A V M N T S R L M Y H P L L T V L F L W R F K - T L L G - C I I R F - P S C F C G G . . : . . . . 25000 CATGGTCTATCTTCCTCAG : ACAAGGGTATGGATCAAATGTCAATGAAGAATCTTGGATCT H G L S S S : D K G M D Q M S M K N L G S M V Y L P Q : T R V W I K C Q - R I L D L S W S I F L R : Q G Y G S N V N E E S W I . . . . . . 24897 TGGAAAGATCTTCAAGACAAAGATAGTGACCAATATGGATACGCTTAAGAAGAGAAGGCT W K D L Q D K D S D Q Y G Y A - E E K A G K I F K T K I V T N M D T L K K R R L L E R S S R Q R - - P I W I R L R R E G . . . . . . 24837 CTGCAACTGTTTCTTTTAAATCTTCAGGAATGGATTAATGTCAGAAGAAAACTTGTGTGT L Q L F L L N L Q E W I N V R R K L V C C N C F F - I F R N G L M S E E N L C V S A T V S F K S S G M D - C Q K K T C V . . . . . . 24777 CTGCAATGTTAGCTTAGGAAAGTAATTATGTAACCAATATTTTGAATTGTTATCTC L Q C - L R K V I M - P I F - I V I C N V S L G K - L C N Q Y F E L L S S A M L A - E S N Y V T N I L N C Y L Maximal non-overlapping open reading frames (>= 64 codons): >48845AAGCTTAGGTTACTTGATGTGTCTAACAACGATTTTTATGGAATACCGCCAAAGTTTCGG-_PGL-1_AGS-1_PPS_1 (25380 25354,25273 24982,24938 24850) (frame '1'; 405 bp, 135 residues) 1 RENVASHLQK FRLYLKRLSG EASQSNDSES TKRYENIQAL VSSGQLHPQT LAALFGQPID 61 NHHSASFGVW IPNDNLGRSQ NEHFSVDVSS ASNRPVSVAV HGLSSSDKGM DQMSMKNLGS 121 WKDLQDKDSD QYGYA-