GeneSeqer.   Version of October 12, 2001.
Date run: Wed Feb 26 16:35:15 2003

(Bayesian) Splice site model (species): Arabidopsis thaliana
Fast search parameters: MinMatchLen 12, MinQuality 30.

Total number of ESTs:                     1
Total sequence length:                  468
Minimum sequence length:                468
Maximum sequence length:                468

Length distribution (number of sequences of specified length):
  <  100:            0
  <  200:            0
  <  300:            0
  <  400:            0
  <  500:            1
  <  600:            0
  <  700:            0
  <  800:            0
  <  900:            0
  < 1000:            0
  >=1000:            0

________________________________________________________________________________
Sequence 1: 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT, from 1 to 108787, both strands analyzed.

EST library file: cdna.seq; matching gDNA +strand ...
Found all matches, elapsed seconds = 0
Matches indexed, elapsed seconds = 0
HitsTableSize = 0

EST library file: cdna.seq; matching gDNA -strand ...
Found all matches, elapsed seconds = 1
Matches indexed, elapsed seconds = 1
HitsTableSize = 1

********************************************************************************

EST sequence 1 +strand (File: gi+)

       1  GAGCTCCATC CAATATATCG TCTTCTTCCT CCTCGAACCA AAAACATCAG TCTCTCCTAG
      61  TCTCAGCGCT CGTGAAAATC TGAAACTCTG CGTTCGAATT TTCGAAGAAC ATTAGAGATC
     121  TGGTATTATC TGGATCCGAT CTGTTCACCA TGGAGGAGAA GAAGACAGAA TCAACCAACA
     181  AGAATGTTAA GAAGGCCAAT CTATTGGACC ACCACTCGAT CAAGCACATC CTTGACGAAT
     241  CCGTCTCCGA TATCGTTACG AGCCGTGGCT ATAAGGAGGA TGTGAGATTG AGCAACCTCA
     301  AACTGATTTT GGGAACGATT ATCATCGTTG TTGCTCTTGT TGCTCAATTC TACAACAAGA
     361  AATTCCCTGA GAACAGGGAT TTTTTGATTG GATGCATCGC ATTGTATGTA GTGTTGAATG
     421  CGGTGTTGCA NCTGATTCTG NACACTAAGG AGAAGAATGC GATTNTGT

Predicted gene structure (within gDNA segment 52545 to 51123):

 Exon 1 52241 51991 ( 251 n); cDNA 1 251 ( 251 n); score: 0.996
  Intron 1 51990 51920 ( 71 n); Pd: 0.995 (s: 1.00), Pa: 0.989 (s: 1.00)
 Exon 2 51919 51768 ( 152 n); cDNA 252 403 ( 152 n); score: 1.000
  Intron 2 51767 51509 ( 259 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: 0.96)
 Exon 3 51508 51444 ( 65 n); cDNA 404 468 ( 65 n); score: 0.954

MATCH 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT- gi+ 0.991 468 1.000 C
PGS_1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_gi+ (52241 51991,51919 51768,51508 51444)

Alignment (genomic DNA sequence = upper lines):

TAGCTCCATC CAATATATCG TCTTCTTCCT CCTCGAACCA AAAACATCAG TCTCTCCTAG  52182
 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
GAGCTCCATC CAATATATCG TCTTCTTCCT CCTCGAACCA AAAACATCAG TCTCTCCTAG     60

TCTCAGCGCT CGTGAAAATC TGAAACTCTG CGTTCGAATT TTCGAAGAAC ATTAGAGATC  52122
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TCTCAGCGCT CGTGAAAATC TGAAACTCTG CGTTCGAATT TTCGAAGAAC ATTAGAGATC    120

TGGTATTATC TGGATCCGAT CTGTTCACCA TGGAGGAGAA GAAGACAGAA TCAACCAACA  52062
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
TGGTATTATC TGGATCCGAT CTGTTCACCA TGGAGGAGAA GAAGACAGAA TCAACCAACA    180

AGAATGTTAA GAAGGCCAAT CTATTGGACC ACCACTCGAT CAAGCACATC CTTGACGAAT  52002
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
AGAATGTTAA GAAGGCCAAT CTATTGGACC ACCACTCGAT CAAGCACATC CTTGACGAAT    240

CCGTCTCCGA TGTAAGAATT TTCTAAGATT TTCCTTGTTC GTTGCTCCAT ATTTCTCTGA  51942
|||||||||| |
CCGTCTCCGA T......... .......... .......... .......... ..........    251

TATGTGAATC TATGATTCGT AGATCGTTAC GAGCCGTGGC TATAAGGAGG ATGTGAGATT  51882
                        |||||||| |||||||||| |||||||||| ||||||||||
.......... .......... ..ATCGTTAC GAGCCGTGGC TATAAGGAGG ATGTGAGATT    289

GAGCAACCTC AAACTGATTT TGGGAACGAT TATCATCGTT GTTGCTCTTG TTGCTCAATT  51822
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||||
GAGCAACCTC AAACTGATTT TGGGAACGAT TATCATCGTT GTTGCTCTTG TTGCTCAATT    349

CTACAACAAG AAATTCCCTG AGAACAGGGA TTTTTTGATT GGATGCATCG CATTATATCC  51762
|||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||
CTACAACAAG AAATTCCCTG AGAACAGGGA TTTTTTGATT GGATGCATCG CATT......    403

TTTTTTCTCG AATTCAATTG ATACTTTCTG CTTTCTGCTT GACTCTCATT TGGTTGGTTT  51702

.......... .......... .......... .......... .......... ..........    403

CCAGTGGCTA AGATGAAATT TAAGAAAATT GAGATTAGAG AACTTGTAAT GTGGCTCGAA  51642

.......... .......... .......... .......... .......... ..........    403

TTCTTTGTGC TTAGTTAGCA TTAGATTTTG GAAAACCTTC TTGTATCTAA AGTATGACTA  51582

.......... .......... .......... .......... .......... ..........    403

TAGAAGCTGA TCGATATATG TCTTGTATTT GGATAATTCT GAAGAAGATG TTTGGTCCTT  51522

.......... .......... .......... .......... .......... ..........    403

GACTATATGA CACGTATGTA GTGTTGAATG CGGTGTTGCA GCTGATTCTG TACACTAAGG  51462
              ||||||| |||||||||| ||||||||||  |||||||||  |||||||||
.......... ...GTATGTA GTGTTGAATG CGGTGTTGCA NCTGATTCTG NACACTAAGG    450

AGAAGAATGC GATTTTGT                                                51444
|||||||||| |||| |||
AGAAGAATGC GATTNTGT                                                  468

Total number of EST alignments reported: 1

________________________________________________________________________________

Predicted gene locations (1):

PGL 1 (- strand): 52241 51444
AGS-1 (52241 51991,51919 51768,51508 51444)
SCR (e 0.996 d 0.995 a 0.989,e 1.000 d 0.000 a 0.000,e 0.954)

 Exon 1 52241 51991 ( 251 n); score: 0.996
  Intron 1 51990 51920 ( 71 n); Pd: 0.995 Pa: 0.989
 Exon 2 51919 51768 ( 152 n); score: 1.000
  Intron 2 51767 51509 ( 259 n); Pd: 0.000 Pa: 0.000
 Exon 3 51508 51444 ( 65 n); score: 0.954

 PGS (52241 51991,51919 51768,51508 51444) gi+

3-phase translation of AGS-1 (-strand):

          .         .         .         .         .         .
   52241  TAGCTCCATCCAATATATCGTCTTCTTCCTCCTCGAACCAAAAACATCAGTCTCTCCTAG
           -  L  H  P  I  Y  R  L  L  P  P  R  T  K  N  I  S  L  S  -
            S  S  I  Q  Y  I  V  F  F  L  L  E  P  K  T  S  V  S  P  S
             A  P  S  N  I  S  S  S  S  S  S  N  Q  K  H  Q  S  L  L

          .         .         .         .         .         .
   52181  TCTCAGCGCTCGTGAAAATCTGAAACTCTGCGTTCGAATTTTCGAAGAACATTAGAGATC
           S  Q  R  S  -  K  S  E  T  L  R  S  N  F  R  R  T  L  E  I
            L  S  A  R  E  N  L  K  L  C  V  R  I  F  E  E  H  -  R  S
          V  S  A  L  V  K  I  -  N  S  A  F  E  F  S  K  N  I  R  D

          .         .         .         .         .         .
   52121  TGGTATTATCTGGATCCGATCTGTTCACCATGGAGGAGAAGAAGACAGAATCAACCAACA
           W  Y  Y  L  D  P  I  C  S  P  W  R  R  R  R  Q  N  Q  P  T
            G  I  I  W  I  R  S  V  H  H  G  G  E  E  D  R  I  N  Q  Q
          L  V  L  S  G  S  D  L  F  T  M  E  E  K  K  T  E  S  T  N

          .         .         .         .         .         .
   52061  AGAATGTTAAGAAGGCCAATCTATTGGACCACCACTCGATCAAGCACATCCTTGACGAAT
           R  M  L  R  R  P  I  Y  W  T  T  T  R  S  S  T  S  L  T  N
            E  C  -  E  G  Q  S  I  G  P  P  L  D  Q  A  H  P  -  R  I
          K  N  V  K  K  A  N  L  L  D  H  H  S  I  K  H  I  L  D  E

          .         . :          .         .         .         .
   52001  CCGTCTCCGAT : ATCGTTACGAGCCGTGGCTATAAGGAGGATGTGAGATTGAGCAACCTCA
           P  S  P  I :   S  L  R  A  V  A  I  R  R  M  -  D  -  A  T  S
            R  L  R   : Y  R  Y  E  P  W  L  -  G  G  C  E  I  E  Q  P  Q
          S  V  S  D  :  I  V  T  S  R  G  Y  K  E  D  V  R  L  S  N  L

          .         .         .         .         .         .
   51870  AACTGATTTTGGGAACGATTATCATCGTTGTTGCTCTTGTTGCTCAATTCTACAACAAGA
           N  -  F  W  E  R  L  S  S  L  L  L  L  L  L  N  S  T  T  R
            T  D  F  G  N  D  Y  H  R  C  C  S  C  C  S  I  L  Q  Q  E
          K  L  I  L  G  T  I  I  I  V  V  A  L  V  A  Q  F  Y  N  K

          .         .         .         .         .   :        .
   51810  AATTCCCTGAGAACAGGGATTTTTTGATTGGATGCATCGCATT : GTATGTAGTGTTGAATG
           N  S  L  R  T  G  I  F  -  L  D  A  S  H   : C  M  -  C  -  M
            I  P  -  E  Q  G  F  F  D  W  M  H  R  I  :  V  C  S  V  E  C
          K  F  P  E  N  R  D  F  L  I  G  C  I  A  L :   Y  V  V  L  N

          .         .         .         .         .
   51491  CGGTGTTGCAGCTGATTCTGTACACTAAGGAGAAGAATGCGATTTTGT
           R  C  C  S  -  F  C  T  L  R  R  R  M  R  F  C
            G  V  A  A  D  S  V  H  -  G  E  E  C  D  F
          A  V  L  Q  L  I  L  Y  T  K  E  K  N  A  I  L

Maximal non-overlapping open reading frames (>= 64 codons):

>1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_PGL-1_AGS-1_PPS_1 (52158 51991,51919 51768,51508 51445) (frame '0'; 384 bp, 128 residues)
       1  NSAFEFSKNI RDLVLSGSDL FTMEEKKTES TNKNVKKANL LDHHSIKHIL DESVSDIVTS
      61  RGYKEDVRLS NLKLILGTII IVVALVAQFY NKKFPENRDF LIGCIALYVV LNAVLQLILY
     121  TKEKNAIL