GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:35:05 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 431 Minimum sequence length: 431 Maximum sequence length: 431 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 1 < 600: 0 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT, from 1 to 108787, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT CTCAGCGCTC 61 GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT GGTATTATCT 121 GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA GAATGTTAAG 181 AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC CGTCTCCGAT 241 ATCGTTACGA GCCGTGGCTA TAAGGAGGAT GTGAGATTGA GCAACCTCAA ACTGATTTTG 301 GGAACGATTA TCATCGTTGT TGCTCTTGTT GCTCAATTCT ACAACAAGAA ATTCCCTGAG 361 AACAGGGATT TTTTGATTGG ATGCATCGCA TTGTATGTAG TGTTGAATGC GGTGTTGCAG 421 CTGATTCTGT A Predicted gene structure (within gDNA segment 52530 to 51165): Exon 1 52230 51991 ( 240 n); cDNA 1 240 ( 240 n); score: 1.000 Intron 1 51990 51920 ( 71 n); Pd: 0.995 (s: 1.00), Pa: 0.989 (s: 1.00) Exon 2 51919 51768 ( 152 n); cDNA 241 392 ( 152 n); score: 1.000 Intron 2 51767 51509 ( 259 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: n/a) Exon 3 51508 51470 ( 39 n); cDNA 393 431 ( 39 n); score: 1.000 MATCH 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT- gi+ 1.000 392 0.910 C PGS_1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_gi+ (52230 51991,51919 51768,51508 51470) Alignment (genomic DNA sequence = upper lines): AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT CTCAGCGCTC 52171 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT CTCAGCGCTC 60 GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT GGTATTATCT 52111 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT GGTATTATCT 120 GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA GAATGTTAAG 52051 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA GAATGTTAAG 180 AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC CGTCTCCGAT 51991 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC CGTCTCCGAT 240 GTAAGAATTT TCTAAGATTT TCCTTGTTCG TTGCTCCATA TTTCTCTGAT ATGTGAATCT 51931 .......... .......... .......... .......... .......... .......... 240 ATGATTCGTA GATCGTTACG AGCCGTGGCT ATAAGGAGGA TGTGAGATTG AGCAACCTCA 51871 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .ATCGTTACG AGCCGTGGCT ATAAGGAGGA TGTGAGATTG AGCAACCTCA 289 AACTGATTTT GGGAACGATT ATCATCGTTG TTGCTCTTGT TGCTCAATTC TACAACAAGA 51811 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACTGATTTT GGGAACGATT ATCATCGTTG TTGCTCTTGT TGCTCAATTC TACAACAAGA 349 AATTCCCTGA GAACAGGGAT TTTTTGATTG GATGCATCGC ATTATATCCT TTTTTCTCGA 51751 |||||||||| |||||||||| |||||||||| |||||||||| ||| AATTCCCTGA GAACAGGGAT TTTTTGATTG GATGCATCGC ATT....... .......... 392 ATTCAATTGA TACTTTCTGC TTTCTGCTTG ACTCTCATTT GGTTGGTTTC CAGTGGCTAA 51691 .......... .......... .......... .......... .......... .......... 392 GATGAAATTT AAGAAAATTG AGATTAGAGA ACTTGTAATG TGGCTCGAAT TCTTTGTGCT 51631 .......... .......... .......... .......... .......... .......... 392 TAGTTAGCAT TAGATTTTGG AAAACCTTCT TGTATCTAAA GTATGACTAT AGAAGCTGAT 51571 .......... .......... .......... .......... .......... .......... 392 CGATATATGT CTTGTATTTG GATAATTCTG AAGAAGATGT TTGGTCCTTG ACTATATGAC 51511 .......... .......... .......... .......... .......... .......... 392 ACGTATGTAG TGTTGAATGC GGTGTTGCAG CTGATTCTGT A 51470 |||||||| |||||||||| |||||||||| |||||||||| | ..GTATGTAG TGTTGAATGC GGTGTTGCAG CTGATTCTGT A 431 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 52230 51470 AGS-1 (52230 51991,51919 51768,51508 51470) SCR (e 1.000 d 0.995 a 0.989,e 1.000 d 0.000 a 0.000,e 1.000) Exon 1 52230 51991 ( 240 n); score: 1.000 Intron 1 51990 51920 ( 71 n); Pd: 0.995 Pa: 0.989 Exon 2 51919 51768 ( 152 n); score: 1.000 Intron 2 51767 51509 ( 259 n); Pd: 0.000 Pa: 0.000 Exon 3 51508 51470 ( 39 n); score: 1.000 PGS (52230 51991,51919 51768,51508 51470) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 52230 AATATATCGTCTTCTTCCTCCTCGAACCAAAAACATCAGTCTCTCCTAGTCTCAGCGCTC N I S S S S S S N Q K H Q S L L V S A L I Y R L L P P R T K N I S L S - S Q R S Y I V F F L L E P K T S V S P S L S A . . . . . . 52170 GTGAAAATCTGAAACTCTGCGTTCGAATTTTCGAAGAACATTAGAGATCTGGTATTATCT V K I - N S A F E F S K N I R D L V L S - K S E T L R S N F R R T L E I W Y Y L R E N L K L C V R I F E E H - R S G I I . . . . . . 52110 GGATCCGATCTGTTCACCATGGAGGAGAAGAAGACAGAATCAACCAACAAGAATGTTAAG G S D L F T M E E K K T E S T N K N V K D P I C S P W R R R R Q N Q P T R M L R W I R S V H H G G E E D R I N Q Q E C - . . . . . . : 52050 AAGGCCAATCTATTGGACCACCACTCGATCAAGCACATCCTTGACGAATCCGTCTCCGAT : K A N L L D H H S I K H I L D E S V S D : R P I Y W T T T R S S T S L T N P S P I : E G Q S I G P P L D Q A H P - R I R L R : . . . . . . 51990 ATCGTTACGAGCCGTGGCTATAAGGAGGATGTGAGATTGAGCAACCTCAAACTGATTTTG I V T S R G Y K E D V R L S N L K L I L S L R A V A I R R M - D - A T S N - F W Y R Y E P W L - G G C E I E Q P Q T D F . . . . . . 51859 GGAACGATTATCATCGTTGTTGCTCTTGTTGCTCAATTCTACAACAAGAAATTCCCTGAG G T I I I V V A L V A Q F Y N K K F P E E R L S S L L L L L L N S T T R N S L R G N D Y H R C C S C C S I L Q Q E I P - . . . . : . . 51799 AACAGGGATTTTTTGATTGGATGCATCGCATT : GTATGTAGTGTTGAATGCGGTGTTGCAG N R D F L I G C I A L : Y V V L N A V L Q T G I F - L D A S H : C M - C - M R C C S E Q G F F D W M H R I : V C S V E C G V A . . 51480 CTGATTCTGTA L I L - F C A D S V Maximal non-overlapping open reading frames (>= 64 codons): >1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_PGL-1_AGS-1_PPS_1 (52158 51991,51919 51768,51508 51472) (frame '1'; 357 bp, 119 residues) 1 NSAFEFSKNI RDLVLSGSDL FTMEEKKTES TNKNVKKANL LDHHSIKHIL DESVSDIVTS 61 RGYKEDVRLS NLKLILGTII IVVALVAQFY NKKFPENRDF LIGCIALYVV LNAVLQLIL