GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:35:09 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 862 Minimum sequence length: 862 Maximum sequence length: 862 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 0 < 700: 0 < 800: 0 < 900: 1 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT, from 1 to 108787, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 AGCTCCATCC AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT 61 CTCAGCGCTC GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT 121 GGTATTATCT GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA 181 GAATGTTAAG AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC 241 CGTCTCCGAT ATCGTTACGA GCCGTGGCTA TAAGGAGGAT GTGAGATTGA GCAACCTCAA 301 ACTGATTTTG GGAACGATTA TCATCGTTGT TGCTCTTGTT GCTCAATTCT ACAACAAGAA 361 ATTCCCTGAG AACAGGGATT TTTTGATTGG ATGCATCGCA TTGTATGTAG TGTTGAATGC 421 GGTGTTGCAG CTGATTCTGT ACACTAAGGA GAAGAATGCG ATTTTGTTCA CCTATCCTCC 481 TGAGGGATCA TTCACCAGCA CTGGTTTGGT GGTATCTTCA AAGTTGCCCA GATTCTCTGA 541 TCAGTACACT CTCACCATCG ACAGTGCTGA TCCAAAATCA ATCTCAGCTG GGAAGTCAGT 601 TCAGCTCACC AAAAGTGTCA CCCAATGGTT CACGAAAGAT GGAGTTCTTG TTGAGGGTTT 661 ATTCTGGAAA GACGTAGAAG CACTAATCAA GAACTATGCA GAAGAAGAAC CAAAGAAGAA 721 GAAATGATCA TCTTCCAGAG AAATATATGC TTTTGAAACT CCATATGTTG GCGTTTTAGA 781 ATACTAAATC GATGGTGACT TTGTTTAGAG ATGAAACTAA TATTTGGATT TTTCCTTATA 841 GCCAACTTTT TGAATTAATC TG Predicted gene structure (within gDNA segment 52540 to 49819): Exon 1 52240 51991 ( 250 n); cDNA 1 250 ( 250 n); score: 1.000 Intron 1 51990 51920 ( 71 n); Pd: 0.995 (s: 1.00), Pa: 0.989 (s: 1.00) Exon 2 51919 51768 ( 152 n); cDNA 251 402 ( 152 n); score: 1.000 Intron 2 51767 51509 ( 259 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 3 51508 51427 ( 82 n); cDNA 403 484 ( 82 n); score: 1.000 Intron 3 51426 50597 ( 830 n); Pd: 0.850 (s: 1.00), Pa: 0.967 (s: 1.00) Exon 4 50596 50454 ( 143 n); cDNA 485 627 ( 143 n); score: 1.000 Intron 4 50453 50367 ( 87 n); Pd: 0.122 (s: 1.00), Pa: 0.998 (s: 1.00) Exon 5 50366 50132 ( 235 n); cDNA 628 862 ( 235 n); score: 0.996 MATCH 1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT- gi+ 0.999 862 1.000 C PGS_1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_gi+ (52240 51991,51919 51768,51508 51427,50596 50454,50366 50132) Alignment (genomic DNA sequence = upper lines): AGCTCCATCC AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT 52181 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCTCCATCC AATATATCGT CTTCTTCCTC CTCGAACCAA AAACATCAGT CTCTCCTAGT 60 CTCAGCGCTC GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT 52121 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCAGCGCTC GTGAAAATCT GAAACTCTGC GTTCGAATTT TCGAAGAACA TTAGAGATCT 120 GGTATTATCT GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA 52061 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTATTATCT GGATCCGATC TGTTCACCAT GGAGGAGAAG AAGACAGAAT CAACCAACAA 180 GAATGTTAAG AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC 52001 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATGTTAAG AAGGCCAATC TATTGGACCA CCACTCGATC AAGCACATCC TTGACGAATC 240 CGTCTCCGAT GTAAGAATTT TCTAAGATTT TCCTTGTTCG TTGCTCCATA TTTCTCTGAT 51941 |||||||||| CGTCTCCGAT .......... .......... .......... .......... .......... 250 ATGTGAATCT ATGATTCGTA GATCGTTACG AGCCGTGGCT ATAAGGAGGA TGTGAGATTG 51881 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .ATCGTTACG AGCCGTGGCT ATAAGGAGGA TGTGAGATTG 289 AGCAACCTCA AACTGATTTT GGGAACGATT ATCATCGTTG TTGCTCTTGT TGCTCAATTC 51821 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCAACCTCA AACTGATTTT GGGAACGATT ATCATCGTTG TTGCTCTTGT TGCTCAATTC 349 TACAACAAGA AATTCCCTGA GAACAGGGAT TTTTTGATTG GATGCATCGC ATTATATCCT 51761 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| TACAACAAGA AATTCCCTGA GAACAGGGAT TTTTTGATTG GATGCATCGC ATT....... 402 TTTTTCTCGA ATTCAATTGA TACTTTCTGC TTTCTGCTTG ACTCTCATTT GGTTGGTTTC 51701 .......... .......... .......... .......... .......... .......... 402 CAGTGGCTAA GATGAAATTT AAGAAAATTG AGATTAGAGA ACTTGTAATG TGGCTCGAAT 51641 .......... .......... .......... .......... .......... .......... 402 TCTTTGTGCT TAGTTAGCAT TAGATTTTGG AAAACCTTCT TGTATCTAAA GTATGACTAT 51581 .......... .......... .......... .......... .......... .......... 402 AGAAGCTGAT CGATATATGT CTTGTATTTG GATAATTCTG AAGAAGATGT TTGGTCCTTG 51521 .......... .......... .......... .......... .......... .......... 402 ACTATATGAC ACGTATGTAG TGTTGAATGC GGTGTTGCAG CTGATTCTGT ACACTAAGGA 51461 |||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ..GTATGTAG TGTTGAATGC GGTGTTGCAG CTGATTCTGT ACACTAAGGA 450 GAAGAATGCG ATTTTGTTCA CCTATCCTCC TGAGGTTTGG TCTCTGACTA TTTAGAAAAG 51401 |||||||||| |||||||||| |||||||||| |||| GAAGAATGCG ATTTTGTTCA CCTATCCTCC TGAG...... .......... .......... 484 TTATTTCATT GAGAAGTTTC GTGGTTTGGG CTACATTCTT GTTTTGTCTA TGATGCTACA 51341 .......... .......... .......... .......... .......... .......... 484 ATTGTATTGT ATTTCTGTGA TTTAGGTGCA TAATGTACCT TTTTTCCTTG TTCACTTACT 51281 .......... .......... .......... .......... .......... .......... 484 TTTAACTGGA GCTATAGTTT GAATTCTGGT GTCAAACTTT TGCATTCCTT TTGTTTTGTA 51221 .......... .......... .......... .......... .......... .......... 484 TTTTGTTTCT GCTTGCCTGA CATGTTATGT AAAGTTTGCA ACTTTGCATA TCTTAAGGCA 51161 .......... .......... .......... .......... .......... .......... 484 TCTTTTTTTT TTTTTTTTTT TTTCATTTGA GCTCTAGGCA AAACTACATT TCCCTTTCAA 51101 .......... .......... .......... .......... .......... .......... 484 TGTATTGTTC AATTAGTTTT AGTCTTCAAA CTTTAGATTT TGAAAAATAG TTGAACAAAA 51041 .......... .......... .......... .......... .......... .......... 484 GTTTTTGATG TTTGCAATCG TTTCTGTCTT GTTTCTCGAA ACATAATTGT TCAACTAAGT 50981 .......... .......... .......... .......... .......... .......... 484 GGAGGATTCT GGACTGTAAC TTGGGTGGTT TATTTCTCAT TCTTACCAGC TTGAAGAAAT 50921 .......... .......... .......... .......... .......... .......... 484 TACTCCTCTC ATTTAGTTAG TATCCGAGAC GATGAGATTT TTTTTTCCAA ACATTTCATT 50861 .......... .......... .......... .......... .......... .......... 484 CTGTACTCGT TTAGTTAGCT CTAGGAAACA TAACAAAAAG AGCTCAGACA TTTTGTCTTT 50801 .......... .......... .......... .......... .......... .......... 484 CTGAAGCCAT GTTAAATTGA TGCAATAATT TGGTTAGTGG GATTGCTTCA TTTTGTTATG 50741 .......... .......... .......... .......... .......... .......... 484 TCAAGACTCA AGATCCTAAA GCTCTCTGGT TGATATATAT TTGCTCACTC TCATTTGTTT 50681 .......... .......... .......... .......... .......... .......... 484 TGCTACTGTT CAAACATGTA TGTTTAGCTC ATCATTTGGT GCTCCTCTTT GGAAAAATCT 50621 .......... .......... .......... .......... .......... .......... 484 TAAAATCTGA TGGTTGATAC GCAGGGATCA TTCACCAGCA CTGGTTTGGT GGTATCTTCA 50561 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GGATCA TTCACCAGCA CTGGTTTGGT GGTATCTTCA 520 AAGTTGCCCA GATTCTCTGA TCAGTACACT CTCACCATCG ACAGTGCTGA TCCAAAATCA 50501 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGTTGCCCA GATTCTCTGA TCAGTACACT CTCACCATCG ACAGTGCTGA TCCAAAATCA 580 ATCTCAGCTG GGAAGTCAGT TCAGCTCACC AAAAGTGTCA CCCAATGGTC TCTCTCTCGG 50441 |||||||||| |||||||||| |||||||||| |||||||||| ||||||| ATCTCAGCTG GGAAGTCAGT TCAGCTCACC AAAAGTGTCA CCCAATG... .......... 627 CATCTTTCTT ACAATTGCTC GCTCTGTTTT CCTTACATGA GTCTCATTAG ATGGTTTCTC 50381 .......... .......... .......... .......... .......... .......... 627 TGTGTGTTTT TCAGGTTCAC GAAAGATGGA GTTCTTGTTG AGGGTTTATT CTGGAAAGAC 50321 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....GTTCAC GAAAGATGGA GTTCTTGTTG AGGGTTTATT CTGGAAAGAC 673 GTAGAAGCAC TAATCAAGAA CTATGCAGAA GAAGAACCAA AGAAGAAGAA ATGATCATCT 50261 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTAGAAGCAC TAATCAAGAA CTATGCAGAA GAAGAACCAA AGAAGAAGAA ATGATCATCT 733 TCCAGAGAAA TATATGCTTT TGAAACTCCA TATGTTGGCG TTTTAGAATA CTAAATCGAT 50201 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCAGAGAAA TATATGCTTT TGAAACTCCA TATGTTGGCG TTTTAGAATA CTAAATCGAT 793 GGTGACTTTG TTTAGAGATG AAACTAATAT TTGGATTTTT CCTTATAGCC AACTTTTTGA 50141 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGACTTTG TTTAGAGATG AAACTAATAT TTGGATTTTT CCTTATAGCC AACTTTTTGA 853 ATTAATCCG 50132 ||||||| | ATTAATCTG 862 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 52240 50132 AGS-1 (52240 51991,51919 51768,51508 51427,50596 50454,50366 50132) SCR (e 1.000 d 0.995 a 0.989,e 1.000 d 0.000 a 0.000,e 1.000 d 0.850 a 0.967,e 1.000 d 0.122 a 0.998,e 0.996) Exon 1 52240 51991 ( 250 n); score: 1.000 Intron 1 51990 51920 ( 71 n); Pd: 0.995 Pa: 0.989 Exon 2 51919 51768 ( 152 n); score: 1.000 Intron 2 51767 51509 ( 259 n); Pd: 0.000 Pa: 0.000 Exon 3 51508 51427 ( 82 n); score: 1.000 Intron 3 51426 50597 ( 830 n); Pd: 0.850 Pa: 0.967 Exon 4 50596 50454 ( 143 n); score: 1.000 Intron 4 50453 50367 ( 87 n); Pd: 0.122 Pa: 0.998 Exon 5 50366 50132 ( 235 n); score: 0.996 PGS (52240 51991,51919 51768,51508 51427,50596 50454,50366 50132) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 52240 AGCTCCATCCAATATATCGTCTTCTTCCTCCTCGAACCAAAAACATCAGTCTCTCCTAGT S S I Q Y I V F F L L E P K T S V S P S A P S N I S S S S S S N Q K H Q S L L V L H P I Y R L L P P R T K N I S L S - . . . . . . 52180 CTCAGCGCTCGTGAAAATCTGAAACTCTGCGTTCGAATTTTCGAAGAACATTAGAGATCT L S A R E N L K L C V R I F E E H - R S S A L V K I - N S A F E F S K N I R D L S Q R S - K S E T L R S N F R R T L E I . . . . . . 52120 GGTATTATCTGGATCCGATCTGTTCACCATGGAGGAGAAGAAGACAGAATCAACCAACAA G I I W I R S V H H G G E E D R I N Q Q V L S G S D L F T M E E K K T E S T N K W Y Y L D P I C S P W R R R R Q N Q P T . . . . . . 52060 GAATGTTAAGAAGGCCAATCTATTGGACCACCACTCGATCAAGCACATCCTTGACGAATC E C - E G Q S I G P P L D Q A H P - R I N V K K A N L L D H H S I K H I L D E S R M L R R P I Y W T T T R S S T S L T N . : . . . . . 52000 CGTCTCCGAT : ATCGTTACGAGCCGTGGCTATAAGGAGGATGTGAGATTGAGCAACCTCAA R L R : Y R Y E P W L - G G C E I E Q P Q V S D : I V T S R G Y K E D V R L S N L K P S P I : S L R A V A I R R M - D - A T S . . . . . . 51869 ACTGATTTTGGGAACGATTATCATCGTTGTTGCTCTTGTTGCTCAATTCTACAACAAGAA T D F G N D Y H R C C S C C S I L Q Q E L I L G T I I I V V A L V A Q F Y N K K N - F W E R L S S L L L L L L N S T T R . . . . . : . 51809 ATTCCCTGAGAACAGGGATTTTTTGATTGGATGCATCGCATT : GTATGTAGTGTTGAATGC I P - E Q G F F D W M H R I : V C S V E C F P E N R D F L I G C I A L : Y V V L N A N S L R T G I F - L D A S H : C M - C - M . . . . . . 51490 GGTGTTGCAGCTGATTCTGTACACTAAGGAGAAGAATGCGATTTTGTTCACCTATCCTCC G V A A D S V H - G E E C D F V H L S S V L Q L I L Y T K E K N A I L F T Y P P R C C S - F C T L R R R M R F C S P I L . : . . . . . 51430 TGAG : GGATCATTCACCAGCACTGGTTTGGTGGTATCTTCAAAGTTGCCCAGATTCTCTGA - : G I I H Q H W F G G I F K V A Q I L - E : G S F T S T G L V V S S K L P R F S D L R : D H S P A L V W W Y L Q S C P D S L . . . . . . 50540 TCAGTACACTCTCACCATCGACAGTGCTGATCCAAAATCAATCTCAGCTGGGAAGTCAGT S V H S H H R Q C - S K I N L S W E V S Q Y T L T I D S A D P K S I S A G K S V I S T L S P S T V L I Q N Q S Q L G S Q . . . : . . . 50480 TCAGCTCACCAAAAGTGTCACCCAATG : GTTCACGAAAGATGGAGTTCTTGTTGAGGGTTT S A H Q K C H P M : V H E R W S S C - G F Q L T K S V T Q W : F T K D G V L V E G L F S S P K V S P N : G S R K M E F L L R V . . . . . . 50333 ATTCTGGAAAGACGTAGAAGCACTAATCAAGAACTATGCAGAAGAAGAACCAAAGAAGAA I L E R R R S T N Q E L C R R R T K E E F W K D V E A L I K N Y A E E E P K K K Y S G K T - K H - S R T M Q K K N Q R R . . . . . . 50273 GAAATGATCATCTTCCAGAGAAATATATGCTTTTGAAACTCCATATGTTGGCGTTTTAGA E M I I F Q R N I C F - N S I C W R F R K - S S S R E I Y A F E T P Y V G V L E R N D H L P E K Y M L L K L H M L A F - . . . . . . 50213 ATACTAAATCGATGGTGACTTTGTTTAGAGATGAAACTAATATTTGGATTTTTCCTTATA I L N R W - L C L E M K L I F G F F L I Y - I D G D F V - R - N - Y L D F S L - N T K S M V T L F R D E T N I W I F P Y . . . 50153 GCCAACTTTTTGAATTAATCCG A N F L N - S P T F - I N P S Q L F E L I Maximal non-overlapping open reading frames (>= 64 codons): >1954AAGCTTCAAACGACAAGATCATGGGACTTCATGAATCTAACATTAAAAGCTGAGCGAAAT-_PGL-1_AGS-1_PPS_1 (52158 51991,51919 51768,51508 51427,50596 50454,50366 50267) (frame '2'; 642 bp, 214 residues) 1 NSAFEFSKNI RDLVLSGSDL FTMEEKKTES TNKNVKKANL LDHHSIKHIL DESVSDIVTS 61 RGYKEDVRLS NLKLILGTII IVVALVAQFY NKKFPENRDF LIGCIALYVV LNAVLQLILY 121 TKEKNAILFT YPPEGSFTST GLVVSSKLPR FSDQYTLTID SADPKSISAG KSVQLTKSVT 181 QWFTKDGVLV EGLFWKDVEA LIKNYAEEEP KKKK-