GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:37:04 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 562 Minimum sequence length: 562 Maximum sequence length: 562 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 1 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 39130AAGCTTTACAAACTTTGCATATTTCTCTTCCCGCATCAGCCTCCCAAAATAACATGCAAT, from 1 to 85154, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 TCATTCTCAC TAACAGAGAG AAAAAGCAAA AGCAAAAATG GTTCGATCTT TCGCTATTGC 61 TGTGATCTGT ATCGTCCTCA TCGCCGGCGT TACTGGTCAA GCTCCGACTT CACCACCAAC 121 CGCTACACCA GCTCCACCAA CTCCAACAAC TCCTCCACCA GCAGCAACAA CATCTCCTCC 181 TCCAGTCACC ACTGCTCCTC CTCCAGCAAA TCCACCACCA CCAGTCTCTT CTCCTCCTCC 241 TGCTTCTCCT CCACCAGCTA CTCCTCCTCC AGTCGCTTCT CCTCCTCCTC CCGTTGCTTC 301 TCCTCCACCA GCAACTCCTC CTCCCGTCGC TACTCCTCCC CCAGCTCCTC TAGCTTCCCC 361 TCCTGCTCAA GTTCCAGCTC CAGCACCGAC CACGAAGCCA GATTCTCCAT CTCCATCTCC 421 GTCATCTAGC CCACCTCTTC CATCAAGCGA CGCTCCTGGA CCTAGCACCG ATTCTATCTC 481 TCCTGCTCCT AGCCCCACTG ATGTGAACGA CCAGAATGGA GCTAGCAAGA TGGTTTCGAG 541 CTTAGTATTT GGATCTGTTC TC Predicted gene structure (within gDNA segment 19774 to 21474): Exon 1 20074 20239 ( 166 n); cDNA 1 166 ( 166 n); score: 0.994 Intron 1 20240 20272 ( 33 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 2 20273 20620 ( 348 n); cDNA 167 514 ( 348 n); score: 1.000 Intron 2 20621 21121 ( 501 n); Pd: 1.000 (s: 1.00), Pa: 0.973 (s: 1.00) Exon 3 21122 21169 ( 48 n); cDNA 515 562 ( 48 n); score: 1.000 MATCH 39130AAGCTTTACAAACTTTGCATATTTCTCTTCCCGCATCAGCCTCCCAAAATAACATGCAAT+ gi+ 0.998 514 0.915 C PGS_39130AAGCTTTACAAACTTTGCATATTTCTCTTCCCGCATCAGCCTCCCAAAATAACATGCAAT+_gi+ (20074 20239,20273 20620,21122 21169) Alignment (genomic DNA sequence = upper lines): TCATTCTCAC TAACAGAGAG AAAAAGCAAA AGCAAAAATG GCTCGATCTT TCGCTATTGC 20133 |||||||||| |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| TCATTCTCAC TAACAGAGAG AAAAAGCAAA AGCAAAAATG GTTCGATCTT TCGCTATTGC 60 TGTGATCTGT ATCGTCCTCA TCGCCGGCGT TACTGGTCAA GCTCCGACTT CACCACCAAC 20193 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTGATCTGT ATCGTCCTCA TCGCCGGCGT TACTGGTCAA GCTCCGACTT CACCACCAAC 120 CGCTACACCA GCTCCACCAA CTCCAACAAC TCCTCCACCA GCAGCAACTC CTCCTCCTGT 20253 |||||||||| |||||||||| |||||||||| |||||||||| |||||| CGCTACACCA GCTCCACCAA CTCCAACAAC TCCTCCACCA GCAGCA.... .......... 166 CTCAGCACCA CCACCAGTCA CAACATCTCC TCCTCCAGTC ACCACTGCTC CTCCTCCAGC 20313 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........A CAACATCTCC TCCTCCAGTC ACCACTGCTC CTCCTCCAGC 207 AAATCCACCA CCACCAGTCT CTTCTCCTCC TCCTGCTTCT CCTCCACCAG CTACTCCTCC 20373 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATCCACCA CCACCAGTCT CTTCTCCTCC TCCTGCTTCT CCTCCACCAG CTACTCCTCC 267 TCCAGTCGCT TCTCCTCCTC CTCCCGTTGC TTCTCCTCCA CCAGCAACTC CTCCTCCCGT 20433 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCAGTCGCT TCTCCTCCTC CTCCCGTTGC TTCTCCTCCA CCAGCAACTC CTCCTCCCGT 327 CGCTACTCCT CCCCCAGCTC CTCTAGCTTC CCCTCCTGCT CAAGTTCCAG CTCCAGCACC 20493 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCTACTCCT CCCCCAGCTC CTCTAGCTTC CCCTCCTGCT CAAGTTCCAG CTCCAGCACC 387 GACCACGAAG CCAGATTCTC CATCTCCATC TCCGTCATCT AGCCCACCTC TTCCATCAAG 20553 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACCACGAAG CCAGATTCTC CATCTCCATC TCCGTCATCT AGCCCACCTC TTCCATCAAG 447 CGACGCTCCT GGACCTAGCA CCGATTCTAT CTCTCCTGCT CCTAGCCCCA CTGATGTGAA 20613 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGACGCTCCT GGACCTAGCA CCGATTCTAT CTCTCCTGCT CCTAGCCCCA CTGATGTGAA 507 CGACCAGGTT AGTAACCTTT TCTTCTAGAT CTTGTACAAT GTGTTTCAGA TCTGAGATGT 20673 ||||||| CGACCAG... .......... .......... .......... .......... .......... 514 TTTGTCTGAA CTTGAATAGA TCTAAAACGG TATGATTTAG AAAAAAGTGA AAAACATAAT 20733 .......... .......... .......... .......... .......... .......... 514 TAGCGCAATT TAGCAAGAAA ATGCTAGTCT TTGCTAACGT GCAACCGTAT CGTGATTTGA 20793 .......... .......... .......... .......... .......... .......... 514 TGATTTTGGC TTGTCTGCCT CGAGATCTTC GAATCCTAGC CGTGAGCGCG TGAGTGCTGT 20853 .......... .......... .......... .......... .......... .......... 514 AGTCACGTGA CTTTGTCAGT CTGCACGTGA ATAATTTAAT ACTGCCGCGT GGCAATATTC 20913 .......... .......... .......... .......... .......... .......... 514 TTTCGGACCA AAGTAGATCT AGTTTAGTTT AGTAATAAAA AAACAGTCCA AAGCGTGATG 20973 .......... .......... .......... .......... .......... .......... 514 GACGGATTCT CTAAGATCTG CTGTCCAAAT TATTTAACTT TCCACCCCTA AATTTTAGGA 21033 .......... .......... .......... .......... .......... .......... 514 TCTTTTCAAA TGTAACCCCG GATTAAGTAA TTGAACTGTG AAGTCCCTTG GCTATTGAAA 21093 .......... .......... .......... .......... .......... .......... 514 TGCTGATTGG AAACCTTTTG TTTTGCAGAA TGGAGCTAGC AAGATGGTTT CGAGCTTAGT 21153 || |||||||||| |||||||||| |||||||||| .......... .......... ........AA TGGAGCTAGC AAGATGGTTT CGAGCTTAGT 546 ATTTGGATCT GTTCTC 21169 |||||||||| |||||| ATTTGGATCT GTTCTC 562 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (+ strand): 20074 21169 AGS-1 (20074 20239,20273 20620,21122 21169) SCR (e 0.994 d 0.000 a 0.000,e 1.000 d 1.000 a 0.973,e 1.000) Exon 1 20074 20239 ( 166 n); score: 0.994 Intron 1 20240 20272 ( 33 n); Pd: 0.000 Pa: 0.000 Exon 2 20273 20620 ( 348 n); score: 1.000 Intron 2 20621 21121 ( 501 n); Pd: 1.000 Pa: 0.973 Exon 3 21122 21169 ( 48 n); score: 1.000 PGS (20074 20239,20273 20620,21122 21169) gi+ 3-phase translation of AGS-1 (+strand): . . . . . . 20074 TCATTCTCACTAACAGAGAGAAAAAGCAAAAGCAAAAATGGCTCGATCTTTCGCTATTGC S F S L T E R K S K S K N G S I F R Y C H S H - Q R E K A K A K M A R S F A I A I L T N R E K K Q K Q K W L D L S L L . . . . . . 20134 TGTGATCTGTATCGTCCTCATCGCCGGCGTTACTGGTCAAGCTCCGACTTCACCACCAAC C D L Y R P H R R R Y W S S S D F T T N V I C I V L I A G V T G Q A P T S P P T L - S V S S S S P A L L V K L R L H H Q . . . . . : . 20194 CGCTACACCAGCTCCACCAACTCCAACAACTCCTCCACCAGCAGCA : ACAACATCTCCTCC R Y T S S T N S N N S S T S S : N N I S S A T P A P P T P T T P P P A A : T T S P P P L H Q L H Q L Q Q L L H Q Q Q : Q H L L . . . . . . 20287 TCCAGTCACCACTGCTCCTCCTCCAGCAAATCCACCACCACCAGTCTCTTCTCCTCCTCC S S H H C S S S S K S T T T S L F S S S P V T T A P P P A N P P P P V S S P P P L Q S P L L L L Q Q I H H H Q S L L L L . . . . . . 20347 TGCTTCTCCTCCACCAGCTACTCCTCCTCCAGTCGCTTCTCCTCCTCCTCCCGTTGCTTC C F S S T S Y S S S S R F S S S S R C F A S P P P A T P P P V A S P P P P V A S L L L L H Q L L L L Q S L L L L L P L L . . . . . . 20407 TCCTCCACCAGCAACTCCTCCTCCCGTCGCTACTCCTCCCCCAGCTCCTCTAGCTTCCCC S S T S N S S S R R Y S S P S S S S F P P P P A T P P P V A T P P P A P L A S P L L H Q Q L L L P S L L L P Q L L - L P . . . . . . 20467 TCCTGCTCAAGTTCCAGCTCCAGCACCGACCACGAAGCCAGATTCTCCATCTCCATCTCC S C S S S S S S T D H E A R F S I S I S P A Q V P A P A P T T K P D S P S P S P L L L K F Q L Q H R P R S Q I L H L H L . . . . . . 20527 GTCATCTAGCCCACCTCTTCCATCAAGCGACGCTCCTGGACCTAGCACCGATTCTATCTC V I - P T S S I K R R S W T - H R F Y L S S S P P L P S S D A P G P S T D S I S R H L A H L F H Q A T L L D L A P I L S . . . . : . . 20587 TCCTGCTCCTAGCCCCACTGATGTGAACGACCAG : AATGGAGCTAGCAAGATGGTTTCGAG S C S - P H - C E R P : E W S - Q D G F E P A P S P T D V N D Q : N G A S K M V S S L L L L A P L M - T T R : M E L A R W F R . . . 21148 CTTAGTATTTGGATCTGTTCTC L S I W I C S L V F G S V L A - Y L D L F Maximal non-overlapping open reading frames (>= 64 codons): >39130AAGCTTTACAAACTTTGCATATTTCTCTTCCCGCATCAGCCTCCCAAAATAACATGCAAT+_PGL-1_AGS-1_PPS_1 (20087 20239,20273 20620,21122 21169) (frame '2'; 549 bp, 183 residues) 1 QREKAKAKMA RSFAIAVICI VLIAGVTGQA PTSPPTATPA PPTPTTPPPA ATTSPPPVTT 61 APPPANPPPP VSSPPPASPP PATPPPVASP PPPVASPPPA TPPPVATPPP APLASPPAQV 121 PAPAPTTKPD SPSPSPSSSP PLPSSDAPGP STDSISPAPS PTDVNDQNGA SKMVSSLVFG 181 SVL