GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:37:28 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 485 Minimum sequence length: 485 Maximum sequence length: 485 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 1 < 600: 0 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA, from 1 to 118136, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 GAAACAAACT CTAAAACCCT AAATTCTTCT TCTCAAGCAG CCACATCTTC CTCTTTCACT 61 AGCTATGGAG TTTTGGGGTA TCGAGATTAA GCCAGGGAAG CCATTTAAGG TGATACAAAA 121 AGATGGATTC ATGGTCCATG CCTCTCAGGT TACCCTTGGT GACGTTGAGA AGGTTAAAAA 181 AGATGAGACT TTTGCCGTTT ATGTGAAGAT TGGTGATGAT GAGAATGGGT TTATGATTGG 241 AAATCTCTCA CAGAAGTTTC CTCAATTTTC TATTGATCTC TACTTAGGGC ACGAGTTTGA 301 GATTTCTCAC AACAGTACAA GCAGTGTCTA TCTTATTGGT TACAGGACCT TTGATGCTTT 361 TGACGAACTG GATGAGGAGA TTGATTCTGA TTCTGAGTTA NATGAATATA TGGAACAACA 421 AATTGCTGCT TTGCCTCAAA ATGANATCAA TCCTGAAGAA GATGATGAAT CCGACTCANA 481 TGAGA Predicted gene structure (within gDNA segment 17896 to 15954): Exon 1 17596 17520 ( 77 n); cDNA 1 77 ( 77 n); score: 1.000 Intron 1 17519 17076 ( 444 n); Pd: 0.997 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 17075 17005 ( 71 n); cDNA 78 148 ( 71 n); score: 1.000 Intron 2 17004 16913 ( 92 n); Pd: 0.995 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 16912 16692 ( 221 n); cDNA 149 369 ( 221 n); score: 1.000 Intron 3 16691 16580 ( 112 n); Pd: 0.000 (s: 1.00), Pa: 0.000 (s: n/a) Exon 4 16579 16560 ( 20 n); cDNA 370 389 ( 20 n); score: 1.000 Intron 4 16559 16476 ( 84 n); Pd: 0.996 (s: n/a), Pa: 0.980 (s: n/a) Exon 5 16475 16440 ( 36 n); cDNA 390 425 ( 36 n); score: 0.972 Intron 5 16439 16347 ( 93 n); Pd: 0.950 (s: n/a), Pa: 0.996 (s: 0.98) Exon 6 16346 16288 ( 59 n); cDNA 426 484 ( 59 n); score: 0.966 MATCH 42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA- gi+ 0.995 428 0.882 C PGS_42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_gi+ (17596 17520,17075 17005,16912 16692,16579 16560,16475 16440,16346 16288) Alignment (genomic DNA sequence = upper lines): GAAACAAACT CTAAAACCCT AAATTCTTCT TCTCAAGCAG CCACATCTTC CTCTTTCACT 17537 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAACAAACT CTAAAACCCT AAATTCTTCT TCTCAAGCAG CCACATCTTC CTCTTTCACT 60 AGCTATGGAG TTTTGGGGTA TGTCTCTCTC AAAATCATAT CTTTATTAAG TTCTACTGAA 17477 |||||||||| ||||||| AGCTATGGAG TTTTGGG... .......... .......... .......... .......... 77 TACCTCAATG ACTCCTATTG ACTGTTTTCG CCATTAAAGT TAATCATTTT GGTATTCCTA 17417 .......... .......... .......... .......... .......... .......... 77 AGTTCGTTCT GTGATTAAAA ATCTGGTCTT TATACAAAAG CTTGCCTTTC TTCTGTATTG 17357 .......... .......... .......... .......... .......... .......... 77 AATTCTCTCT ATTACTCAAT ACGCATTTTC TTGTGTTCAA AACTATACAA TTGTTTGGGA 17297 .......... .......... .......... .......... .......... .......... 77 ATATTAGCAT TCTTAGTGTT TAGGTTTCAA TGCATCTATC TTTGTGAGGT TATGTTTCAA 17237 .......... .......... .......... .......... .......... .......... 77 AGTATCTATC TTTAAGAGTT TATGTTTCAA AGCATCTATC TTTGTGAGTT TAGGTGTCAA 17177 .......... .......... .......... .......... .......... .......... 77 AGTATCTATA TCTTTGTGAG TTTAGGTGTC AAAGCATCTA TCTTTGTGAG ATTTTGTATG 17117 .......... .......... .......... .......... .......... .......... 77 AATTTGTACT CAAATGTGTG ATGAATAATG TTCAATTTTA GGTATCGAGA TTAAGCCAGG 17057 ||||||||| |||||||||| .......... .......... .......... .......... .GTATCGAGA TTAAGCCAGG 96 GAAGCCATTT AAGGTGATAC AAAAAGATGG ATTCATGGTC CATGCCTCTC AGGTCTGTAT 16997 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| || GAAGCCATTT AAGGTGATAC AAAAAGATGG ATTCATGGTC CATGCCTCTC AG........ 148 CAGTTTTGTA ATTCCTGCAA GTTTCTCTCT TTGTGTAAAT CCATGATTGA TATTGATTGT 16937 .......... .......... .......... .......... .......... .......... 148 GTTACTTGTT GTAACATCTC TTAGGTTACC CTTGGTGACG TTGAGAAGGT TAAAAAAGAT 16877 |||||| |||||||||| |||||||||| |||||||||| .......... .......... ....GTTACC CTTGGTGACG TTGAGAAGGT TAAAAAAGAT 184 GAGACTTTTG CCGTTTATGT GAAGATTGGT GATGATGAGA ATGGGTTTAT GATTGGAAAT 16817 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGACTTTTG CCGTTTATGT GAAGATTGGT GATGATGAGA ATGGGTTTAT GATTGGAAAT 244 CTCTCACAGA AGTTTCCTCA ATTTTCTATT GATCTCTACT TAGGGCACGA GTTTGAGATT 16757 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCTCACAGA AGTTTCCTCA ATTTTCTATT GATCTCTACT TAGGGCACGA GTTTGAGATT 304 TCTCACAACA GTACAAGCAG TGTCTATCTT ATTGGTTACA GGACCTTTGA TGCTTTTGAC 16697 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCACAACA GTACAAGCAG TGTCTATCTT ATTGGTTACA GGACCTTTGA TGCTTTTGAC 364 GAACTATATC CTTTCTTTCA TTGTCTTTTT CTAGTTAAAT TCATACTCAC TTGCAATTTT 16637 ||||| GAACT..... .......... .......... .......... .......... .......... 369 GAAGCTTTGT TTGTTTATCC TTGATTTTGT TTTGCTCTGC TTTTTTCTGA TGCTCACGGA 16577 ||| .......... .......... .......... .......... .......... .......GGA 372 TGAGGAGATT GATTCTGGTG AGTTTATAAT TCTTTAGTTG ATGATTTGGT TTACTCTATT 16517 |||||||||| ||||||| TGAGGAGATT GATTCTG... .......... .......... .......... .......... 389 ATGAAGCTTG AAGCTAATCT TTTGCTTTTT GGTTTGTGTA GATTCTGAGT TAGATGAATA 16457 ||||||||| || ||||||| .......... .......... .......... .......... .ATTCTGAGT TANATGAATA 408 TATGGAACAA CAAATTGGTA TGTTTACTTT TTTTACTAAG CTACTGCTTT GTGGTAGTCA 16397 |||||||||| ||||||| TATGGAACAA CAAATTG... .......... .......... .......... .......... 425 TATTTACATG TGTATCTCTT ATGAAATCTA ACTTTGACAT TGTTGAAAAG CTGCTTTGCC 16337 |||||||||| .......... .......... .......... .......... .......... CTGCTTTGCC 435 TCAAAATGAG ATCAATCCTG AAGAAGATGA TGAATCCGAC TCAGATGAG 16288 ||||||||| |||||||||| |||||||||| |||||||||| ||| ||||| TCAAAATGAN ATCAATCCTG AAGAAGATGA TGAATCCGAC TCANATGAG 484 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 17596 16288 AGS-1 (17596 17520,17075 17005,16912 16692,16579 16560,16475 16440,16346 16288) SCR (e 1.000 d 0.997 a 0.997,e 1.000 d 0.995 a 0.997,e 1.000 d 0.000 a 0.000,e 1.000 d 0.996 a 0.980,e 0.972 d 0.950 a 0.996,e 0.966) Exon 1 17596 17520 ( 77 n); score: 1.000 Intron 1 17519 17076 ( 444 n); Pd: 0.997 Pa: 0.997 Exon 2 17075 17005 ( 71 n); score: 1.000 Intron 2 17004 16913 ( 92 n); Pd: 0.995 Pa: 0.997 Exon 3 16912 16692 ( 221 n); score: 1.000 Intron 3 16691 16580 ( 112 n); Pd: 0.000 Pa: 0.000 Exon 4 16579 16560 ( 20 n); score: 1.000 Intron 4 16559 16476 ( 84 n); Pd: 0.996 Pa: 0.980 Exon 5 16475 16440 ( 36 n); score: 0.972 Intron 5 16439 16347 ( 93 n); Pd: 0.950 Pa: 0.996 Exon 6 16346 16288 ( 59 n); score: 0.966 PGS (17596 17520,17075 17005,16912 16692,16579 16560,16475 16440,16346 16288) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 17596 GAAACAAACTCTAAAACCCTAAATTCTTCTTCTCAAGCAGCCACATCTTCCTCTTTCACT E T N S K T L N S S S Q A A T S S S F T K Q T L K P - I L L L K Q P H L P L S L N K L - N P K F F F S S S H I F L F H . . : . . . . 17536 AGCTATGGAGTTTTGGG : GTATCGAGATTAAGCCAGGGAAGCCATTTAAGGTGATACAAAA S Y G V L G : Y R D - A R E A I - G D T K A M E F W : G I E I K P G K P F K V I Q K - L W S F G : V S R L S Q G S H L R - Y K . . . : . . . 17032 AGATGGATTCATGGTCCATGCCTCTCAG : GTTACCCTTGGTGACGTTGAGAAGGTTAAAAA R W I H G P C L S : G Y P W - R - E G - K D G F M V H A S Q : V T L G D V E K V K K K M D S W S M P L R : L P L V T L R R L K . . . . . . 16880 AGATGAGACTTTTGCCGTTTATGTGAAGATTGGTGATGATGAGAATGGGTTTATGATTGG R - D F C R L C E D W - - - E W V Y D W D E T F A V Y V K I G D D E N G F M I G K M R L L P F M - R L V M M R M G L - L . . . . . . 16820 AAATCTCTCACAGAAGTTTCCTCAATTTTCTATTGATCTCTACTTAGGGCACGAGTTTGA K S L T E V S S I F Y - S L L R A R V - N L S Q K F P Q F S I D L Y L G H E F E E I S H R S F L N F L L I S T - G T S L . . . . . . 16760 GATTTCTCACAACAGTACAAGCAGTGTCTATCTTATTGGTTACAGGACCTTTGATGCTTT D F S Q Q Y K Q C L S Y W L Q D L - C F I S H N S T S S V Y L I G Y R T F D A F R F L T T V Q A V S I L L V T G P L M L . : . . : . . . 16700 TGACGAACT : GGATGAGGAGATTGATTCTG : ATTCTGAGTTAGATGAATATATGGAACAACA - R T : G - G D - F - : F - V R - I Y G T T D E L : D E E I D S : D S E L D E Y M E Q Q L T N : W M R R L I L : I L S - M N I W N N . : . . . . . 16444 AATTG : CTGCTTTGCCTCAAAATGAGATCAATCCTGAAGAAGATGATGAATCCGACTCAGA N C : C F A S K - D Q S - R R - - I R L R I : A A L P Q N E I N P E E D D E S D S D K L : L L C L K M R S I L K K M M N P T Q . 16291 TGAG - E M Maximal non-overlapping open reading frames (>= 64 codons): >42596GAATTCCTGGCAAACGCCCAACAACAGTGAATCCAGCTTATGAAGAGTTCTTCCTAACCA-_PGL-1_AGS-1_PPS_1 (17574 17520,17075 17005,16912 16692,16579 16560,16475 16440,16346 16288) (frame '2'; 462 bp, 154 residues) 1 ILLLKQPHLP LSLAMEFWGI EIKPGKPFKV IQKDGFMVHA SQVTLGDVEK VKKDETFAVY 61 VKIGDDENGF MIGNLSQKFP QFSIDLYLGH EFEISHNSTS SVYLIGYRTF DAFDELDEEI 121 DSDSELDEYM EQQIAALPQN EINPEEDDES DSDE