GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:38:23 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 527 Minimum sequence length: 527 Maximum sequence length: 527 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 1 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 49202GAATTCCGCAATGCAGGTTAAGAGCTCTGTGAAAGAGGAAAACGAAAAACGCAGAAGGTG, from 1 to 88710, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 TCCCCGGGCT GCAGGATCGG CACGAGAGAC AGTAGTAATG GAAATGGTGG TACTTCAGAG 61 AATTTGGAAT GTTGTTCTAC TCAGACATCC TATGGAGGCA TCTGAGGGCA CACAGAATGA 121 ACAAGTGGAT GATTCAAAAC AGATGAGAGG ACAAAAGGTG CAAGGAAGGG TCAAGCATGA 181 GAAGACCTCT GGAGGCAAGA ATATTCCATC AGTCTTGGTG AAGAAGAAAA AAGATGGGAA 241 AGTGGTGGCA TCAAATGGTT CTGTTGCTCC AAACGTAAAG CCTGTAAAGT CTCCTAAGAG 301 CAGCTGAAGG GACCAGGTAC AAGCCAAAAC TGAGGGAAAC AAGGAAACAA GTCAATGATA 361 CATCTGAAGA TGATACACAG TATAGTCCAG GAGAAGATGA TGGCGAACCT CGTCGAGCTA 421 GTGCACTTCC AAATTATGGA TTCAGTTTTA GATGTGACCA ACGAGCTGAA AAAAGAAGAG 481 AGTTCTATTC AAAGCTTGAG GAAAAGATCC ATGCGAAAGA AGAAGAA Predicted gene structure (within gDNA segment 42253 to 44038): Exon 1 42678 42757 ( 80 n); cDNA 26 106 ( 81 n); score: 0.975 Intron 1 42758 42848 ( 91 n); Pd: 0.922 (s: 0.96), Pa: 0.998 (s: 1.00) Exon 2 42849 43045 ( 197 n); cDNA 107 303 ( 197 n); score: 0.995 Intron 2 43046 43201 ( 156 n); Pd: 0.000 (s: 0.98), Pa: 0.001 (s: n/a) Exon 3 43202 43214 ( 13 n); cDNA 304 316 ( 13 n); score: 1.000 Intron 3 43215 43360 ( 146 n); Pd: 0.998 (s: n/a), Pa: 0.998 (s: 0.98) Exon 4 43361 43426 ( 66 n); cDNA 317 382 ( 66 n); score: 0.985 Intron 4 43427 43503 ( 77 n); Pd: 0.927 (s: 1.00), Pa: 0.935 (s: 0.94) Exon 5 43504 43603 ( 100 n); cDNA 383 482 ( 100 n); score: 0.970 Intron 5 43604 43680 ( 77 n); Pd: 0.514 (s: 1.00), Pa: 0.952 (s: 1.00) Exon 6 43681 43725 ( 45 n); cDNA 483 527 ( 45 n); score: 1.000 MATCH 49202GAATTCCGCAATGCAGGTTAAGAGCTCTGTGAAAGAGGAAAACGAAAAACGCAGAAGGTG+ gi+ 0.984 443 0.841 C PGS_49202GAATTCCGCAATGCAGGTTAAGAGCTCTGTGAAAGAGGAAAACGAAAAACGCAGAAGGTG+_gi+ (42678 42757,42849 43045,43202 43214,43361 43426,43504 43603,43681 43725) Alignment (genomic DNA sequence = upper lines): GAGACAGTAG TAATGGAAAT GGTGGTACTT CAGAGAATTT GGAATGTTGT TCTACTCAG- 42736 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| GAGACAGTAG TAATGGAAAT GGTGGTACTT CAGAGAATTT GGAATGTTGT TCTACTCAGA 85 CATCCTATGG AGGCATCTGA GGTTGGTGAC AAAGAATTTT AACGTTTTTT TTTTTTTGCC 42796 |||||||||| |||||||||| | CATCCTATGG AGGCATCTGA G......... .......... .......... .......... 106 TTTTCTGCAC AAATAATTGT TATTGACTTT GCAGTTTTTT CGCTTTTTCC AGGGCACACA 42856 |||||||| .......... .......... .......... .......... .......... ..GGCACACA 114 GAATGAACAA GTGGATGATT CAAAACAGAT GAGAGGACAA AAGGTGCAAG GAAGGGTCAA 42916 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATGAACAA GTGGATGATT CAAAACAGAT GAGAGGACAA AAGGTGCAAG GAAGGGTCAA 174 GCATGAGAAG ACCTCTGGAG GCAAGAATAT TCCATCAGTC TTGGTGAAGA AGAAAAAAGA 42976 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCATGAGAAG ACCTCTGGAG GCAAGAATAT TCCATCAGTC TTGGTGAAGA AGAAAAAAGA 234 TGGGAAAGTG GTGGCATCAA ATGGTTCTGT TGCTCCAAAC GTAAAGCCTG TAAAGTCTCC 43036 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGGAAAGTG GTGGCATCAA ATGGTTCTGT TGCTCCAAAC GTAAAGCCTG TAAAGTCTCC 294 TAAGAGCAAA TCGCTCAATG GTAGAGAGGC TCATGTCACG AAGGTTACTA ATCTCTTTGT 43096 |||||||| TAAGAGCAG. .......... .......... .......... .......... .......... 303 TGCTCCTATA GTTACTGGTG ATTAGCCATG TGTCTTCTTG GCTGAATGTT GTCTCTGATC 43156 .......... .......... .......... .......... .......... .......... 303 TCTTCATTTG TTCTTATCAG CATGGGAACC ATGACTCTCT ACCAGCTGAA GGGACCAGGT 43216 ||||| |||||||| .......... .......... .......... .......... .....CTGAA GGGACCAG.. 316 ACCAATATCA TTCTTTAAAA CATGGTTTCT GTTTTGCATC AGTTCATTGC CAAAACTCTA 43276 .......... .......... .......... .......... .......... .......... 316 AAACGATTAA TGCAAACATT CCATCAAGAT TATGTATTAG TTCCTGTCGT TAAGTCTGGT 43336 .......... .......... .......... .......... .......... .......... 316 AGTTTGTTTC TCTCTTTGAT ACAGGGACAA GCCAAAACTG AGGGAAACAA GGAAACAAGT 43396 | |||| |||||||||| |||||||||| |||||||||| .......... .......... ....GTACAA GCCAAAACTG AGGGAAACAA GGAAACAAGT 352 CAATGATACA TCTGAAGATG ATACACAGTA GTATGTCTTT TTGGGATTCA CTCTTCATTC 43456 |||||||||| |||||||||| |||||||||| CAATGATACA TCTGAAGATG ATACACAGTA .......... .......... .......... 382 TCATTTCCTT AGCACTTTTT TCTGACTAAA ACTTGCTTAC TGTTTAGTAG TCCAAAAGAA 43516 ||| |||| |||| .......... .......... .......... .......... .......TAG TCCAGGAGAA 395 GATGATGGCA AACCTCGTCG AGCTAGTGCA CTTCCAAATT ATGGATTCAG TTTTAGATGT 43576 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGATGGCG AACCTCGTCG AGCTAGTGCA CTTCCAAATT ATGGATTCAG TTTTAGATGT 455 GACCAACGAG CTGAAAAAAG AAGAGAGGTT TCATTTACTT CACTGTACTT ATCCTCACAA 43636 |||||||||| |||||||||| ||||||| GACCAACGAG CTGAAAAAAG AAGAGAG... .......... .......... .......... 482 CAAACATGCA ACCAAGTATC ATATTTCTCT GTCATGTGAT GCAGTTCTAT TCAAAGCTTG 43696 |||||| |||||||||| .......... .......... .......... .......... ....TTCTAT TCAAAGCTTG 498 AGGAAAAGAT CCATGCGAAA GAAGAAGAA 43725 |||||||||| |||||||||| ||||||||| AGGAAAAGAT CCATGCGAAA GAAGAAGAA 527 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (+ strand): 42678 43725 AGS-1 (42678 42757,42849 43045,43202 43214,43361 43426,43504 43603,43681 43725) SCR (e 0.975 d 0.922 a 0.998,e 0.995 d 0.000 a 0.001,e 1.000 d 0.998 a 0.998,e 0.985 d 0.927 a 0.935,e 0.970 d 0.514 a 0.952,e 1.000) Exon 1 42678 42757 ( 80 n); score: 0.975 Intron 1 42758 42848 ( 91 n); Pd: 0.922 Pa: 0.998 Exon 2 42849 43045 ( 197 n); score: 0.995 Intron 2 43046 43201 ( 156 n); Pd: 0.000 Pa: 0.001 Exon 3 43202 43214 ( 13 n); score: 1.000 Intron 3 43215 43360 ( 146 n); Pd: 0.998 Pa: 0.998 Exon 4 43361 43426 ( 66 n); score: 0.985 Intron 4 43427 43503 ( 77 n); Pd: 0.927 Pa: 0.935 Exon 5 43504 43603 ( 100 n); score: 0.970 Intron 5 43604 43680 ( 77 n); Pd: 0.514 Pa: 0.952 Exon 6 43681 43725 ( 45 n); score: 1.000 PGS (42678 42757,42849 43045,43202 43214,43361 43426,43504 43603,43681 43725) gi+ 3-phase translation of AGS-1 (+strand): . . . . . . 42678 GAGACAGTAGTAATGGAAATGGTGGTACTTCAGAGAATTTGGAATGTTGTTCTACTCAGC E T V V M E M V V L Q R I W N V V L L S R Q - - W K W W Y F R E F G M L F Y S A D S S N G N G G T S E N L E C C S T Q . . : . . . . 42738 ATCCTATGGAGGCATCTGAG : GGCACACAGAATGAACAAGTGGATGATTCAAAACAGATGA I L W R H L R : A H R M N K W M I Q N R - S Y G G I - : G H T E - T S G - F K T D E H P M E A S E : G T Q N E Q V D D S K Q M . . . . . . 42889 GAGGACAAAAGGTGCAAGGAAGGGTCAAGCATGAGAAGACCTCTGGAGGCAAGAATATTC E D K R C K E G S S M R R P L E A R I F R T K G A R K G Q A - E D L W R Q E Y S R G Q K V Q G R V K H E K T S G G K N I . . . . . . 42949 CATCAGTCTTGGTGAAGAAGAAAAAAGATGGGAAAGTGGTGGCATCAAATGGTTCTGTTG H Q S W - R R K K M G K W W H Q M V L L I S L G E E E K R W E S G G I K W F C C P S V L V K K K K D G K V V A S N G S V . . . . : . : . 43009 CTCCAAACGTAAAGCCTGTAAAGTCTCCTAAGAGCAA : CTGAAGGGACCAG : GGACAAGCCA L Q T - S L - S L L R A : T E G T R : D K P S K R K A C K V S - E Q : L K G P : G T S Q A P N V K P V K S P K S N : - R D Q : G Q A . . . . . . : 43371 AAACTGAGGGAAACAAGGAAACAAGTCAATGATACATCTGAAGATGATACACAGTA : TAGT K L R E T R K Q V N D T S E D D T Q Y : S N - G K Q G N K S M I H L K M I H S : I V K T E G N K E T S Q - Y I - R - Y T V : - . . . . . . 43508 CCAAAAGAAGATGATGGCAAACCTCGTCGAGCTAGTGCACTTCCAAATTATGGATTCAGT P K E D D G K P R R A S A L P N Y G F S Q K K M M A N L V E L V H F Q I M D S V S K R R - W Q T S S S - C T S K L W I Q . . . . : . . 43568 TTTAGATGTGACCAACGAGCTGAAAAAAGAAGAGAG : TTCTATTCAAAGCTTGAGGAAAAG F R C D Q R A E K R R E : F Y S K L E E K L D V T N E L K K E E S : S I Q S L R K R F - M - P T S - K K K R : V L F K A - G K . . . 43705 ATCCATGCGAAAGAAGAAGAA I H A K E E E S M R K K K D P C E R R R Maximal non-overlapping open reading frames (>= 64 codons): >49202GAATTCCGCAATGCAGGTTAAGAGCTCTGTGAAAGAGGAAAACGAAAAACGCAGAAGGTG+_PGL-1_AGS-1_PPS_1 (42680 42757,42849 43045,43202 43205) (frame '0'; 276 bp, 92 residues) 1 DSSNGNGGTS ENLECCSTQH PMEASEGTQN EQVDDSKQMR GQKVQGRVKH EKTSGGKNIP 61 SVLVKKKKDG KVVASNGSVA PNVKPVKSPK SN- >49202GAATTCCGCAATGCAGGTTAAGAGCTCTGTGAAAGAGGAAAACGAAAAACGCAGAAGGTG+_PGL-1_AGS-1_PPS_2 (43030 43045,43202 43214,43361 43426,43504 43603,43681 43725) (frame '1'; 240 bp, 80 residues) 1 SLLRATEGTR DKPKLRETRK QVNDTSEDDT QYSPKEDDGK PRRASALPNY GFSFRCDQRA 61 EKRREFYSKL EEKIHAKEEE