GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:39:19 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 598 Minimum sequence length: 598 Maximum sequence length: 598 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 1 < 700: 0 < 800: 0 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 49280GAATTCATTCATTCATCCCCTTTTTATTGACCAATTAGAACTTGACATGTCATATATTTA, from 1 to 94815, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 2 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 AACATGGACT CCTCCAAACT CTCATCTCTC TCTCTTTGCC TCTTCCTCAT TTGCATTATC 61 TATCTCCCCC AACATTCTCT CGCATGCGGC TCTTGCAACC CACGGAAGGG CGGAAAGCAC 121 TCCCCTAAAG CCCCTAAGCT ACCGGTTCCT CCGGTGACCG TCCCTAAGCT ACCCGTTCCT 181 CCTGTGACCA TCCCTAAGCT ACCCGTTCCT CCAGTGACTG TACCCAAGCT ACCCGTTCCT 241 CCAGTGACCG TCCCCAAGCT ACCCGTTCCT CCAGTGACAG TCCCAAAGCT ACCCGTTCCC 301 CCAGTAACTG TCCCTAAGCT ACCCGTTCCT CCGGTAACCG TCCCTAAGCT ACCCGTTCCT 361 CCAGTAACCG TCCCTAAGCT ACCCCTTCCT CCGATTTCAG GGCTACCCAT ACCTCCAGTG 421 GTAGGTCCCA ATCTGCCATT GCCACCTTTG CCAATTGTAG GTCCTATTCT TCCACCGGGA 481 ACAACCCCAC CAGCCACAGG AGGGAAGGAC TGTCCTCCAC CGCCAGGGAG CGTNAAGCCA 541 CCATCAGGGG GCGGGAAGGC GACATGTCCA ATACAACCTG AATTANGTGC TTGCGTCG Predicted gene structure (within gDNA segment 3371 to 843): Exon 1 3071 2919 ( 153 n); cDNA 1 153 ( 153 n); score: 0.993 Intron 1 2918 2859 ( 60 n); Pd: 0.001 (s: 0.98), Pa: 0.000 (s: 1.00) Exon 2 2858 2411 ( 448 n); cDNA 154 598 ( 445 n); score: 0.958 MATCH 49280GAATTCATTCATTCATCCCCTTTTTATTGACCAATTAGAACTTGACATGTCATATATTTA- gi+ 0.967 601 1.005 C PGS_49280GAATTCATTCATTCATCCCCTTTTTATTGACCAATTAGAACTTGACATGTCATATATTTA-_gi+ (3071 2919,2858 2411) Alignment (genomic DNA sequence = upper lines): AACATGGACT CCTCCAAACT CTCATCTCTC TCTCTTTGCC TCTTCCTCAT TTGCATTATC 3012 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACATGGACT CCTCCAAACT CTCATCTCTC TCTCTTTGCC TCTTCCTCAT TTGCATTATC 60 TATCTCCCCC AACATTCTCT CGCATGCGGC TCTTGCAACC CACGGAAGGG CGGAAAGCAC 2952 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATCTCCCCC AACATTCTCT CGCATGCGGC TCTTGCAACC CACGGAAGGG CGGAAAGCAC 120 TCCCCTAAAG CCCCTAAGCT ACCAGTTCCT CCGGTGACCG TCCCTAAGCT ACCAGTTCCT 2892 |||||||||| |||||||||| ||| |||||| ||| TCCCCTAAAG CCCCTAAGCT ACCGGTTCCT CCG....... .......... .......... 153 CCGGTGACCG TCCCTAAGCT ACCAGTCCCT CCGGTGACCG TCCCTAAGCT ACCCGTTCCT 2832 ||||||| |||||||||| |||||||||| .......... .......... .......... ...GTGACCG TCCCTAAGCT ACCCGTTCCT 180 CCTGTGACCA TCCCTAAGCT ACCCGTTCCA CCAGTGACTG TACCTAAGCT ACCCGTTCCT 2772 |||||||||| |||||||||| ||||||||| |||||||||| |||| ||||| |||||||||| CCTGTGACCA TCCCTAAGCT ACCCGTTCCT CCAGTGACTG TACCCAAGCT ACCCGTTCCT 240 CCTGTGACCG TCCCCAAGCT ACCCGTTCCT CCAGTGACCG TCCCCAAGCT ACCCGTTCCT 2712 || ||||||| |||||||||| |||||||||| |||||||| | |||| ||||| ||||||||| CCAGTGACCG TCCCCAAGCT ACCCGTTCCT CCAGTGACAG TCCCAAAGCT ACCCGTTCCC 300 CCAGTGACAG TCCCTAAGCT ACCCGTTCCC CCGGTAACTG TACCTAAGCT ACCCGTTCCT 2652 ||||| || | |||||||||| ||||||||| |||||||| | | |||||||| |||||||||| CCAGTAACTG TCCCTAAGCT ACCCGTTCCT CCGGTAACCG TCCCTAAGCT ACCCGTTCCT 360 CCAGTGACCG TCCCTAAGCT ACCCCTTCCT CCGATTTCAG GGCTACCCAT ACCTCCAGTG 2592 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCAGTAACCG TCCCTAAGCT ACCCCTTCCT CCGATTTCAG GGCTACCCAT ACCTCCAGTG 420 GTAGGTCCCA ATCTGCCATT GCCGCCTTTG CCAATTGTAG GTCCTATTCT TCCACCGGGA 2532 |||||||||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| GTAGGTCCCA ATCTGCCATT GCCACCTTTG CCAATTGTAG GTCCTATTCT TCCACCGGGA 480 ACAACCCCAC CAGCCACAGG AGGGAAGGAC TGTCCTCCAC CGCCAGGGAG CGTAAAGCCA 2472 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| |||||| ACAACCCCAC CAGCCACAGG AGGGAAGGAC TGTCCTCCAC CGCCAGGGAG CGTNAAGCCA 540 CCATCAGGGG GCGGGAAGGC GACATGTCCA ATAGACACGC TGAAGTTAGG TGCTTGCGTC 2412 |||||||||| |||||||||| |||||||||| ||| | || | |||| ||| | |||||||||| CCATCAGGGG GCGGGAAGGC GACATGTCCA ATACA-AC-C TGAA-TTANG TGCTTGCGTC 597 G 2411 | G 598 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 3071 2411 AGS-1 (3071 2919,2858 2411) SCR (e 0.993 d 0.001 a 0.000,e 0.958) Exon 1 3071 2919 ( 153 n); score: 0.993 Intron 1 2918 2859 ( 60 n); Pd: 0.001 Pa: 0.000 Exon 2 2858 2411 ( 448 n); score: 0.958 PGS (3071 2919,2858 2411) gi+ 3-phase translation of AGS-1 (-strand): . . . . . . 3071 AACATGGACTCCTCCAAACTCTCATCTCTCTCTCTTTGCCTCTTCCTCATTTGCATTATC N M D S S K L S S L S L C L F L I C I I T W T P P N S H L S L F A S S S F A L S H G L L Q T L I S L S L P L P H L H Y . . . . . . 3011 TATCTCCCCCAACATTCTCTCGCATGCGGCTCTTGCAACCCACGGAAGGGCGGAAAGCAC Y L P Q H S L A C G S C N P R K G G K H I S P N I L S H A A L A T H G R A E S T L S P P T F S R M R L L Q P T E G R K A . . . . : . . 2951 TCCCCTAAAGCCCCTAAGCTACCAGTTCCTCCG : GTGACCGTCCCTAAGCTACCCGTTCCT S P K A P K L P V P P : V T V P K L P V P P L K P L S Y Q F L R : - P S L S Y P F L L P - S P - A T S S S : G D R P - A T R S . . . . . . 2831 CCTGTGACCATCCCTAAGCTACCCGTTCCACCAGTGACTGTACCTAAGCTACCCGTTCCT P V T I P K L P V P P V T V P K L P V P L - P S L S Y P F H Q - L Y L S Y P F L S C D H P - A T R S T S D C T - A T R S . . . . . . 2771 CCTGTGACCGTCCCCAAGCTACCCGTTCCTCCAGTGACCGTCCCCAAGCTACCCGTTCCT P V T V P K L P V P P V T V P K L P V P L - P S P S Y P F L Q - P S P S Y P F L S C D R P Q A T R S S S D R P Q A T R S . . . . . . 2711 CCAGTGACAGTCCCTAAGCTACCCGTTCCCCCGGTAACTGTACCTAAGCTACCCGTTCCT P V T V P K L P V P P V T V P K L P V P Q - Q S L S Y P F P R - L Y L S Y P F L S S D S P - A T R S P G N C T - A T R S . . . . . . 2651 CCAGTGACCGTCCCTAAGCTACCCCTTCCTCCGATTTCAGGGCTACCCATACCTCCAGTG P V T V P K L P L P P I S G L P I P P V Q - P S L S Y P F L R F Q G Y P Y L Q W S S D R P - A T P S S D F R A T H T S S . . . . . . 2591 GTAGGTCCCAATCTGCCATTGCCGCCTTTGCCAATTGTAGGTCCTATTCTTCCACCGGGA V G P N L P L P P L P I V G P I L P P G - V P I C H C R L C Q L - V L F F H R E G R S Q S A I A A F A N C R S Y S S T G . . . . . . 2531 ACAACCCCACCAGCCACAGGAGGGAAGGACTGTCCTCCACCGCCAGGGAGCGTAAAGCCA T T P P A T G G K D C P P P P G S V K P Q P H Q P Q E G R T V L H R Q G A - S H N N P T S H R R E G L S S T A R E R K A . . . . . . 2471 CCATCAGGGGGCGGGAAGGCGACATGTCCAATAGACACGCTGAAGTTAGGTGCTTGCGTC P S G G G K A T C P I D T L K L G A C V H Q G A G R R H V Q - T R - S - V L A S T I R G R E G D M S N R H A E V R C L R . 2411 G Maximal non-overlapping open reading frames (>= 64 codons): >49280GAATTCATTCATTCATCCCCTTTTTATTGACCAATTAGAACTTGACATGTCATATATTTA-_PGL-1_AGS-1_PPS_1 (3071 2919,2858 2412) (frame '1'; 600 bp, 200 residues) 1 NMDSSKLSSL SLCLFLICII YLPQHSLACG SCNPRKGGKH SPKAPKLPVP PVTVPKLPVP 61 PVTIPKLPVP PVTVPKLPVP PVTVPKLPVP PVTVPKLPVP PVTVPKLPVP PVTVPKLPVP 121 PVTVPKLPLP PISGLPIPPV VGPNLPLPPL PIVGPILPPG TTPPATGGKD CPPPPGSVKP 181 PSGGGKATCP IDTLKLGACV