GeneSeqer. Version of October 12, 2001. Date run: Wed Feb 26 16:38:01 2003 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 12, MinQuality 30. Total number of ESTs: 1 Total sequence length: 712 Minimum sequence length: 712 Maximum sequence length: 712 Length distribution (number of sequences of specified length): < 100: 0 < 200: 0 < 300: 0 < 400: 0 < 500: 0 < 600: 0 < 700: 0 < 800: 1 < 900: 0 < 1000: 0 >=1000: 0 ________________________________________________________________________________ Sequence 1: 44466AAGCTTTGAAGAATCTAAAGAGGAGATCAAAAGAGAGCTGCTTTTTCCAGACTCTAAGGA, from 1 to 128090, both strands analyzed. EST library file: cdna.seq; matching gDNA +strand ... Found all matches, elapsed seconds = 1 Matches indexed, elapsed seconds = 1 HitsTableSize = 0 EST library file: cdna.seq; matching gDNA -strand ... Found all matches, elapsed seconds = 0 Matches indexed, elapsed seconds = 0 HitsTableSize = 1 ******************************************************************************** EST sequence 1 +strand (File: gi+) 1 GAAGCCACTA GTCGAATTGG AGAAGAAGAT TGTTGATGTG AGGAAGATGG CGAATGAAAC 61 GGGGTTGGAT TTCACTGAGC AGATTATCAC TTTGGAGAAC AAGTATAGAC AGGCACTGAA 121 AGATCTTTAC ACGCATCTTA CTCCGATACA ACGTGTGAAC ATTGCGCGCC ATCCCAACCG 181 ACCTACTTTC CTTGATCATA TACATAACAT AACTGACAAG TTTATGGAGC TTCATGGAGA 241 CCGAGCGGGG TATGATGACC CTGCAATTGT GACGGGTATT GGAACCATAG ATGGAAAACG 301 TTACATGTTC ATAGGTCACC AGAAAGGTAG AAACACCAAA GAAAATATAA TGCGGAACTT 361 TGGTATGCCT ACTCCTCACG GATATAGGAA AGCACTTCGG ATGATGTATT ATGCAGACCA 421 TCACGGTTTT CCAATCGTGA CATTTATCGA CACTCCTGGA GCCTATGCAG ATCTTAAATC 481 CGAGATAGAT GACGAGTACA CTGAAGCTGC AATAGCAGTA GGTTTGGAGG AGAGACTAAC 541 GGCAATGCGC GAAGAGTTCT CGAAAGCGAG TTCAGAAGAG CACCTTATGC ACCCGGTTCT 601 GATCGAGAAA ATTGAGAAGC TCAAGGAAGA ATTCAATACC CGTTTGACTG ACGCACCTAA 661 CTACGAGAGC CTAAAATCTA AGCTTAACAT GCTTAGGGAC TTTTCCAGAG CC Predicted gene structure (within gDNA segment 27423 to 24646): Exon 1 27123 27087 ( 37 n); cDNA 1 37 ( 37 n); score: 1.000 Intron 1 27086 26986 ( 101 n); Pd: 0.932 (s: n/a), Pa: 0.994 (s: 1.00) Exon 2 26985 26911 ( 75 n); cDNA 38 112 ( 75 n); score: 1.000 Intron 2 26910 26777 ( 134 n); Pd: 0.902 (s: 1.00), Pa: 0.790 (s: 1.00) Exon 3 26776 26669 ( 108 n); cDNA 113 220 ( 108 n); score: 1.000 Intron 3 26668 26577 ( 92 n); Pd: 0.891 (s: 1.00), Pa: 0.995 (s: 1.00) Exon 4 26576 26416 ( 161 n); cDNA 221 381 ( 161 n); score: 1.000 Intron 4 26415 26323 ( 93 n); Pd: 0.955 (s: 1.00), Pa: 0.973 (s: 1.00) Exon 5 26322 26220 ( 103 n); cDNA 382 484 ( 103 n); score: 1.000 Intron 5 26219 25179 (1041 n); Pd: 0.000 (s: 1.00), Pa: 0.001 (s: 1.00) Exon 6 25178 24951 ( 228 n); cDNA 485 712 ( 228 n); score: 1.000 MATCH 44466AAGCTTTGAAGAATCTAAAGAGGAGATCAAAAGAGAGCTGCTTTTTCCAGACTCTAAGGA- gi+ 1.000 675 0.948 C PGS_44466AAGCTTTGAAGAATCTAAAGAGGAGATCAAAAGAGAGCTGCTTTTTCCAGACTCTAAGGA-_gi+ (27123 27087,26985 26911,26776 26669,26576 26416,26322 26220,25178 24951) Alignment (genomic DNA sequence = upper lines): GAAGCCACTA GTCGAATTGG AGAAGAAGAT TGTTGATGTA CGAATGTGAA TACTTATCAT 27064 |||||||||| |||||||||| |||||||||| ||||||| GAAGCCACTA GTCGAATTGG AGAAGAAGAT TGTTGAT... .......... .......... 37 CTGTTTGCAT CTTAGTTTGA TTTCCCCTAG CAGTTGTATG GTGTGAAGTT TCTCATAACA 27004 .......... .......... .......... .......... .......... .......... 37 TTGGTTCTTG TGATGCAGGT GAGGAAGATG GCGAATGAAA CGGGGTTGGA TTTCACTGAG 26944 || |||||||||| |||||||||| |||||||||| |||||||||| .......... ........GT GAGGAAGATG GCGAATGAAA CGGGGTTGGA TTTCACTGAG 79 CAGATTATCA CTTTGGAGAA CAAGTATAGA CAGGTAAAAA TGCTAGTATA GGTTGGTTTT 26884 |||||||||| |||||||||| |||||||||| ||| CAGATTATCA CTTTGGAGAA CAAGTATAGA CAG....... .......... .......... 112 CAATTTATAT AGTTGAGGGA GACCCACCAT TCAAATTTGA TGTTAATGTA TAGCTACAAT 26824 .......... .......... .......... .......... .......... .......... 112 TTTGTTTGGG AAAGACTTCT TACTGACCCC CGCCATTTTA ATGTCAGGCA CTGAAAGATC 26764 ||| |||||||||| .......... .......... .......... .......... .......GCA CTGAAAGATC 125 TTTACACGCA TCTTACTCCG ATACAACGTG TGAACATTGC GCGCCATCCC AACCGACCTA 26704 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTACACGCA TCTTACTCCG ATACAACGTG TGAACATTGC GCGCCATCCC AACCGACCTA 185 CTTTCCTTGA TCATATACAT AACATAACTG ACAAGGTTTG GTCTCTCAGT AGTAAGGCTT 26644 |||||||||| |||||||||| |||||||||| ||||| CTTTCCTTGA TCATATACAT AACATAACTG ACAAG..... .......... .......... 220 TTTTGTCATG CATTGACGAT AAATTCTTGT TTTTTTACAT TCTTGGGACT CTCTGCTCTT 26584 .......... .......... .......... .......... .......... .......... 220 GTTTCAGTTT ATGGAGCTTC ATGGAGACCG AGCGGGGTAT GATGACCCTG CAATTGTGAC 26524 ||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......TTT ATGGAGCTTC ATGGAGACCG AGCGGGGTAT GATGACCCTG CAATTGTGAC 273 GGGTATTGGA ACCATAGATG GAAAACGTTA CATGTTCATA GGTCACCAGA AAGGTAGAAA 26464 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGTATTGGA ACCATAGATG GAAAACGTTA CATGTTCATA GGTCACCAGA AAGGTAGAAA 333 CACCAAAGAA AATATAATGC GGAACTTTGG TATGCCTACT CCTCACGGGT ATGTTTTCAC 26404 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| CACCAAAGAA AATATAATGC GGAACTTTGG TATGCCTACT CCTCACGG.. .......... 381 GAGGAGCTTC CTGTTTGTAG TTAAATAGAA ACATTCAGAG TCAACAACTC TCTTATCAAA 26344 .......... .......... .......... .......... .......... .......... 381 TTTCTGTAAT GTCTACAACA GATATAGGAA AGCACTTCGG ATGATGTATT ATGCAGACCA 26284 ||||||||| |||||||||| |||||||||| |||||||||| .......... .......... .ATATAGGAA AGCACTTCGG ATGATGTATT ATGCAGACCA 420 TCACGGTTTT CCAATCGTGA CATTTATCGA CACTCCTGGA GCCTATGCAG ATCTTAAATC 26224 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCACGGTTTT CCAATCGTGA CATTTATCGA CACTCCTGGA GCCTATGCAG ATCTTAAATC 480 CGAGGAACTT GGACAGGTAG AAGAAATGTC TTTTATGTTT GAAATGTACT ACTGGTTTAT 26164 |||| CGAG...... .......... .......... .......... .......... .......... 484 CTGCATCATA ACATTTTTTT TCTTATCTGT ATAACAGGGT GAAGCGATTG CCAACAATCT 26104 .......... .......... .......... .......... .......... .......... 484 GAGGACGATG TTCGGCCTGA AAGTGCCAAT TCTTTCTATT GTCATTGGGG AAGGTGGTTC 26044 .......... .......... .......... .......... .......... .......... 484 TGGTGGTGCC CTAGCCATTG GCTGTGCGAA TAAAATGCTG ATGCTCGAAA ACGCAGTTTT 25984 .......... .......... .......... .......... .......... .......... 484 CTATGTTGCC AGGTAAACTT TTTCCCAATG TAGTAGGTAT GATGAGATGA TGTTTTTACA 25924 .......... .......... .......... .......... .......... .......... 484 GATTGATTTA TCACGTCTGT TGTTTTGCCT TTCACAGTCC AGAGGCATGT GCAGCGATCT 25864 .......... .......... .......... .......... .......... .......... 484 TGTGGAAGAC TTCTAAGGCT GCTCCTGAGG TATGTACACT GCTCTACTAC TCGATTAAAA 25804 .......... .......... .......... .......... .......... .......... 484 GTGATCATGT AGACTCTAAT ATTACACTTG AATGAAACTC GTGTACGCAG GCTGCTGAAA 25744 .......... .......... .......... .......... .......... .......... 484 AGCTTAGAAT TACCTCCAAG GAGCTGGTCA AGCTTAATGT AGCTGATGGA ATCATTCCTG 25684 .......... .......... .......... .......... .......... .......... 484 TAACTGATCC TACCTCTAAT CTATCTGTTT CCTCTTACAC TTCTTTTCTC ACTTTGGATC 25624 .......... .......... .......... .......... .......... .......... 484 ACTGAGTAAC TCTCACATGT TGTACAGGAA CCGCTTGGAG GGGCCCATGC CGATCCTTCA 25564 .......... .......... .......... .......... .......... .......... 484 TGGACGTCGC AGCAAATAAA GATTGCTATC AATGAAAACA TGAATGTAGG CATAGTTGTT 25504 .......... .......... .......... .......... .......... .......... 484 CTTCTCAACG CTGGATGTGG ATCTAAAAGA GTTCTAACTT TTTTTTGGGA ACTTGTAGGA 25444 .......... .......... .......... .......... .......... .......... 484 ATTCGGAAAA ATGAGTGGGG AGGAGCTCCT GAAACACAGG ATGGCTAAGT ACCGAAAGAT 25384 .......... .......... .......... .......... .......... .......... 484 TGGAGTGTTC ATAGAGGGCG AACCAATAGA GCCAAGTAGG AAAATCAACA TGAAGAAAAG 25324 .......... .......... .......... .......... .......... .......... 484 GGAAGCCGTG TTCTCAGATA GCCGGAAGCT GCAGGGTGAG GTTGACAAGC TGAAGGAGCA 25264 .......... .......... .......... .......... .......... .......... 484 GATTCTGAAA GCCAAGGAGA CGTCTACGGA AGCCGAGCCT TCGAGTGAAG TTCTTAATGA 25204 .......... .......... .......... .......... .......... .......... 484 GATGATTGAG AAACTCAAAT CCGAGATAGA TGACGAGTAC ACTGAAGCTG CAATAGCAGT 25144 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....ATAGA TGACGAGTAC ACTGAAGCTG CAATAGCAGT 519 AGGTTTGGAG GAGAGACTAA CGGCAATGCG CGAAGAGTTC TCGAAAGCGA GTTCAGAAGA 25084 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGTTTGGAG GAGAGACTAA CGGCAATGCG CGAAGAGTTC TCGAAAGCGA GTTCAGAAGA 579 GCACCTTATG CACCCGGTTC TGATCGAGAA AATTGAGAAG CTCAAGGAAG AATTCAATAC 25024 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACCTTATG CACCCGGTTC TGATCGAGAA AATTGAGAAG CTCAAGGAAG AATTCAATAC 639 CCGTTTGACT GACGCACCTA ACTACGAGAG CCTAAAATCT AAGCTTAACA TGCTTAGGGA 24964 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCGTTTGACT GACGCACCTA ACTACGAGAG CCTAAAATCT AAGCTTAACA TGCTTAGGGA 699 CTTTTCCAGA GCC 24951 |||||||||| ||| CTTTTCCAGA GCC 712 Total number of EST alignments reported: 1 ________________________________________________________________________________ Predicted gene locations (1): PGL 1 (- strand): 27123 24951 AGS-1 (27123 27087,26985 26911,26776 26669,26576 26416,26322 26220,25178 24951) SCR (e 1.000 d 0.932 a 0.994,e 1.000 d 0.902 a 0.790,e 1.000 d 0.891 a 0.995,e 1.000 d 0.955 a 0.973,e 1.000 d 0.000 a 0.001,e 1.000) Exon 1 27123 27087 ( 37 n); score: 1.000 Intron 1 27086 26986 ( 101 n); Pd: 0.932 Pa: 0.994 Exon 2 26985 26911 ( 75 n); score: 1.000 Intron 2 26910 26777 ( 134 n); Pd: 0.902 Pa: 0.790 Exon 3 26776 26669 ( 108 n); score: 1.000 Intron 3 26668 26577 ( 92 n); Pd: 0.891 Pa: 0.995 Exon 4 26576 26416 ( 161 n); score: 1.000 Intron 4 26415 26323 ( 93 n); Pd: 0.955 Pa: 0.973 Exon 5 26322 26220 ( 103 n); score: 1.000 Intron 5 26219 25179 (1041 n); Pd: 0.000 Pa: 0.001 Exon 6 25178 24951 ( 228 n); score: 1.000 PGS (27123 27087,26985 26911,26776 26669,26576 26416,26322 26220,25178 24951) gi+ 3-phase translation of AGS-1 (-strand): . . . . : . . 27123 GAAGCCACTAGTCGAATTGGAGAAGAAGATTGTTGAT : GTGAGGAAGATGGCGAATGAAAC E A T S R I G E E D C - : C E E D G E - N K P L V E L E K K I V D : V R K M A N E T S H - S N W R R R L L M : - G R W R M K . . . . . . : 26962 GGGGTTGGATTTCACTGAGCAGATTATCACTTTGGAGAACAAGTATAGACAG : GCACTGAA G V G F H - A D Y H F G E Q V - T : G T E G L D F T E Q I I T L E N K Y R Q : A L K R G W I S L S R L S L W R T S I D R : H - . . . . . . 26768 AGATCTTTACACGCATCTTACTCCGATACAACGTGTGAACATTGCGCGCCATCCCAACCG R S L H A S Y S D T T C E H C A P S Q P D L Y T H L T P I Q R V N I A R H P N R K I F T R I L L R Y N V - T L R A I P T . . . . : . . 26708 ACCTACTTTCCTTGATCATATACATAACATAACTGACAAG : TTTATGGAGCTTCATGGAGA T Y F P - S Y T - H N - Q : V Y G A S W R P T F L D H I H N I T D K : F M E L H G D D L L S L I I Y I T - L T S : L W S F M E . . . . . . 26556 CCGAGCGGGGTATGATGACCCTGCAATTGTGACGGGTATTGGAACCATAGATGGAAAACG P S G V - - P C N C D G Y W N H R W K T R A G Y D D P A I V T G I G T I D G K R T E R G M M T L Q L - R V L E P - M E N . . . . . . 26496 TTACATGTTCATAGGTCACCAGAAAGGTAGAAACACCAAAGAAAATATAATGCGGAACTT L H V H R S P E R - K H Q R K Y N A E L Y M F I G H Q K G R N T K E N I M R N F V T C S - V T R K V E T P K K I - C G T . . . : . . . 26436 TGGTATGCCTACTCCTCACGG : ATATAGGAAAGCACTTCGGATGATGTATTATGCAGACCA W Y A Y S S R : I - E S T S D D V L C R P G M P T P H G : Y R K A L R M M Y Y A D H L V C L L L T : D I G K H F G - C I M Q T . . . . . . 26283 TCACGGTTTTCCAATCGTGACATTTATCGACACTCCTGGAGCCTATGCAGATCTTAAATC S R F S N R D I Y R H S W S L C R S - I H G F P I V T F I D T P G A Y A D L K S I T V F Q S - H L S T L L E P M Q I L N . : . . . . . 26223 CGAG : ATAGATGACGAGTACACTGAAGCTGCAATAGCAGTAGGTTTGGAGGAGAGACTAAC R : D R - R V H - S C N S S R F G G E T N E : I D D E Y T E A A I A V G L E E R L T P R : - M T S T L K L Q - Q - V W R R D - . . . . . . 25122 GGCAATGCGCGAAGAGTTCTCGAAAGCGAGTTCAGAAGAGCACCTTATGCACCCGGTTCT G N A R R V L E S E F R R A P Y A P G S A M R E E F S K A S S E E H L M H P V L R Q C A K S S R K R V Q K S T L C T R F . . . . . . 25062 GATCGAGAAAATTGAGAAGCTCAAGGAAGAATTCAATACCCGTTTGACTGACGCACCTAA D R E N - E A Q G R I Q Y P F D - R T - I E K I E K L K E E F N T R L T D A P N - S R K L R S S R K N S I P V - L T H L . . . . . . 25002 CTACGAGAGCCTAAAATCTAAGCTTAACATGCTTAGGGACTTTTCCAGAGCC L R E P K I - A - H A - G L F Q S Y E S L K S K L N M L R D F S R A T T R A - N L S L T C L G T F P E Maximal non-overlapping open reading frames (>= 64 codons): >44466AAGCTTTGAAGAATCTAAAGAGGAGATCAAAAGAGAGCTGCTTTTTCCAGACTCTAAGGA-_PGL-1_AGS-1_PPS_1 (27122 27087,26985 26911,26776 26669,26576 26416,26322 26220,25178 24951) (frame '2'; 711 bp, 237 residues) 1 KPLVELEKKI VDVRKMANET GLDFTEQIIT LENKYRQALK DLYTHLTPIQ RVNIARHPNR 61 PTFLDHIHNI TDKFMELHGD RAGYDDPAIV TGIGTIDGKR YMFIGHQKGR NTKENIMRNF 121 GMPTPHGYRK ALRMMYYADH HGFPIVTFID TPGAYADLKS EIDDEYTEAA IAVGLEERLT 181 AMREEFSKAS SEEHLMHPVL IEKIEKLKEE FNTRLTDAPN YESLKSKLNM LRDFSRA