|  Entamoeba histolytica Genome Project
  TIGR Home  |  TIGR Database  |  Genome Home  |  Introduction  |  Related Links  |  Help   

What's New


Sequencingmethodology


Annotation Database


Search

    Sequence Search
    Gene Name Search
    Contig Name Search
    Locus Search
    HMM Search


Download


Data Release Policy


E.histolytica


What's New - The TIGR Entamoeba histolytica Genome Project

Funding from the NIH, grant No #R01-AI46516
  • December 17, 2003

    Sequence data from the free living amoeba Acanthamoeba castellanii has been posted and is available for download and searching. ~7,500 sequence reads have been assembled to represent almost 5.7 Mb of uniquee sequence and are now availible. This data will promote understanding of the organism, the amoebic lifestyle and may assist in the annotation and interpretation of the data coming from the Entamoeba species genome projects.

  • November 11, 2003

    The final sequence assembly representing scaffolds from approximately 12X genome coverage of Entamoeba histolytica using data from both TIGR and the Sanger centre is now available for searching and download. The subset of this assembly being used for genome analysis is called final_ASM.fasta. The CDS and predicted protein datasets are split up into those predicted proteins that have been identified as LINE or SINE elements LINE_SINE.pep and LINE_SINE_CDS.seq and the remaining dataset of genes and predicted proteins ALL_WO_LINE_SINE.pep ALL_WO_LINE_SINE_CDS.seq. These datasets may be subjected to slight changes as we remove errors in the automated identification or predicted coding regions however the majority of the predicted proteins are expected to remain intact.

  • June 17, 2003

    A sequence assembly representing scaffolds from approximately 9X genome coverage of Entamoeba histolytica is now available for searching and download.

  • April 12, 2003

    A sequence assembly representing sequence reads from Entamoeba dispar is now available for searching and download. The scaffolded assemblies including singleton reads represent approximately 1.5 - 2 fold sequence coverage of the Entamoeba dispar genome. This data should be analysed with caution as there is known to currently exist a small amount of contamination from Crithidia and other organisms.

  • March 26, 2003

    A fasta file of predicted peptide sequences which are predicted by the program TMHMM http://www.cbs.dtu.dk/services/TMHMM/ to contain 2 or more transmembrane helices is now available for searching and download.

  • March 10, 2003

    A sequence assembly representing sequence reads from Entamoeba invadens representing data from both TIGR and the Sanger Centre are now available for searching and download. The scaffolded assemblies represent approximately 2.8 fold genome coverage of the Entamoeba invadens genome. This data should be analysed with caution as there is known to currently exist a small amount of contamination from Entamoeba histolytica sequence reads.

  • Feb 28, 2003

    Both upstream and downstream sequences between the Predicted CDs's from the automated annotation of the 7X assembly are now available for searching and download. These are based on the predicted genes and may well contain portions of genes in some cases.

  • Jan 28, 2003

    Predicted CDs's and predicted protein sequences from the automated annotation of the 7X assembly now available for download.

  • Jan 27, 2003

    Automated annotation now available for the assemblies of >10kb from an approximately 7-fold sequence coverage assembly of the Entamoeba histolytica genome. These auto-annotated assemblies represent approximately 12 Mb of unique sequence coverage and a large majority of the coding regions of Entamoeba histolytica. The 7-fold coverage assembly is derived from sequence data from both TIGR and the Sanger Centre who has kindly provided their reads.

  • Dec 12, 2002

    Bulk download and searching capabilities available for an approximately 7-fold sequence coverage assembly of the Entamoeba histolytica genome. This 7-fold coverage represents sequence data from both TIGR and the Sanger Centre who has kindly provided their reads. Also available for searching is a dataset of 16,000 Entamoeba invadens sequence reads.

  • Jan 24, 2002

    The Wellcome Trust has recently funded the Sanger Institute Pathogen Sequencing Unit, in collaboration with Graham Clark at the London School of Hygiene and Tropical Medicine, to undertake whole genome shotgun sequencing of E. histolytica. A collaborative approach is now underway at both centers to achieve a significantly higher sequence coverage of the Entamoeba genome.
    For more info go to http://www.sanger.ac.uk/Projects/E_histolytica

  • Sep 20, 2001

    Bulk / individual download of 2x assemblies available

  • Aug 27, 2001

    6621 sequence reads submitted to the GSS subdivision of Genbank

  • Aug 07, 2001

    9,495 sequence reads submitted to the GSS subdivision of Genbank

  • Dec 14, 2000

    26,309 sequence reads submitted to the GSS subdivision of Genbank

  • Nov 14, 2000

    22,821 sequence reads submitted to the GSS subdivision of Genbank




For Comments/Questions send mail to enta@tigr.org.