Genetic Data Environment version 2.2

Table of Contents

Introduction

The Genetic Data Environment is part of a growing set of programs for manipulating and analyzing "genetic" data. It differs in design from other analysis programs in that it is intended to be an expandable and customizable system, while still being easy to use.

There are a tremendous number of publicly available programs for sequence analysis. Many of these programs have found their way into commercial packages which incorporate them into integrated, easy to use systems. The goal of the GDE is to minimize the amount of effort required to integrate sequence analysis functions into a common environment. The GDE takes care of the user interface issues, and allows the programmer to concentrate on the analysis itself. Existing programs can be tied into the GDE in a matter of hours (or minutes) as apposed to days or weeks. Programs may be written in any language, and still seamlessly be incorporated into the GDE.

These programs are, and will continue to be, available at no charge. It is the hope that this system will grow in functionality as more and more people see the benefits of a modular analysis environment. Users are encouraged to make modifications to the system, and forward all changes and additions to Steven Smith at smith@bioimage.millipore.com.

What's New for this Release

GDE 2.2 represents a maintainence release. Several small bugs have been fixed, as well as new editing features and user interface elements. Also, I have tried to update all of the contributed external programs to their latest release. Updated programs include: Phylip, Treetool, LoopTool, Readseq, Blast, and Fasta. Improved versions of printing, and translate are included as well. As for new editing features, a useful "yanking" feature has been added by Scott Ferguson from Exxon Research, and the capability to export the colormap for a seqeunce (see appendix A and appendix C). Among the bugs fixed in this release are:

System Requirements

GDE 2.2 currently runs on the Sun family of workstations. This includes the Sun3 and Sun4 (Sparcstation) systems. It was written in XView, and runs on Suns using OpenWindows 3.0 or MIT's X Windows. It runs in both monochrome, and color, and can be run remotely on any system capable of running X Windows Release 4. You should have at least 15 meg of free disk space available. The binary release for SparcStations was compiled under SunOS 4.1.2 and Openwindows 3.0.

We are also supporting a DECStation version of GDE. This is running under XView 3.0/X11R5. We encourage interested people to port the programs to their favorite Unix platform. There are informal ports to the SGI line of unix machines.

Note to Motif users

GDE2.2 can be run using different window managers. The most common alternative to olwm is the Motif window manager (mwm). The only problem in using another window manager is that the status line is not displayed. We have added a "Message panel" as an option under "File->Properties" which displays all of the information contained on the status line.

People using other window managers may also prefer using xterm, and xedit as default terminals and file editors. This can be accomplished by replacing all occurrences of 'shelltool' and 'textedit' with 'xterm -e' and 'xedit' in the $GDE_HELP_DIR/.GDEmenus file.

Installing the GDE

Instructions for the source code release are included in the README.install file.

The binary installations consist of creating a GDE directory, such as /usr/local/GDE, and un-taring the installation tarfile into the directory. If you are installing the GDE for your own use, then you can simply make a GDE subdirectory. There is no need to be superuser (root) to do the installation in your own directory. For example:

After this, each new user will need to add two lines to their .cshrc file so that they can find the gde programs and files.

You may wish to make a copy of the .GDEmenus file from the help directory into your home directory. This is only necessary if you wish to modify your menus. Copy the demo files from /usr/local/GDE/demo into your local directory, and you are now ready to use the GDE.

FastA and Blast need to have the properly formatted databases installed in the $GDE_HELP_DIR under the directories FASTA/PIR, FASTA/GENBANK, BLAST/pir BLAST/genbank. For FASTA, simply copy a version of PIR and Genbank into the proper directory. Alternately, the PIR and GENBANK files can be symbolic links to copies of Genbank held elsewhere on your system. You may need to look at the .GDEmenus file in $GDE_HELP_DIR to verify that you are using the same divisions for these databases.

Blast installation involves converting PIR and GENBANK to a temporary FASTA format (using pir2fasta and gb2fasta) and then using pressdb for nucleic acid, and setdb for amino acid to reformat the databases again into blast format. The .GDEmenus file is currently set up to search with blast using the following databases: pir, genpept, genupdate, and genbank. If you wish to divide these into subdivisions, then the .GDEmenus file will have to be edited.

The most up to date release of blast can be obtained via anonymous ftp to ncbi.nlm.nih.gov. The most recent release of FASTA can be obtained via anonymous ftp to uvaarpa.virginia.edu. It is strongly recommended that you retrieve these copies, and become familiar with their setup.


Using the GDE

It is assumed that the user is familiar with the Unix, and OpenWindows/Xwindows environments. It is also assumed that people running standard MIT X-Windows will be using the OpenLook window manager (olwm). Other window managers work with varied success. If you are not certain as to how your system is set up, please contact your systems administrator.

Once the window system has started, and a terminal window (xterm, shelltool etc.) you can start up the GDE by typing: gde tRNAs. This should load the sample data set tRNAs into GDE, and the following window should appear:



This is the sequence alignment editor. It consists of a color alignment display, a set of command menus, horizontal and vertical scroll bars to navigate the alignment, a list of short sequence names (usually the LOCUS of a Genbank entry), and a status line. The cursor is located in the upper left corner.


Data Types

The GDE supports several data types. The data types supported in 2.2 are DNA, RNA, protein (single letter codes), mask sequence, and text.


Menu Functions


External Functions

See appendix C for a full description of functions supported in GDE 2.2 All external functions are described in the configuration file .GDEmenus. Here is a brief description of some of the basic functions included.

Citation of work

We ask that any published work using any of the external functions in GDE cite the appropriate authors. Please see Appendix C for references.

Acknowledgments

I would like to thank the following people for their input and assistance and code used in the development of the GDE: Carl Woese, Gary Olsen and Mike Maciukenas at University of Illinois Dept of Microbiology, Ross Overbeek at Argonne National Laboratories,Walter Gilbert, Patrick Gillevet, Chunwei Wang, Susan Russo and Erik Bunce at the Harvard Genome Laboratory. I would also like to personally thank the following people for their permission to include their software with this release of GDE.

Many thanks to all the people who have directly and indirectly helped with the ongoing support of GDE. It is only by the generosity of these people that GDE has been successful.

Copyright Notice

The Genetic Data Environment (GDE) software and documentation are not in the public domain. Portions of this code are owned and copyrighted by the The Board of Trustees of the University of Illinois and by Steven Smith. External functions used by GDE are the proporty of, their respective authors. This release of the GDE program and documentation may not be sold, or incorporated into a commercial product, in whole or in part without the expressed written consent of the University of Illinois and of its author, Steven Smith.

All interested parties may redistribute the GDE as long as all copies are accompanied by this documentation, and all copyright notices remain intact. Parties interested in redistribution must do so on a non-profit basis, charging only for cost of media. Modifications to the GDE core editor should be forwarded to the author Steven Smith. External programs used by the GDE are copyright by, and are the property of their respective authors unless otherwise stated.

While all attempts have been made to insure the integrity of these programs:

Disclaimer

THE UNIVERSITY OF ILLINOIS, HARVARD UNIVERSITY AND THE AUTHOR, STEVEN SMITH GIVE NO WARRANTIES, EXPRESSED OR IMPLIED FOR THE SOFTWARE AND DOCUMENTATION PROVIDED, INCLUDING, BUT NOT LIMITED TO WARRANTY OF MERCHANTABILITY AND WARRANTY OF FITNESS FOR A PARTICULAR PURPOSE. User understands the software is a research tool for which no warranties as to capabilities or accuracy are made, and user accepts the software "as is." User assumes the entire risk as to the results and performance of the software and documentation. The above parties cannot be held liable for any direct, indirect, consequential or incidental damages with respect to any claim by user or any third party on account of, or arising from the use of software and associated materials. This disclaimer covers both the GDE core editor and all external programs used by the GDE.