home *** CD-ROM | disk | FTP | other *** search
- Features of DNA Translator and Aligner HyperCard 2.0 stacks
- Version 1.0k 6/14/92 Copyright 1990,1991,1992 D. J. Eernisse
- Email: usergdef@um.cc.umich.edu (internet) or usergdef@umichum (bitnet)
- Mailing address: Museum of Zoology, Univ. of Michigan, Ann Arbor, MI 48109
- For further description see:
-
- Eernisse, D. J. 1992. DNA Translator and Aligner: HyperCard utilities
- to aid phylogenetic analysis of molecules. CABIOS 8: 177-184.
-
- The DNA Translator is a HyperCard 2.0 stack providing three complementary
- sets of utilities for viewing and manipulating molecular data on a
- Macintosh computer. First, DNA Translator displays codon and amino acid
- usage data as it differs for a wide variety of organisms and organelles.
- Dynamically-constructed graphs display percentage usage for a particular
- codon relative to the other 63 codons or to only other codons with
- equivalent amino acid coding. A user can select from a diverse list of some
- 70 taxon-organelle combinations. Second, a "gene mapping" facility draws
- and displays 2 linearized gene maps for comparison, automatically
- adjustable to desired scales and locations along all or part of the mapped
- molecule. Maps and select corresponding complete sequence data are provided
- for most available fully documented mitochondrial and chloroplast DNA
- (mtDNA and cpDNA) sequences, which include 9 animal, 1 yeast, and 1 ciliate
- mtDNAs and 3 green plant cpDNAs. Additional gene maps may be user-created
- by a direct file conversion of standard GenBank- or EMBL-format documented
- sequences and their features tables. Third, DNA Translator is a powerful
- workbench of specialized sequence manipulation tools, catering especially
- to those with interests in phylogenetic analysis. A user can extract the
- sequence of any particular mapped gene or region by clicking on
- the corresponding place on the gene map or by menu selection of the gene or
- feature and mapped taxon. DNA Translator can also import single or
- multiple sequences in a variety of formats, including multiple aligned
- sequence output of various programs (EuGene, Prophet, CLUSTAL, Nexus,
- PHYLIP, etc.), and then further manipulate, interleave, compare or
- translate the gene sequences to amino acids. Multiple aligned sequences can
- be converted to Nexus, Hennig86 or PHYLIP for subsequent phylogenetic
- analysis. Most known deviations from the "universal" code, which are
- typical for mtDNAs, may optionally be used during translation.
- The codon usage, translation and gene mapping data may be exported or
- imported in spreadsheet format for incorporating additional molecules of
- interest or in various standard formats (UWGCG, GenBank,
- EMBL,Intelligenetics). A built-in editor supports sequence entry with
- optional computer-speaking for error checking, and a variety of output
- conversion options. The current version is freely available via anonymous
- ftp for noncommercial use only.
-
- A. Import Formats Accepted
- 1. Multiple sequence alignments
- a. Simple text files of string sequences, with or without
- interleaves or match characters
- b. MBIR (EuGene, Prophet) "Doolittle" progressive alignments
- c. CLUSTAL nucleo- or peptide output.
- d. Nexus (PAUP, MacClade) files
- e. Phylip "Result" output files
- 2. String sequences
- a. String sequence text files of the form: Name AGCTACCT...
- b. Data input from built-in sequence entry editor
- c. Sequences extracted from the provided gene mapper
- d. String output generated by many commonly used programs
- 3. GenBank/EMBL or PIR-CODATA documented sequence files
- a. Converted to string sequences for use in conversions below
- b. GenBank or EMBL documented features to spreadsheet matrix
- c. Matrix directly convertable to rescalable gene map
- d. Gene map allows extraction of any mapped or custom subsequence
- B. Conversions provided
- 1. Multiple sequence alignments (A1) converted/exported
- in the following formats:
- a. Nexus (PAUP, MacClade) with optional cost matrices added
- b. Hennig86 (Nucleotide data only)
- c. Phylip 3.2 or 3.3 formats
- d. Multiple sequence strings (A2a) with gaps preserved or deleted
- 2. String sequences (A2a,B1d) converted/exported in the
- following formats:
- a. Unmodified to be reimported or converted as needed
- b. As straight single sequence strings for import by MBIR
- (EuGene, Prophet), Authorin, Gene Construction Kit, etc.
- c. Intelligenetics format for import by many programs
- d. GenBank, EMBL, FASTP, or PIR-CODATA formats for viewing or
- import by many programs
- e. Simple interleaved format with optional match characters
- for manual alignment or reimport (A1a)
- f. Direct export of corresponding sequences (e.g., all animal
- mtDNA cyt. B genes) from gene mapping facility to Aligner stack
- g. A subblock of aligned strings to MULFOLD (RNA folding) format
- h. Consensus sequence created by combining two or more sequences
- employing ambiguous nucleotide symbols (IUPAC-IUB)
- i. Subsequence, DNA <-> RNA, upper <-> lower case, ACGTU -> 01233,
- filter IUPAC-IUB codes, complementary strand sequence conversions
- j. Tally and display autapomorphies (site-unique nucleotides or gaps)
- for sets of aligned nucleotide strings
- k. Compute compositional statistics of each nucleotide string
- 3. DNA/RNA -> peptide translation
- > 1st, 1st & 2nd, or all 3 possible reading frames output
- > Formatted output displays codons below amino acid abbreviations
- with either 1- or 3-letter amino acid abbreviations
- > String output is useful for export or conversion (B2)
- > Can use standard or any of the provided codon usage tables
- > Custom codon usage tables can be added using a table editor
- > Codons with ambiguous nucleotide symbols (IUPAC-IUB) are
- translated as appropriate when translation is unambiguous
- > Optional termination when stop codons are encountered
- 4. Peptide -> DNA translation
- > Peptide strings can be backtranslated, using the reverse of
- whichever codon usage table the user selects
- > Backtranslation uses IUPAC-IUB conventions for ambiguous
- nucleotides
- D. Nucleotide or peptide sequence entry
- > Special Editor field has autoformatting capabilities
- > Adjustable computer speaking supported during or after entry
- > Sequences can be edited, manipulated, or exported (B2)
- E. Data provided with stack
- > Codon usage patterns for about 70 organism/organelle combinations
- > Most mitochondrial variation in coding supported (3)
- > Gene maps of 11 mtDNAs plus string sequences
- > Gene maps and matrices for 3 green plant chloroplast DNAs
- F. General Features
- > Online help facility is available from any card in stack
- > References, taxon names, and sample data provided
- > Pulldown or popup menus or dialogs used for commands
- > Buttons or pulldown menus are used for stack navigation
- > Data fields can be locked or unlocked for text entry
- > Data fields shrink down or expand by clicking
- > Contents of fields exportable as text or printable
- > Codon usage tables output for any incorporated taxon
- > Gene maps and codon usage graphs have special printing options
- > Default word processor for exported text files can be chosen
- > PAUP/MacClade specification for exported Nexus files
- > PAUP/MacClade open directly from the stack
- > Add gene mapping or utility cards as required
- > All data, scripts and resources used are easily accessible
- > Custom XFCNs by Nigel Perry enable efficient manipulations
- G. Hardware and System Requirements
- > Requires any Macintosh that can run HyperCard 2.0
- > HyperCard 2.x (2.0 or 2.1) (Widely available for Macintosh users)
- > Macintosh System 6.0.5 or greater (for HyperCard 2.0)
- > 2 or more MB Ram recommended
- H. Availability of current version
- > Anonymous ftp to um.cc.umich.edu (35.1.1.43)
- After connection type 'cd gdef' and 'get dnastack.hqx'
- Resulting file needs to be "debinhexed"
- This will most likely be more current than the following
- due to more frequent updates.
- > Anonymous ftp to at least the following site
- Ftp.Bio.Indiana.Edu (after connection: 'cd molbio/mac'
- and 'get DNASTACK.HQX')
- > EMBL file server by sending the command:
- GET MAC_SOFTWARE:DNATRANSLATOR.HQX to NETSERV@EMBL.BITNET
- Entire package is approximately 550K in binhexed form
- > As of 9/24/91 includes Aligner stack described below
-
- Aligner is a freeware HyperCard 2.x stack for manual alignment of
- sequences by D. J. Eernisse, author of DNA Translator stack. Aligner was
- inspired by a stack called MultiDNA by Ralph Gonzalez, but was created
- completely from scratch. Enhancements compared to MultiDNA include: 1)
- Number of sequences that can be aligned increased from 6 to 100; 2)
- interleave formatting and number lines automatically adjust for the
- number of sequences imported; 3) now accomodates sequences up to 30,000
- bp or amino acid residues long; 4) imports or exports multiple sequences
- that are simple return-delineated strings or in 'DNA Translator' stack
- string format (i.e., 'Name<space>AGCTGA...<rtn>'); 5) optionally
- interleaves exported sequences; 6) enlarged for use on a 640 x 480
- standard color or larger monitor (automatically adjusts for standard
- small Mac screens); 7) much faster switching between sequences to be
- edited; 8) can toggle between match characters (dashes) to the first
- sequence and no match characters; 9) speaks nucleotide/peptide
- characters during keyboard entry or after entry starting from the
- current insertion point; 10) various sequence manipulations, such as
- lower <-> UPPER case; 11) can add/delete gaps to/from all but the
- currently selected sequence; 12) align amino acids, then introduce gaps
- to the corresponding unaligned nucleotide strings to preserve the
- amino acid alignment; 13) custom menus; 14) several options to color
- text in the alignment including a color coding of nucleotide triplets
- according to their amino acid coding, with multiple alternate genetic
- codes supported; 15) error checking routines for comparing the
- nongap characters against your starting file; 16) import multiple
- sequences for a particular gene directly from DNA Translator stack
- gene mapping facility (e.g., all avaialable animal mtDNA sequences
- encoding cyt B gene); 17) optional help facility displays the function
- of fields or buttons as the cursor enters them.
-
- This stack is distributed separately or as part of the more extensive
- DNA Translator stack package. The best way to get the most current
- version of either stack is via anonymous ftp to "um.cc.umich.edu) then
- "cd gdef" and "get aligner.hqx" or "get dnastack.hqx" for Aligner stack
- only or the entire package, respectively. Then debinhex the resulting files.
- Aligner by itself is under 300 K uncompressed, while DNA Translator and
- associated files occupy about 1 MB on your hard disk (500K in compressed
- Binhex format).
-
- This stack is only free for any noncommercial use and is not public
- domain. It is copyrighted 1991,1992 by D. J. Eernisse, and uses some XFCN
- resources copyrighted by Nigel Perry with similar copyright restrictions
- (see stack script for details). For more information, contact:
-
- Douglas J. Eernisse Museum of Zoology, Univ. of Michigan, Ann
- Arbor, MI 48109 USA usergdef@um.cc.umich.edu or usergdef@umichum
-
- For further information, tips, trouble-shooting, and version history,
- see the online help facility.