Otherware

home *** CD-ROM | disk | FTP | other *** search

/ Otherware / Otherware_1_SB_Development.iso / mac / misc / medical / dna_stac.cpt / dnastack.readme

Wrap

Text File | 1992-07-30 | 12KB | 198 lines

Features of DNA Translator and Aligner HyperCard 2.0 stacks Version 1.0k 6/14/92 Copyright 1990,1991,1992 D. J. Eernisse Email: usergdef@um.cc.umich.edu (internet) or usergdef@umichum (bitnet) Mailing address: Museum of Zoology, Univ. of Michigan, Ann Arbor, MI 48109 For further description see: Eernisse, D. J. 1992. DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules. CABIOS 8: 177-184. The DNA Translator is a HyperCard 2.0 stack providing three complementary sets of utilities for viewing and manipulating molecular data on a Macintosh computer. First, DNA Translator displays codon and amino acid usage data as it differs for a wide variety of organisms and organelles. Dynamically-constructed graphs display percentage usage for a particular codon relative to the other 63 codons or to only other codons with equivalent amino acid coding. A user can select from a diverse list of some 70 taxon-organelle combinations. Second, a "gene mapping" facility draws and displays 2 linearized gene maps for comparison, automatically adjustable to desired scales and locations along all or part of the mapped molecule. Maps and select corresponding complete sequence data are provided for most available fully documented mitochondrial and chloroplast DNA (mtDNA and cpDNA) sequences, which include 9 animal, 1 yeast, and 1 ciliate mtDNAs and 3 green plant cpDNAs. Additional gene maps may be user-created by a direct file conversion of standard GenBank- or EMBL-format documented sequences and their features tables. Third, DNA Translator is a powerful workbench of specialized sequence manipulation tools, catering especially to those with interests in phylogenetic analysis. A user can extract the sequence of any particular mapped gene or region by clicking on the corresponding place on the gene map or by menu selection of the gene or feature and mapped taxon. DNA Translator can also import single or multiple sequences in a variety of formats, including multiple aligned sequence output of various programs (EuGene, Prophet, CLUSTAL, Nexus, PHYLIP, etc.), and then further manipulate, interleave, compare or translate the gene sequences to amino acids. Multiple aligned sequences can be converted to Nexus, Hennig86 or PHYLIP for subsequent phylogenetic analysis. Most known deviations from the "universal" code, which are typical for mtDNAs, may optionally be used during translation. The codon usage, translation and gene mapping data may be exported or imported in spreadsheet format for incorporating additional molecules of interest or in various standard formats (UWGCG, GenBank, EMBL,Intelligenetics). A built-in editor supports sequence entry with optional computer-speaking for error checking, and a variety of output conversion options. The current version is freely available via anonymous ftp for noncommercial use only. A. Import Formats Accepted 1. Multiple sequence alignments a. Simple text files of string sequences, with or without interleaves or match characters b. MBIR (EuGene, Prophet) "Doolittle" progressive alignments c. CLUSTAL nucleo- or peptide output. d. Nexus (PAUP, MacClade) files e. Phylip "Result" output files 2. String sequences a. String sequence text files of the form: Name AGCTACCT... b. Data input from built-in sequence entry editor c. Sequences extracted from the provided gene mapper d. String output generated by many commonly used programs 3. GenBank/EMBL or PIR-CODATA documented sequence files a. Converted to string sequences for use in conversions below b. GenBank or EMBL documented features to spreadsheet matrix c. Matrix directly convertable to rescalable gene map d. Gene map allows extraction of any mapped or custom subsequence B. Conversions provided 1. Multiple sequence alignments (A1) converted/exported in the following formats: a. Nexus (PAUP, MacClade) with optional cost matrices added b. Hennig86 (Nucleotide data only) c. Phylip 3.2 or 3.3 formats d. Multiple sequence strings (A2a) with gaps preserved or deleted 2. String sequences (A2a,B1d) converted/exported in the following formats: a. Unmodified to be reimported or converted as needed b. As straight single sequence strings for import by MBIR (EuGene, Prophet), Authorin, Gene Construction Kit, etc. c. Intelligenetics format for import by many programs d. GenBank, EMBL, FASTP, or PIR-CODATA formats for viewing or import by many programs e. Simple interleaved format with optional match characters for manual alignment or reimport (A1a) f. Direct export of corresponding sequences (e.g., all animal mtDNA cyt. B genes) from gene mapping facility to Aligner stack g. A subblock of aligned strings to MULFOLD (RNA folding) format h. Consensus sequence created by combining two or more sequences employing ambiguous nucleotide symbols (IUPAC-IUB) i. Subsequence, DNA <-> RNA, upper <-> lower case, ACGTU -> 01233, filter IUPAC-IUB codes, complementary strand sequence conversions j. Tally and display autapomorphies (site-unique nucleotides or gaps) for sets of aligned nucleotide strings k. Compute compositional statistics of each nucleotide string 3. DNA/RNA -> peptide translation > 1st, 1st & 2nd, or all 3 possible reading frames output > Formatted output displays codons below amino acid abbreviations with either 1- or 3-letter amino acid abbreviations > String output is useful for export or conversion (B2) > Can use standard or any of the provided codon usage tables > Custom codon usage tables can be added using a table editor > Codons with ambiguous nucleotide symbols (IUPAC-IUB) are translated as appropriate when translation is unambiguous > Optional termination when stop codons are encountered 4. Peptide -> DNA translation > Peptide strings can be backtranslated, using the reverse of whichever codon usage table the user selects > Backtranslation uses IUPAC-IUB conventions for ambiguous nucleotides D. Nucleotide or peptide sequence entry > Special Editor field has autoformatting capabilities > Adjustable computer speaking supported during or after entry > Sequences can be edited, manipulated, or exported (B2) E. Data provided with stack > Codon usage patterns for about 70 organism/organelle combinations > Most mitochondrial variation in coding supported (3) > Gene maps of 11 mtDNAs plus string sequences > Gene maps and matrices for 3 green plant chloroplast DNAs F. General Features > Online help facility is available from any card in stack > References, taxon names, and sample data provided > Pulldown or popup menus or dialogs used for commands > Buttons or pulldown menus are used for stack navigation > Data fields can be locked or unlocked for text entry > Data fields shrink down or expand by clicking > Contents of fields exportable as text or printable > Codon usage tables output for any incorporated taxon > Gene maps and codon usage graphs have special printing options > Default word processor for exported text files can be chosen > PAUP/MacClade specification for exported Nexus files > PAUP/MacClade open directly from the stack > Add gene mapping or utility cards as required > All data, scripts and resources used are easily accessible > Custom XFCNs by Nigel Perry enable efficient manipulations G. Hardware and System Requirements > Requires any Macintosh that can run HyperCard 2.0 > HyperCard 2.x (2.0 or 2.1) (Widely available for Macintosh users) > Macintosh System 6.0.5 or greater (for HyperCard 2.0) > 2 or more MB Ram recommended H. Availability of current version > Anonymous ftp to um.cc.umich.edu (35.1.1.43) After connection type 'cd gdef' and 'get dnastack.hqx' Resulting file needs to be "debinhexed" This will most likely be more current than the following due to more frequent updates. > Anonymous ftp to at least the following site Ftp.Bio.Indiana.Edu (after connection: 'cd molbio/mac' and 'get DNASTACK.HQX') > EMBL file server by sending the command: GET MAC_SOFTWARE:DNATRANSLATOR.HQX to NETSERV@EMBL.BITNET Entire package is approximately 550K in binhexed form > As of 9/24/91 includes Aligner stack described below Aligner is a freeware HyperCard 2.x stack for manual alignment of sequences by D. J. Eernisse, author of DNA Translator stack. Aligner was inspired by a stack called MultiDNA by Ralph Gonzalez, but was created completely from scratch. Enhancements compared to MultiDNA include: 1) Number of sequences that can be aligned increased from 6 to 100; 2) interleave formatting and number lines automatically adjust for the number of sequences imported; 3) now accomodates sequences up to 30,000 bp or amino acid residues long; 4) imports or exports multiple sequences that are simple return-delineated strings or in 'DNA Translator' stack string format (i.e., 'Name<space>AGCTGA...<rtn>'); 5) optionally interleaves exported sequences; 6) enlarged for use on a 640 x 480 standard color or larger monitor (automatically adjusts for standard small Mac screens); 7) much faster switching between sequences to be edited; 8) can toggle between match characters (dashes) to the first sequence and no match characters; 9) speaks nucleotide/peptide characters during keyboard entry or after entry starting from the current insertion point; 10) various sequence manipulations, such as lower <-> UPPER case; 11) can add/delete gaps to/from all but the currently selected sequence; 12) align amino acids, then introduce gaps to the corresponding unaligned nucleotide strings to preserve the amino acid alignment; 13) custom menus; 14) several options to color text in the alignment including a color coding of nucleotide triplets according to their amino acid coding, with multiple alternate genetic codes supported; 15) error checking routines for comparing the nongap characters against your starting file; 16) import multiple sequences for a particular gene directly from DNA Translator stack gene mapping facility (e.g., all avaialable animal mtDNA sequences encoding cyt B gene); 17) optional help facility displays the function of fields or buttons as the cursor enters them. This stack is distributed separately or as part of the more extensive DNA Translator stack package. The best way to get the most current version of either stack is via anonymous ftp to "um.cc.umich.edu) then "cd gdef" and "get aligner.hqx" or "get dnastack.hqx" for Aligner stack only or the entire package, respectively. Then debinhex the resulting files. Aligner by itself is under 300 K uncompressed, while DNA Translator and associated files occupy about 1 MB on your hard disk (500K in compressed Binhex format). This stack is only free for any noncommercial use and is not public domain. It is copyrighted 1991,1992 by D. J. Eernisse, and uses some XFCN resources copyrighted by Nigel Perry with similar copyright restrictions (see stack script for details). For more information, contact: Douglas J. Eernisse Museum of Zoology, Univ. of Michigan, Ann Arbor, MI 48109 USA usergdef@um.cc.umich.edu or usergdef@umichum For further information, tips, trouble-shooting, and version history, see the online help facility.