HYPHYMPI(1)

Hypothesis testing using Phylogenies (pthreads version)

Section 1 hyphy-mpi bookworm source

Description

HYPHY

NAME

hyphy - Hypothesis testing using Phylogenies (pthreads version)

DESCRIPTION

usage: hyphy or HYPHYMPI [-h] [--help][-c] [-d] [-i] [-p] [BASEPATH=directory path] [CPU=integer] [LIBPATH=library path] [USEPATH=library path] [<standard analysis name> or <path to hyphy batch file>] [--keyword value ...] [positional arguments ...]

Execute a HyPhy analysis, either interactively, or in batch mode optional flags:
-h --help

show this help message and exit

-c

calculator mode; causes HyPhy to drop into an expression evaluation until ’exit’ is typed

-d

debug mode; causes HyPhy to drop into an expression evaluation mode upon script error

-i

interactive mode; causes HyPhy to always prompt the user for analysis options, even when defaults are available

-p

postprocessor mode; drops HyPhy into an interactive mode where general post-processing scripts can be selected upon analysis completion

-m

write diagnostic messages to messages.log

optional global arguments:

BASEPATH=directory path

defines the base directory for all path operations (default is pwd)

CPU=integer

if compiled with OpenMP multithreading support, requests this many threads; HyPhy could use fewer than this but never more; default is the number of CPU cores (as computed by OpenMP) on the system

LIBPATH=directory path

defines the directory where HyPhy library files are located (default installed location is /usr/local/lib/hyphy or as configured during CMake installation

USEPATH=directory path

specifies the optional working and relative path directory (default is BASEPATH)

ENV=expression

set HBL environment variables via explicit statements for example ENV=’DEBUG_MESSAGES=1;WRITE_LOGS=1’

batch file to run

if specified, execute this file, otherwise drop into an interactive mode

analysis arguments

if batch file is present, all remaining positional arguments are interpreted as inputs to analysis prompts

optional keyword arguments (can appear anywhere); will be consumed by the requested analysis
--keyword
value

will be passed to the analysis (which uses KeywordArgument directives) multiple values for the same keywords are treated as an array of values for multiple selectors

usage examples:

Select a standard analysis from the list :

hyphy -i

Run a standard analysis with default options and one required user argument;

hyphy busted --alignment path/to/file

Run a standard analysis with additional keyword arguments

hyphy busted --alignment path/to/file --srv No

See which arguments are understood by a standard analysis

hyphy busted --help

Run a custom analysis and pass it some arguments

hyphy path/to/hyphy.script argument1 ’argument 2’

Available standard keyword analyses (located in /home/nilesh/packages/hyphy/hyphy/res/)

meme

[MEME] Test for episodic site-level selection using MEME (Mixed Effects Model of Evolution).

mh

Merge two datafiles by combining sites (horizontal merge).

mv

Merge two datafiles by combining sequences (vertical merge).

mcc

Compare mean within-clade branch length or pairwise divergence between two or more non-nested cladesd in a tree

mclk

Test for the presence of a global molecular clock on the tree using its root (the resulting clock tree is unrooted, but one of the root branches can be divided in such a way as to enforce the clock).

mgvsgy

Compare the fits of MG94 and GY94 models (crossed with an arbitrary nucleotide bias) on codon data.

mt

Select an evolutionary model for nucleotide data, using the methods of ’ModelTest’ - a program by David Posada and Keith Crandall.

fel

[FEL] Test for pervasive site-level selection using FEL (Fixed Effects Likelihood).

fubar

[FUBAR] Test for pervasive site-level selection using FUBAR (Fast Unconstrained Bayesian AppRoximation for inferring selection).

fade

[FADE] Test a protein alignment for directional selection towards specific amino acids along a specified set of test branches using FADE (a FUBAR Approach to Directional Evolution).

faa

Fit a multiple fitness class model to amino acid data.

fmm

"Fit a model that permits double (and triple) instantaneous nucleotide substitutions"

fst

Compute various measures of F_ST and (optionally) perform permutation tests.

slac

[SLAC] Test for pervasive site-level selection using SLAC (Single Likelihood Ancestor Counting).

sm

Peform a classic and structured Slatkin-Maddison test for the number migrations.

sns

Parse a codon alignment for ambiguous codons and output a complete list/resolutions/syn and ns counts by sequence/position

sw

Perform a sliding window analysis of sequence data.

sa

Perform a phylogeny reconstuction for nucleotide, protein or codon data with user-selectable models using the method of sequential addition.

sbl

Search an alignment for a single breakpoint.

spl

Plot genetic distances (similarity) of one sequence against all others in an alignment, using a sliding window. Optionally, determine NJ-based clustering and bootstrap support in every window. This is a HyPhy adaptation of the excellent (but Windows only tool) SimPlot (and/or VarPlot) written by Stuart Ray (http://sray.med.som.jhmi.edu/SCRoftware/simplot/)

busted

[BUSTED] Test for episodic gene-wide selection using BUSTED (Branch-site Unrestricted Statistical Test of Episodic Diversification).

bgm

[BGM] Apply Bayesian Graphical Model inference to substitution histories at individual sites.

bva

Run a selection analysis using a general discrete bivariate (dN AND dS) distribution; the appropriate number of rate classes is determined automatically.

brp

Interpret bivariate codon rate analysis results.

bsel

Split a tree into two clades (compartments) and a separating branch and test for equality of dN/dS between compartments and for selection along the separating branch using a series of Likelihood Ratio Tests.

bst

Use the improved branch-site REL method of Yang and Nielsen (2005) to look for episodic selection in sequences.

bt

Test whether a branch (or branches) in the tree evolves under different dN and dS than the rest of the tree.

absrel

[aBSREL] Test for lineage-specific evolution using the branch-site method aBS-REL (Adaptive Branch-Site Random Effects Likelihood).

acd

Analyse codon data with a variery of standard models using given tree.

ad

Analyse nucleotide or aminoacid data with a variery of standard models using given tree.

adn

Analyse di-nucleotide data with a variery of standard models using given tree.

afd

Analyse nucleotide data with a variery of standard models using given tree, estimating equilibrium frequencies as parameters

ana

Run a selection analysis.

ai

Peter Simmonds’ Association Index (AI).

relax

[RELAX] Test for relaxation of selection pressure along a specified set of test branches using RELAX (a random effects test of selection relaxation).

red

Replace sufficiently close sequence with their MRCA

rpc

Interpret analysis results.

rmv

Remove sequences with stop codons from the data.

rble

Use a series of random effects branch-site models to perform robust model-averaged branch length estimation under a codon model with episodic selection.

rclk

Test for the presence of a global molecular clock on the tree. The tree is rooted at every possible branch.

rr

Use relative rate test on three species and a variety of standard models

rrt

Use relative ratio test on 2 datasets and a variety of standard models

prime

[PRIME]

protein

Compare the fit of several amino-acid substitution models to an alignment using AIC and c-AIC.

prr

Using the model and the outgroup provided by the user, perform relative rate tests with all possible pair of species from the data file.

prrti

Given a list of files (and optionally genetic code tables), perform relative ratio tests on all possible pair of the data files.

pdf

Read sequence data, select a contiguous subset of sites and save it to another datafile.

phb

Run an example file from our book chapter in ’The Phylogentic Handbook’ (2nd edition).

psm

Test for positive selection using the approach of Nielsen and Yang, by sampling global dN/dS from an array of distributions, and using Bayesian posterior to identify the sites with dN/dS>1. Multiple subsets of one data set with shared dN/dS.

parris

A PARtitioning approach for Robust Inference of Selection (written by K. Scheffler)

contrast-fel

[Contrast-FEL] "Perform a site-level test for differences in selective pressures between predetermined sets of branches."

conv

Translate an in-frame codon alignment to proteins.

corr

Assess the correlation between phylogenetic and compartment segregation using generalized correlation coefficients and permutation tests.

cod

Compare all 203 reversible nucleotide models composed with MG94 to extend them to codon data, and perform LRT and AIC model selection.

cmp

Use a series of LR tests to decide if dN and dS rate distributions are the same or different between two codon alignments.

caln

Align coding sequences to reference (assuming star topology) using a codon-based dynamic programming algorithm (good for fixing multiple frameshifts). Designed with within-patient HIV sequences in mind.

clg

Remove ’gappy’ sites from alignments based on a user-specified gap threshold.

cln

Convert sequence names to HyPhy valid identifiers if needed and replace stop codons with gaps in codon data if any are present.

clsr

Partition sequences into clusters based on a distance matrix.

clst

Apply clustering methods for phylogeny reconstruction (UPGMA,WPGMA,complete or minimal linkage) to nucleotide, protein and codon data, using MLE of pairwise distances with user-selectable models. These methods produce trees with global molecular clock.

leisr

Infer relative evolutionary rates on a nucleotide or protein alignment, in a spirit similar to Rate4Site (PMID: 12169533).

lz

Compute Lempel-Ziv complexity and entropy of (possibly unaligned) sequences

lclk

Test for the presence of a local molecular clock. Every subtree of the given tree is subjected to the clock constraint, while the remainder of the tree is free of the clock constraint.

lht

A Likelihood Ratio Test to detect conflicting phylogenetic signal Huelsenbeck and Bull, 1996. [Contributed by Olivier Fedrigo].

tc

Test whether a group of sequences in a sample cluster together

ts

Perform an exhaustive tree space search for nucleotide or protein data with user-selectable models. Should only be used for data sets with less than 10 taxa!

dtr

Read sequence data (#,PHYLIP,NEXUS) and convert to a different format

dist

Generate a pairwise sequence distance matrix in PHYLIP format.

kh

Perform a Kishino-Hasegawa test on two competing phylogenies

ub

Obtain an upper bound on the likelihood score of a dataset.

nuc

Compare all 203 reversible nucleotide models and perform LRT and AIC model selection.

nj

Perform a phylogeny reconstuction for nucleotide, protein or codon data with user-selectable models using the method of neighbor joining.

ny

Test for positive selection using the approach of Nielsen and Yabg, by sampling global dN/dS from an array of distributions, and using Bayesian posterior to identify the sites with dN/dS>1.

gard

[GARD] Screen an alignment using GARD (requires an MPI environment).

AUTHOR

This manpage was written by Nilesh Patra for the Debian distribution and
can be used for any other usage of the program.