BEAST, Bayesian evolutionary analysis sampling trees
Alexei Drummond, Bioinformatics, Department of Computer Science
BEAST is a cross-platform program for Bayesian MCMC analysis of molecular sequences. It is entirely oriented towards rooted, time- measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to explore tree space in such a way that each tree is weighted proportional to its posterior probability. We include a simple to use user-interface program for setting up standard analyses and a suit of programs for analysing the results.
BEAST 2 (www.beast2.org) develops new features to allow resuming an MCMC chain, and real time tracking of ESSs while running a chain.
It is easily extendible, for example, the BEAST 2 packages supports multi-chain MCMC, some experimental likelihood calculations that are potentially faster than the base implementation, and a spread sheet GUI for manipulating models.
BEAST 2 has a plug-in facility. BEAST 2 packages provide more models, such as
- Birth Death Serial Skyline Model
- Reversible jump substitution model
- SNAPP Species trees from SNP and AFLP analysis
- Substitution Bayesian Model Averaging
- Sampled Ancestors
Output of a BEAST analysis of viral samples taken over different years — samples from the same year tend to cluster together. Blue line represents a summary tree of the posterior sample. Green lines indicate uncertainty in the tree distribution. It illustrates how both topology and node height estimates vary in the posterior sample.
They can be easily installed/un-installed from BEAUti (BEAST’s GUI). More packages are being developed (http://www.beast2.org/beagle-beast-2-in-cluster) .
BEAST developers use the NeSI Pan cluster for running simulation studies to validate new models, and run large analyses with questions of considerable scientific interest. For example, BEAST was used for dating the split of Cichlid species between Africa and South America, determining the origin of Indo-European languages (see http://language.cs.auckland.ac.nz), and analysing epidemiological histories of diseases such as HIV and influenza.
See more case study projects
Our Voices: using innovative techniques to collect, analyse and amplify the lived experiences of young people in Aotearoa
Painting the brain: multiplexed tissue labelling of human brain tissue to facilitate discoveries in neuroanatomy
Detecting anomalous matches in professional sports: a novel approach using advanced anomaly detection techniques
Benefits of linking routine medical records to the GUiNZ longitudinal birth cohort: Childhood injury predictors
Using a virtual machine-based machine learning algorithm to obtain comprehensive behavioural information in an in vivo Alzheimer’s disease model
Mapping livability: the “15-minute city” concept for car-dependent districts in Auckland, New Zealand
Travelling Heads – Measuring Reproducibility and Repeatability of Magnetic Resonance Imaging in Dementia
Novel Subject-Specific Method of Visualising Group Differences from Multiple DTI Metrics without Averaging
Re-assess urban spaces under COVID-19 impact: sensing Auckland social ‘hotspots’ with mobile location data
Aotearoa New Zealand’s changing coastline – Resilience to Nature’s Challenges (National Science Challenge)
Proteins under a computational microscope: designing in-silico strategies to understand and develop molecular functionalities in Life Sciences and Engineering
Coastal image classification and nalysis based on convolutional neural betworks and pattern recognition
Determinants of translation efficiency in the evolutionarily-divergent protist Trichomonas vaginalis
Measuring impact of entrepreneurship activities on students’ mindset, capabilities and entrepreneurial intentions
Using Zebra Finch data and deep learning classification to identify individual bird calls from audio recordings
Automated measurement of intracranial cerebrospinal fluid volume and outcome after endovascular thrombectomy for ischemic stroke
Using simple models to explore complex dynamics: A case study of macomona liliana (wedge-shell) and nutrient variations
Fully coupled thermo-hydro-mechanical modelling of permeability enhancement by the finite element method
Modelling dual reflux pressure swing adsorption (DR-PSA) units for gas separation in natural gas processing
Molecular phylogenetics uses genetic data to reconstruct the evolutionary history of individuals, populations or species
Wandering around the molecular landscape: embracing virtual reality as a research showcasing outreach and teaching tool