Red algae comprise an anciently diverged, species-rich phylum with morphologies that span unicells to large seaweeds. Here, leveraging a rich red algal genome and transcriptome dataset, we used 298 single-copy orthologous nuclear genes from 15 red algal species to erect a robust multi-gene phylogeny of Rhodophyta. This tree places red seaweeds (Bangiophyceae and Florideophyceae) at the base of the mesophilic red algae with the remaining non-seaweed mesophilic lineages forming a well-supported sister group. The early divergence of seaweeds contrasts with the evolution of multicellular land plants and brown algae that are nested among multiple, unicellular or filamentous sister lineages. Using this novel perspective on red algal evolution, we studied the evolution of the pathways for isoprenoid biosynthesis. This analysis revealed losses of the mevalonate pathway on at least three separate occasions in lineages that contain Cyanidioschyzon, Porphyridium, and Chondrus. Our results establish a framework for in-depth studies of the origin and evolution of genes and metabolic pathways in Rhodophyta.
The ancestor of Paulinella chromatophora established a symbiotic relationship with cyanobacteria related to the Prochloroccocus/Synechococcus clade. This event has been described as a second primary endosymbiosis leading to a plastid in the making. Based on the rate of pseudogene disintegration in the endosymbiotic bacteria Buchnera aphidicola, it was suggested that the chromatophore in P. chromatophora has a minimum age of ~60 Myr. Here we revisit this estimation by using a lognormal relaxed molecular clock on the 18S rRNA of P. chromatophora. Our time estimates show that depending on the assumptions made to calibrate the molecular clock, P. chromatophora diverged from heterotrophic Paulinella spp. ~ 90 to 140 Myr ago, thus establishing a maximum date for the origin of the chromatophore.
The katydid genus Neoconocephalus is characterized by high diversity of the acoustic communication system. Both male signals and female preferences have been thoroughly studied in the past. This study used Bayesian character state reconstruction to elucidate the evolutionary history of diverse call traits, based on an existing, well supported phylogenetic hypothesis. The most common male call pattern consisted of continuous calls comprising one fast pulse rate; this pattern is the likely ancestral state in this genus. Three lines of call divergence existed among the species of the genus. First, four species had significantly slower pulse rates. Second, five species had alternating pulse periods, resulting in a double pulse rhythm. Third, several species had discontinuous calls, when pulses were grouped into rhythmically repeated verses. Bayesian character state reconstruction revealed that the double-pulse pattern likely evolved convergently five times; the slow pulse rate also evolved four times independently. Discontinuous calls have evolved twice and occur in two clades; each of which contains reversals to the ancestral continuous calls. Pairwise phylogenetically independent contrast analyses among the three call traits found no significant correlations among the character states of the different traits, supporting the independent evolution of the three call traits.
Phylogenetic trees are used by researchers across multiple fields of study to display historical relationships between organisms or genes. Trees are used to examine the speciation process in evolutionary biology, to classify families of viruses in epidemiology, to demonstrate co-speciation in host and pathogen studies, and to explore genetic changes occurring during the disease process in cancer, among other applications. Due to their complexity and the amount of data they present in visual form, phylogenetic trees have generally been difficult to render for publication and challenging to directly interact with in digital form. To address these limitations, we developed PhyloPen, an experimental novel multi-touch and pen application that renders a phylogenetic tree and allows users to interactively navigate within the tree, examining nodes, branches, and auxiliary information, and annotate the tree for note-taking and collaboration. We present a discussion of the interactions implemented in PhyloPen and the results of a formative study that examines how the application was received after use by practicing biologists — faculty members and graduate students in the discipline. These results are to be later used for a fully supported implementation of the software where the community will be welcomed to participate in its development.
Recently developed molecular methods enable geneticists to target and sequence thousands of orthologous loci and infer evolutionary relationships across the tree of life. Large numbers of genetic markers benefit species tree inference but visual inspection of alignment quality, as traditionally conducted, is challenging with thousands of loci. Furthermore, due to the impracticality of repeated visual inspection with alternative filtering criteria, the potential consequences of using datasets with different degrees of missing data remain nominally explored in most empirical phylogenomic studies. In this short communication, I describe a flexible high-throughput pipeline designed to assess alignment quality and filter exonic sequence data for subsequent inference. The stringency criteria for alignment quality and missing data can be adapted based on the expected level of sequence divergence. Each alignment is automatically evaluated based on the stringency criteria specified, significantly reducing the number of alignments that require visual inspection. By developing a rapid method for alignment filtering and quality assessment, the consistency of phylogenetic estimation based on exonic sequence alignments can be further explored across distinct inference methods, while accounting for different degrees of missing data.
Incomplete lineage sorting (ILS), modelled by the multi-species coalescent, is a process that results in a gene tree being different from the species tree. Because ILS is expected to occur for at least some loci within genome-scale analyses, the evaluation of species tree estimation methods in the presence of ILS is of great interest. Performance on simulated and biological data have suggested that concatenation analyses can result in the wrong tree with high support under some conditions, and a recent theoretical result by Roch and Steel proved that concatenation using unpartitioned maximum likelihood analysis can be statistically inconsistent in the presence of ILS. In this study, we survey the major species tree estimation methods, including the newly proposed “statistical binning” methods, and discuss their theoretical properties. We also note that there are two interpretations of the term “statistical consistency”, and discuss the theoretical results proven under both interpretations.
Phylogeneticists have long understood that several biological processes can cause a gene tree to disagree with its species tree. In recent years, molecular phylogeneticists have increasingly foregone traditional supermatrix approaches in favor of species tree methods that account for one such source of error, incomplete lineage sorting (ILS). While gene tree-species tree discordance no doubt poses a significant challenge to phylogenetic inference with molecular data, researchers have only recently begun to systematically evaluate the relative accuracy of traditional and ILS-sensitive methods. Here, we report on simulations demonstrating that concatenation can perform as well or better than methods that attempt to account for sources of error introduced by ILS. Based on these and similar results from other researchers, we argue that concatenation remains a useful component of the phylogeneticist’s toolbox and highlight that phylogeneticists should continue to make explicit comparisons of results produced by contemporaneous and classical methods.
Correction In table 1 an incorrect value was provided for the “Contigs” value for “Filamoeba nolandi” (Column 3, row 2). The corrected table is provided below: Table 1: Transcriptome statistics for Amoebozoa used in case study. Taxon ATCC Contigs SSU Bacterial Eukaryotic Unknown Genes Filamoeba nolandi 50430 21671 17 2409 12205 7057 171 Pessonella sp. […]
Since the ever-increasing availability of phylogenetic informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies are still lacking for many large groups and regions, which are necessary for comprehensive large-scale eco-phylogenetic analyses. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of supermatrix applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available on the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data was available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.