{"id":4950,"date":"2018-12-01T00:01:21","date_gmt":"2018-12-01T00:01:21","guid":{"rendered":"https:\/\/www.palaeontologyonline.com\/?p=4950"},"modified":"2018-11-29T23:05:33","modified_gmt":"2018-11-29T23:05:33","slug":"deducing-the-tree-of-life","status":"publish","type":"post","link":"https:\/\/www.palaeontologyonline.com\/?p=4950","title":{"rendered":"Patterns in Palaeontology \u2014 Deducing the tree of life"},"content":{"rendered":"<p>by <a href=\"https:\/\/34.32.27.218\/articles\/tag\/russell-j-garwood\/\">Russell Garwood<\/a>*<sup>1<\/sup><\/p>\n<h2>Introduction<\/h2>\n<p><i>\u201cIncreasing knowledge leads to triumphant loss of clarity\u201d \u2014 Palaeontologist Alfred Romer<\/i><\/p>\n<p>Some areas of life and human endeavour have the luxury of certainty. Along these paths of discovery, there are things we can know to be true or false. In others, it is impossible to assess the concept of truth: it can\u2019t be established, or just isn\u2019t a consideration. And between these extremes is a whole mess of important stuff. Palaeontology almost always lies somewhere on this gradation. Researchers studying past life are often juggling multiple layers of uncertainty. We try to balance the need to say something useful \u2014 something with meaning, that moves a field and its consensus closer to the truth \u2014 with the risk of over-interpreting our data. If the data is too incomplete, we could be moving closer or further away from the truth, and wouldn\u2019t be able to tell. As such, palaeontologists have to draw a line somewhere, and where might differ between people. In other words, palaeontology is very much a human endeavour. It is subject to paradigm shifts in our understanding brought about by new discoveries and methods, but is also influenced by the human nature of those who practise it as fashions and traditions change \u2014 normally in search of a better way of doing things. Often, these shifts are driven by arguments that explode onto the scene in which proponents of different ideas \u2014 held with passion and fervour \u2014 disagree about something harder to pin down and less concrete than a new fossil. This is a position in which palaeontologists who enjoy trying to work out the shape of the tree of life currently find ourselves. Two competing approaches to working out the relationships between different species \u2014 their <a href=\"https:\/\/34.32.27.218\/glossary\/p\/phylogenetics\/\">phylogeny<\/a> \u2014 are battling it out in the scientific literature. It\u2019s exciting, engaging and undeniably driven by a desire to improve understanding of the natural world in all its complexity. But it\u2019s also one of those situations in which working out what is closest to the truth can be challenging. Before I write about it any further, we need some context. This article provides both the history of, and current debates surrounding, how we deduce the shape of the tree of life.<\/p>\n<h2>From taxonomy to cladistics<\/h2>\n<p>Carl Linnaeus was born in 1707 in R\u00e5shult, Sweden. He was a physician and naturalist, working in both botany and zoology. He is now remembered for the system of classification that bears his name. This <a href=\"https:\/\/34.32.27.218\/glossary\/l\/linnean-taxonomic-system\/\">Linnaean taxonomy<\/a> gives species a binomial name (sometimes known as a Latin name although not always Latin in origin), and then places them into decreasingly specific levels. So, the beautiful regal jumping spider (figure 1) is <i>Phidippus regius<\/i>: <i>regius<\/i> is the species and <i>Phidippus<\/i> is the genus.<\/p>\n<p>This genus is itself in a family (Salticidae, the jumping spiders), which is in an order (Araneae, the spiders), in a class (<a href=\"https:\/\/34.32.27.218\/glossary\/a\/arachnids\/\">Arachnida<\/a>, which also includes scorpions, mites and a host of other creepy crawlies). That is in the phylum <a href=\"https:\/\/34.32.27.218\/glossary\/a\/arthropods\/\">Arthropoda<\/a> with hexapods (insects and their kin), myriapods (millipedes, centipedes and a couple of other groups), crustaceans (crabs, lobsters, woodlice \u2014 and many other collections of mostly marine creatures) and the extinct <a href=\"https:\/\/34.32.27.218\/articles\/2013\/fossil-focus-trilobites\/\">trilobites<\/a>. And this is all within the animals (kingdom Animalia).<\/p>\n<figure id=\"attachment_4954\" aria-describedby=\"caption-attachment-4954\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4954\" rel=\"attachment wp-att-4954\"><img fetchpriority=\"high\" decoding=\"async\" class=\"size-large wp-image-4954\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_01-1024x708.jpg\" alt=\"Figure 1 - The jumping spider Phiddipus regius\" width=\"740\" height=\"512\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_01-1024x708.jpg 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_01-300x207.jpg 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_01-768x531.jpg 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_01-100x70.jpg 100w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_01.jpg 1280w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4954\" class=\"wp-caption-text\">Figure 1\u00a0\u2014 The regal jumping spider <i>Phidippus regius<\/i>. Photography by Thomas Shahan (published under a CC BY 2.0 license).<\/figcaption><\/figure>\n<p>So far, so good? Well, not quite. This system has been used ever since Linnaeus\u2019s time to categorize all elements of the tree of life, so it is certainly useful. But actually, we can think of the tree of life as a nested series of groups derived from a common ancestor. The animals, for example, have a really deep split between the <a href=\"https:\/\/34.32.27.218\/glossary\/s\/sponges\/\">sponges<\/a> \u2014 which don\u2019t really have tissues or organs \u2014 and all other animals. We think (this is actually rather controversial, and another ongoing debate). The rest of the animals can then be broadly split into those without bilateral (two-way) symmetry, and those with it (which typically also have a mouth, anus and through-gut, for example). Those, in turn, can probably be split into two major groups. In fact, every time the tree splits, we get another two groups, which share a common ancestor (a <a href=\"https:\/\/34.32.27.218\/glossary\/c\/clade\/\">clade<\/a>, figure 2).<\/p>\n<figure id=\"attachment_4956\" aria-describedby=\"caption-attachment-4956\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4956\" rel=\"attachment wp-att-4956\"><img decoding=\"async\" class=\"size-large wp-image-4956\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_02-1024x260.png\" alt=\"Figure 2 \u2014 An example of an evolutionary tree, or cladogram, comprising five species. Points at which splits occur are called nodes; nodes are linked to each other, or the species themselves (terminals), by branches. Coloured in green are two groups that form clades \u2014 they share a common ancestor at the node marked with a red dot. On the right are two groups that don\u2019t form a clade \u2014 they share a common ancestor, but don\u2019t comprise all descendants of that ancestor.\" width=\"740\" height=\"188\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_02-1024x260.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_02-300x76.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_02-768x195.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_02.png 1600w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4956\" class=\"wp-caption-text\">Figure 2 \u2014 An example of an evolutionary tree, or cladogram, comprising five species. Points at which splits occur are called nodes; nodes are linked to each other, or the species themselves (terminals), by branches. Coloured in green are two groups that form clades \u2014 they share a common ancestor at the node marked with a red dot. On the right are two groups that don\u2019t form a clade \u2014 they share a common ancestor, but don\u2019t comprise all descendants of that ancestor.<\/figcaption><\/figure>\n<p>This is a good way of thinking about the tree of life, because it reflects the evolutionary history of all of these groups (which is often \u2014 but not always \u2014 what people are looking at when they study animals). It is also a really useful way to communicate relationships at a huge variety of levels, whether you\u2019re trying to understand the deepest splits in the animals, in the arachnids or in the spiders. It doesn\u2019t, however, map directly on to Linnaean taxonomy, because every clade \u2014 all the way back down to our jumping spider \u2014 could have its own name (figure 3), and you can\u2019t split that nested series of groups into a limited number of levels like that of Linnaeus\u2019s scheme.<\/p>\n<figure id=\"attachment_4957\" aria-describedby=\"caption-attachment-4957\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4957\" rel=\"attachment wp-att-4957\"><img decoding=\"async\" class=\"wp-image-4957 size-large\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_03-1024x486.png\" alt=\"Figure 3 \u2014 An evolutionary tree (cladogram) showing a series of nested clades to which jumping spiders (far right) belong. These include the spiders (Araneae); a group of arachnids that includes spiders and whips spiders (the Tetrapulmonata); the arachnids; and the arthropods, which also includes crustaceans, millipedes and centipedes, and insects. The broader clade in which these all sit is the protostomes (which includes, for example, molluscs), and all of these clades are bilaterian animals. The clades that are also Linnaean ranks are shown in normal red type, whereas those that are unranked are shown in bold black type.\" width=\"740\" height=\"351\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_03-1024x486.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_03-300x143.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_03-768x365.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_03-548x260.png 548w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_03.png 1600w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4957\" class=\"wp-caption-text\">Figure 3 \u2014 An evolutionary tree (<a href=\"https:\/\/34.32.27.218\/glossary\/e\/evolutionary-tree\/\">cladogram<\/a>) showing a series of nested clades to which jumping spiders (far right) belong. These include the spiders (Araneae); a group of arachnids that includes spiders and whips spiders (the Tetrapulmonata); the arachnids; and the arthropods, which also includes crustaceans, millipedes and centipedes, and insects. The broader clade in which these all sit is the protostomes (which includes, for example, molluscs), and all of these clades are bilaterian animals. The clades that are also Linnaean ranks are shown in normal red type, whereas those that are unranked are shown in bold black type.<\/figcaption><\/figure>\n<p>In response to this, a shift in approach kicked off in the 1960s. As one might expect, there were competing methods: for example, phenetics, which groups organisms by their anatomical similarity, versus schools of thought that focus on the evolutionary history of groups. Although the former approach can be helpful in answering some questions, it was the latter that caught on. German entomologist Willi Hennig was a key figure in this period and in the establishment of classifications that reflect evolutionary history \u2014 although the roots of this type of thought lie much deeper. Hennig, born in 1913, published and publicized a scheme that he called phylogenetic systematics. This classifies organisms on the basis of clades that are defined by shared features \u2014 such as the through-gut and symmetry of bilaterian animals. This is now commonly referred to as cladistics (although the meaning of this phrase has subtly shifted since it was first coined).<\/p>\n<h2>Adding computers to cladistics<\/h2>\n<p>Cladistics is an attractive approach for understanding the evolutionary history of a group of organisms, but it is also very challenging if the only tools for building your phylogenies are a pen and paper. People have been visualizing the history life in the form of a tree since before the publication of Charles Darwin\u2019s <i>On The Origin Of Species<\/i> in 1859, and have done so increasingly since; phylogenetic systematics is a logical extension of this. Traditionally, trees were constructed by studying the organisms to include, then drawing inferences from their anatomy. This is difficult for other researchers to reproduce, and tree shape can result from \u2014 or necessitate \u2014 a researcher placing particular importance on some elements of a group\u2019s anatomy over others.<\/p>\n<p>These issues have, to an extent, been overcome through the advent and application of powerful modern computers. Researchers generally establish phylogenies for fossils \u2014 and, up until the 1990s, commonly living species \u2014 by coding their anatomy. You study an animal (or member of any other group) and list what is known as a series of characters: for example, how many eyes they have (for our jumping spider, eight) or the number of legs. It is also possible to include measurements or ratios (called continuous characters). It\u2019s often good to think of characters as a way to test whether two features might be related. As an example, we might code both moths and spiders as capable of making silk \u2014 but the many other anatomical differences between these creatures should still keep them in separate groups. With all of this data in hand, we can then try to deduce a tree. We have generally done this since the 1970s using an approach called maximum parsimony, described in glorious and unflinching depth in <a href=\"https:\/\/34.32.27.218\/articles\/2012\/patterns-in-palaeontology-parsimony-and-palaeobiology\/\">this Palaeontology [online]<\/a> article. The basic aim is quite straightforward: to find the trees that require the smallest number of character changes between clades. The underlying principle of maximum parsimony is that the fewer assumptions required, the better (an approach sometimes called Occam\u2019s razor; figure 4).<\/p>\n<figure id=\"attachment_4959\" aria-describedby=\"caption-attachment-4959\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4959\" rel=\"attachment wp-att-4959\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-4959 size-large\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_04-1024x316.png\" alt=\"Figure 4 \u2014 Two possible trees of five species, A\u2013E. These have characters represented as empty and filled shapes. Assuming that empty is the original condition for all, the left tree requires one fewer state change than that on the right, and as such is more parsimonious.\" width=\"740\" height=\"228\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_04-1024x316.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_04-300x92.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_04-768x237.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_04.png 1600w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4959\" class=\"wp-caption-text\">Figure 4 \u2014 Two possible trees of five species, A\u2013E. These have characters represented as empty and filled shapes. Assuming that empty is the original condition for all, the left tree requires one fewer state change than that on the right, and as such is more parsimonious.<\/figcaption><\/figure>\n<p>How this works practically is a tiny bit more complex. The collection of all possible arrangements of trees for a set of species is sometimes referred to as tree space, and we have to search this. We start with a random tree, count the number of changes of characters that it necessitates, then change the tree shape in some way, count again, and repeat until we are confident that we have found the arrangements of groups (whether that be one, or several, trees) that require the least number of character changes. The reason we search like this, rather than trying every possible tree, is that tree space is vast; for twenty species, there are 2.22 \u00d7 1020 possible rearrangements. Once you hit 50 species, there are more possible shapes than there are atoms in the Universe. The scale of this task thus calls for computer-based methods.<\/p>\n<p>There are several approaches for searching tree space for those shapes with the smallest number of character changes, but we would hope that they all find the same trees. There can, of course, be multiple trees that imply the same (smallest) number of character changes. In these cases, we summarize them by creating a consensus tree: one which shows all the relationships they agree on, but collapses other relationships (figure 5). This approach of searching tree space and finding the most parsimonious trees has allowed researchers to deduce ever bigger trees (phylogenies) from larger data sets since the 1970s, as tools have developed.<\/p>\n<figure id=\"attachment_4960\" aria-describedby=\"caption-attachment-4960\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4960\" rel=\"attachment wp-att-4960\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-4960 size-medium\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_05-300x258.png\" alt=\"Figure 5 \u2014 How we create a consensus tree to summarize data. If the two trees on the left were the most parsimonious trees for an analysis, the consensus (right) would collapse those relationships that differ between them, but keep those they have in common.\" width=\"300\" height=\"258\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_05-300x258.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_05-768x660.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_05-1024x881.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_05.png 1600w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-4960\" class=\"wp-caption-text\">Figure 5 \u2014 How we create a consensus tree to summarize data. If the two trees on the left were the most parsimonious trees for an analysis, the consensus (right) would collapse those relationships that differ between them, but keep those they have in common.<\/figcaption><\/figure>\n<p>The advantages of cladistics over what came before are that tree searches and their results are reproducible, and that all the assumptions that have gone into building a tree are documented. This is good. But, as before, many researchers believe this approach has imperfections. It is clear from studying the natural world that evolution doesn\u2019t always follow the smallest number of character changes. To choose two examples: snakes evolved from limbed ancestors, rather than those without limbs; and animals have moved from the sea to land (and back again) repeatedly.<\/p>\n<h2>Molecules and models<\/h2>\n<p>Since the late 1980s, when looking at living organisms, we have been able to use <a href=\"https:\/\/34.32.27.218\/glossary\/d\/dna\/\">DNA<\/a> as well as anatomy to deduce their relationships. The principle is similar to that we\u2019ve already seen: DNA is just another form of data, albeit one comprised of sequences of four nucleotides (adenine, thymine, guanine and cytosine). The more closely related organisms are, the more similarities we find in their DNA (in the same way that closely related organisms tend to have similar anatomy). At this molecular level, when species evolve as distinct lineages, they do so through mutations in their DNA, and the longer it has been since two species split (that is, the more distantly related they are), the more mutations will have accrued. Maximum parsimony can struggle here, however. Just as marine reptiles and marine mammals have both evolved to have flippers, despite not being closely related, there are distinct patterns in the changes we see in DNA, which can make sequences start to look alike for distantly related species. This is, in part, because there are just four options at any point in a snippet of DNA, but also because even within one strand, different parts serve very different roles and some don\u2019t really affect the nature of the organism. All this means that using parsimony can start to cluster distantly related species (those on the ends of long branches of the phylogeny, along which lots of DNA mutations have occurred). When this happens, the more molecular data you set to a task, the stronger this incorrect species clustering pattern is. You might ask how we know it is incorrect, but there are normally other lines of evidence pointing towards this mistake when it occurs.<\/p>\n<p>Because of this, researchers creating molecular phylogenies \u2014 trees built using DNA \u2014 have started using model-based approaches. These have become more common in recent years, in part because computers have become powerful enough to implement them, but also because we can now create a realistic model for how DNA evolves. Model-based approaches come in a number of flavours, but today I want to introduce just one, which has started to be applied to morphology in the past decade. More of that soon. This approach is called Bayesian phylogenetics. It\u2019s named after Thomas Bayes, an English minister and statistician who worked on a theory of probability that eventually took his name \u2014 but was actually published by a colleague, on the basis of Bayes\u2019 notes, after his death. Bayes\u2019 theorem, in its simplest form, allows us to deduce the probability of something \u2014 say, an event, or the shape of an evolutionary tree \u2014 given prior knowledge of the conditions related to it. In the case of the tree, this knowledge could be the sequences of DNA from the species in a phylogeny, and a model of how their nucleobases change. This amounts to the probability over time of switches between any of the nucleobases \u2014 something that can be mapped (or modelled) on to a tree shape, and the probability of that tree can then be quantified. This is called the posterior probability. The next obvious question is, how do we actually use this to derive a tree? Well, that\u2019s achieved using an algorithm called a Markov chain Monte Carlo (MCMC), which samples the posterior-probability distribution of possible trees.<\/p>\n<p>Let\u2019s break this down and explain what that actually means. To do so, we have to consider changing the trees \u2014 the relationships and length of branches (the amount of change that has occurred along them). This gives us a space we can explore that might be best imagined as a rugged, mountainous landscape. The <i>x<\/i> and <i>y<\/i> coordinates of this space \u2014 the position on a map of the terrain \u2014 could be thought of as the tree shape, and the height of the landscape is the posterior probability of the trees at that point given the data and the model of evolution. An MCMC analysis explores this landscape: it starts from a random place, and repeatedly changes the tree. It then accepts a new tree if its posterior probability is higher, or a little lower, than the previous one (that is, the change from the last tree allows the analysis to climb up one of our imaginary mountains or stay about level). If the posterior probability of a tree is much lower than what came before (that is, it takes us downhill), then the MCMC discards that tree, and sticks with the previous one. This process is then repeated, hundreds of thousands to millions of time.<\/p>\n<p>Eventually, by doing this, the algorithm reaches an equilibrium: it is just wandering around the same area over and over again, and visiting the higher area more often (in fact, how often it visits each area is proportional to their posterior probability; figure 6). Thus, our route over this landscape represents the most probable trees, but also takes into account uncertainty in the data. If we take all of the trees we\u2019re wandering over, and create a summary of them, this is a good way of deriving the relationships between species given their data and our model of evolution, while incorporating uncertainty. It\u2019s a well-established approach when using DNA, and is widely used to work out the relationships between living species.<\/p>\n<figure id=\"attachment_4961\" aria-describedby=\"caption-attachment-4961\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4961\" rel=\"attachment wp-att-4961\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-4961\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_06-1024x216.png\" alt=\"Figure 6 \u2014 The inner workings of an MCMC analysis. The two figures on the left show the posterior probabilities of a range of possible trees on the x axis, and the probabilities of these trees on the y axis. The far left shows any single iteration of the algorithm \u2014 the current tree might be the star coloured in green, and a change to the tree shape might improve the posterior probability (and move the tree to A). Any such change will be accepted. Another change might move the tree towards a lower posterior probability (B). Such changes would only occasionally be accepted. If you do this repeatedly, as shown in the middle figure, from a starting point marked by the blue star, then eventually the algorithm will sample the areas of highest probability the most, and we can summarize those trees. Because there are actually lots of dimensions here, we can think of the trees as two coordinates. The figure on the right shows this, with the peaks coming out of the page towards us. The exploration by the MCMC chain from the middle panel could be represented equally well as that on the right.\" width=\"740\" height=\"156\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_06-1024x216.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_06-300x63.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_06-768x162.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_06.png 1600w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4961\" class=\"wp-caption-text\">Figure 6 \u2014 The inner workings of an MCMC analysis. The two figures on the left show the posterior probabilities of a range of possible trees on the x axis, and the probabilities of these trees on the y axis. The far left shows any single iteration of the algorithm \u2014 the current tree might be the star coloured in green, and a change to the tree shape might improve the posterior probability (and move the tree to A). Any such change will be accepted. Another change might move the tree towards a lower posterior probability (B). Such changes would only occasionally be accepted. If you do this repeatedly, as shown in the middle figure, from a starting point marked by the blue star, then eventually the algorithm will sample the areas of highest probability the most, and we can summarize those trees. Because there are actually lots of dimensions here, we can think of the trees as two coordinates. The figure on the right shows this, with the peaks coming out of the page towards us. The exploration by the MCMC chain from the middle panel could be represented equally well as that on the right.<\/figcaption><\/figure>\n<h2>Morphology and models<\/h2>\n<p>So why \u2014 I am sure you are wondering by this point \u2014 have I subjected you to all of this hardcore phylogenetics in an article for Palaeontology [online]? It\u2019s because in recent years, these methods have started to have an impact on palaeontology. We can\u2019t recover the DNA of fossils so, to use Bayesian approaches on extinct organisms, we need a model for the evolution of anatomy. This will let us work out the posterior probability of a tree given morphological character data (the same data that we might have gathered for a parsimony-based analysis). Now, this is pretty tough, and currently for morphological Bayesian phylogenetic analyses we use something called the Lewis or MK model, in which switches between any character states (in any direction) are equally likely. This is an assumption, but it does allow Bayesian MCMC approaches to be used for fossils. Fossils often lack data, and so using Bayesian analysis is quite attractive, because it incorporates uncertainty \u2014 we can think of it, perhaps, as a slightly more cautious way of deriving a tree. It has a few other benefits, as well. But then we get to another tricky problem: how do we actually assess which approach is better, parsimony or Bayesian? We don\u2019t have the true tree to check them against (otherwise we wouldn\u2019t need to do this), and DNA-based analyses have their own potential issues, so we can\u2019t necessarily treat those as correct.<\/p>\n<figure id=\"attachment_4962\" aria-describedby=\"caption-attachment-4962\" style=\"width: 740px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/34.32.27.218\/?attachment_id=4962\" rel=\"attachment wp-att-4962\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-4962 size-large\" src=\"http:\/\/34.32.27.218\/wp-content\/uploads\/2018\/12\/Figure_07-1024x486.png\" alt=\"Figure 7 \u2014 A figure from a recent simulation study by O'Reilly and colleagues. The y axis shows the distance from the true tree (higher is worse), and the x axis shows the number of nodes (towards the right is more, indicating higher precision). The colours show how many derived trees sit at any position, with red the most. All this shows that parsimony approaches tend to be higher on the y axis, and further right \u00a0on the x axis; that means that they are generally less correct but also more resolved. Figure modified from O'Reilly et al. (2016; original published under a CC BY 4.0 license).\" width=\"740\" height=\"351\" srcset=\"https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_07-1024x486.png 1024w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_07-300x142.png 300w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_07-768x364.png 768w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_07-548x260.png 548w, https:\/\/www.palaeontologyonline.com\/wp-content\/uploads\/2018\/12\/Figure_07.png 1600w\" sizes=\"(max-width: 740px) 100vw, 740px\" \/><\/a><figcaption id=\"caption-attachment-4962\" class=\"wp-caption-text\">Figure 7 \u2014 A figure from a recent simulation study by O&#8217;Reilly and colleagues. The <i>y<\/i> axis shows the distance from the true tree (higher is worse), and the <i>x<\/i> axis shows the number of nodes (towards the right is more, indicating higher precision). The colours show how many derived trees sit at any position, with red the most. All this shows that parsimony approaches tend to be higher on the <i>y<\/i> axis, and further right \u00a0on the<i> x<\/i> axis; that means that they are generally less correct but also more resolved. Figure modified from <a href=\"http:\/\/rsbl.royalsocietypublishing.org\/content\/12\/4\/20160081\">O&#8217;Reilly et al.<\/a> (2016; original published under a CC BY 4.0 license).<\/figcaption><\/figure>\n<p>The past few years have been exciting in the world of morphology and phylogenetics because a slew of papers have used simulations to ask this very question. Simulations allow us to take a tree and generate data that reflects its shape. In this situation, we have both the true tree and data that reflects it. If we use the data and parsimony, Bayesian and other approaches to try to reconstruct the tree, we can compare the derived tree with the truth \u2014 and ultimately work out which way of building trees is better. We hope. But the devil is always in the details, and it turns out that researchers have impassioned views about which approach should be used, so there has been a heated debate. A series of papers have used more and more complex models of molecular evolution (increasingly far removed from the Lewis model of morphological evolution) to generate data onto trees \u2014 see the papers led by Wright, O\u2019Reilly or Puttick in Further Reading. These articles have shown that Bayesian phylogenetics has an edge over parsimony (figure 7): the take-home message has been that parsimony-based methods are less accurate than Bayesian (they are more different from the true tree), but also that parsimony methods are more precise (they resolve more relationships in general \u2014 but of course, that\u2019s not particularly useful if those relationships are not correct!).<\/p>\n<p>This suggests that palaeontologists should build their trees using Bayesian inference. But enter parsimony proponents. Argentinian arachnologist and parsimony-software developer Pablo Goloboff and his colleagues have penned replies to the above papers, calling into question their narrative. One thrust of the argument is that the models used to generate data (and then deduce trees to compare methods) favour Bayesian over parsimony. Adding spice to the mix is the suggestion that the methods used to compare the similarity of derived and true trees are also very sensitive to particular types of difference, and that more methods should be used. This debate is continuing \u2014 the authors of the earlier papers have responded to the criticisms, and while I have been writing this very article a new paper from Pablo Goloboff has appeared highlighting the lack of realism of the Lewis model of morphological evolution. What does this all mean? Unfulfilling as it is, I don\u2019t think there is a conclusion in sight \u2014 yet. But I think this situation does allow us to make some interesting observations about how we do science.<\/p>\n<h2>Human nature and science<\/h2>\n<p>A really interesting factor in all of this is how very human everything is getting \u2014 not surprising given that researchers are humans, but an excellent illustration of how science may strive for objectivity, but other forces remain in play. In science, as in other human endeavours, cliques and fashions can develop and disappear, and everything is swayed by human nature. This is especially noticeable in an episode such as this, where differences are subtle and truth is hard to pin down. One outcome is that strong opinions form, and the defence of a preferred technique can become impassioned. As an example, the Goloboff <i>et al<\/i>. (2017) paper contains some remarkably strongly worded statements. One is:<\/p>\n<p><i>Although they generated their data sets with models specifically chosen to make Bayesian methods perform better \u00a0than parsimony, Wright and Hillis (2014), O\u2019Reilly et al. (2016) and Puttick et al. (2017) asserted, with typical grandiloquence, that Bayesian methods are superior to parsimony in general.<\/i><\/p>\n<p>The contents of this statement can be debated, yet you wouldn\u2019t guess that from the words.<\/p>\n<p>And all this is taking place in a community of researchers, which will affect what happens now. While arguments are raging in some circles, others are marked by inertia. Changing how people infer trees necessitates teaching them new techniques. It requires \u2018traditional\u2019 approaches to be abandoned, and researchers must reassess the body of knowledge that we have built on top of the relationships constructed using parsimony. It could be that Bayesian techniques will not resolve some relationships at all. If this is the case, is it better to have some hypothesis to test when new fossils are discovered, constructed using parsimony but potentially wrong, or is it best to conclude that we just don\u2019t have enough data yet? Add to this dilemma the fact that Bayesian versus parsimony doesn\u2019t have a cut and dried answer. You then have the question of when would or should we make the switch to mainly using Bayesian \u2014 how certain do we need to be that it is better? Will there be a parallel of the cladistics takeover for Bayesian? Or will this all fizzle out if we can\u2019t work out which approach is better?<\/p>\n<p>I don\u2019t have any answers to these questions, but it certainly makes trying to work out the relationships between extinct species an exciting and rapidly developing field right now. So, in lieu of answering these questions \u2014 because I can\u2019t \u2014 I will finish with some personal thoughts. At the moment, we don\u2019t have a clear-cut winner when it comes to reconstructing evolutionary relationships using anatomy. Until we do, perhaps a good approach would be to use both methods: if they agree, then we can probably have some confidence in the relationships they infer. If they don\u2019t, then we know that there may be a weak signal in the data we are using, and further work needs to be done. If we settle on one technique down the line, then future readers can place more weight on its results. But above and beyond this, the key uncertainty in both simulation studies and model-based approaches to building evolutionary trees is our lack of a clear model of how anatomy evolves in the real world. With better models for the evolution of morphology, we can both simulate better data to test inference techniques, and derive better trees from real-world data. I think this is where our efforts might be best placed. We know evolution isn\u2019t parsimonious: it doesn\u2019t follow the simplest path. So ultimately, with better models for morphological evolution, we should be able to build better trees using Bayesian than using parsimony. But we are not there yet \u2014 not even close. There is lots still to be done. That\u2019s not an awful place to be, because, damn, it\u2019s exciting.<\/p>\n<h2>Suggestions for further reading<\/h2>\n<p>Goloboff, P. A., Torres, A. &amp; Arias, J. S. Weighted parsimony outperforms other methods of phylogenetic inference under models appropriate for morphology. <i>Cladistics<\/i> <b>34,<\/b> 407\u2013437 (2018). DOI: 10.1111\/cla.12205<\/p>\n<p>Goloboff, P. A., Torres Galvis, A. &amp; Arias, J. S. Parsimony and model\u2010based phylogenetic methods for morphological data: comments on O&#8217;Reilly et al. <i>Palaeontology <\/i><b>61,<\/b> 625\u2013630 (2018). DOI: 10.1111\/pala.12353<\/p>\n<p>O&#8217;Reilly, J. E., Puttick, M. N., Parry, L., Tanner, A. R., Tarver, J. E., Fleming, J., Pisani, D. &amp; Donoghue, P. C. Bayesian methods outperform parsimony but at the expense of precision in the estimation of phylogeny from discrete morphological data. <i>Biology Letters<\/i>, <b>12,<\/b> 20160081 (2016). DOI: 10.1098\/rsbl.2016.0081<\/p>\n<p>O&#8217;Reilly, J. E., Puttick, M. N., Pisani, D. &amp; Donoghue, P. C. Empirical realism of simulated data is more important than the model used to generate it: a reply to Goloboff <i>et al<\/i>. <i>Palaeontology \u00a0<\/i><b>61,<\/b> 631\u2013635 (2018). DOI: 10.1111\/pala.12361<\/p>\n<p>Puttick, M. N., O&#8217;Reilly, J. E., Tanner, A. R., Fleming, J. F., Clark, J., Holloway, L., Lozano-Fernandez, J., Parry, L. A., Tarver, J. E., Pisani, D. &amp; Donoghue, P. C. Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data. <i>Proceedings of the Royal Society B<\/i> <b>284,<\/b> 20162290 (2017). DOI: 10.1098\/rspb.2016.2290<\/p>\n<p>Wright, A. M. and Hillis, D. M. Bayesian analysis using a simple likelihood model outperforms parsimony for estimation of phylogeny from discrete morphological data. <i>PLoS One<\/i> <b>9,<\/b> e109210 (2014). DOI: 10.1371\/journal.pone.0109210<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p><sup>1<\/sup> School of Earth and Environmental Sciences, University of Manchester, Manchester M13 9PL, UK<\/p>\n","protected":false},"excerpt":{"rendered":"<p>by Russell Garwood*1 Introduction \u201cIncreasing knowledge leads to triumphant loss of clarity\u201d \u2014 Palaeontologist Alfred Romer Some areas of life and human endeavour have the luxury of certainty. Along these paths of discovery, there are things we can know to be true or false. In others, it is impossible to assess the concept of truth: [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":4964,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[10],"class_list":["post-4950","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-patterns-in-palaeontology","tag-russell-j-garwood"],"_links":{"self":[{"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/posts\/4950","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4950"}],"version-history":[{"count":5,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/posts\/4950\/revisions"}],"predecessor-version":[{"id":4963,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/posts\/4950\/revisions\/4963"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=\/wp\/v2\/media\/4964"}],"wp:attachment":[{"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4950"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4950"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.palaeontologyonline.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4950"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}