Table Of ContentJournalofExperimentalBotany,Vol.57,No.13,pp.3471–3503,2006
MajorThemesinFloweringResearchSpecialIssue
doi:10.1093/jxb/erl128
Morphological and molecular phylogenetic context of the
angiosperms: contrasting the ‘top-down’ and ‘bottom-up’
approaches used to infer the likely characteristics of the
first flowers
D
Richard M. Bateman1,*, Jason Hilton2 and Paula J. Rudall1 ow
n
1 Jodrell Laboratory, Royal Botanic Gardens Kew, Richmond, Surrey TW9 3AB, UK loa
d
e
2 School of Geography, Earth and Environmental Sciences, University of Birmingham, Edgbaston, d
Birmingham B15 2TT, UK fro
m
h
Received4May2006;Accepted13July2006 ttps
://a
c
a
d
e
m
Abstract dictatingtheperceivedevolutionarypolarityofcharac- ic.o
u
tertransitionsamongtheearly-divergentangiosperms. p
Recentattemptstoaddressthelong-debated‘origin’of .c
These factors still permit a remarkable range of con- om
theangiospermsdependonaphylogeneticframework trasting, yet credible, hypotheses regarding the order /jx
derived from a matrix of taxa versus characters; most b
of acquisition of the many phenotypic characters, /a
assumethatempiricalrigourisproportionaltothesize reproductive and vegetative, that distinguish ‘classic’ rtic
of the matrix. Sequence-based genotypic approaches angiospermy from ‘classic’ gymnospermy. The flower le-a
iirnnecdsreterlisac)steeidnthsthepeencmutrmuamtbreixrofboufetxctaharnaertacscoptenercfisinee(snd,uwctolheeotrhteideaeshsimgahonlrdy- rreemmaaiinnss hillo-dtleyfidniesdpuatnedd;itssommeoddeefi(onritimonosdeasn)dohfyoproigthin- bstract/5
eses of evolutionary relationships preclude a role for 7
phology-based approaches increase the number of the flower in delimiting the angiosperms. We advocate /13/3
phylogenetically informative taxa (including fossils) at maintenance of parallel, reciprocally illuminating pro- 47
theexpenseofaccessingonlyarestrictedspectrumof grammes of morphological and molecular phylogeny 1/4
7
phenotypic characters. The two approaches are cur- reconstruction, respectively supported by homology 75
rently delivering strongly contrasting hypotheses of testing through additional taxa (especially fossils) 39
b
relationship. Most molecular studies indicate that all and evolutionary-developmental genetic studies that y g
extantgymnospermsformanaturalgroup,suggesting explore genes potentially responsible for major phe- ue
s
surprisingly early divergence of the lineage that led to notypic transitions. t o
n
angiosperms, whereas morphology-only phylogenies 2
9
indicate that a succession of (mostly extinct) gymno- Key words: Angiosperm, character optimization, congruence, M
sperms preceded a later angiosperm origin. Causes of development, evolutionary-developmental genetics, flower, arc
h
this conflict include: (i) the vast phenotypic and fossil, gene duplication, gymnosperm, morphology, ontogeny, 2
0
genotypic lacuna, largely reflecting pre-Cenozoic ex- outgroup,phylogeny,pteridophyte,taxonsampling,treerooting. 19
tinctions, that separates early-divergent living angio-
sperms from their closest relatives among the living
gymnosperms; (ii) profound uncertainty regarding Introduction
which (a) extant and (b) extinct angiosperms are
most closely related to gymnosperms; and (iii) pro- ‘An early hope was that the relationships indicated
found uncertainty regarding which (a) extant and (b) among species by DNA were more likely to be correct
extinct gymnosperms are most closely related to thanthosebasedonmorphology;thisnowseemsna¨ıve.’
angiosperms, and thus best serve as ‘outgroups’ (Judd et al., 1999, p. 99)
*Towhomcorrespondenceshouldbeaddressed.E-mail:[email protected]
ªTheAuthor[2006].PublishedbyOxfordUniversityPress[onbehalfoftheSocietyforExperimentalBiology].Allrightsreserved.
ForPermissions,pleasee-mail:[email protected]
3472 Batemanetal.
‘The primary motivating force for preparing this book Table 1. Representative spectrum of definitions of a flower
was the dramatic change in our understanding of
‘...agrowthcomprisingthereproductiveorgansofseedplants’
angiosperm phylogeny during the past 10 years. Many
(ChambersDictionary,1998,p.617)
long-standing[morphological]viewsofdeep-levelrelation- ‘...anaxisonwhichtherestofthefloralorgansareborne’(Fahn,
shipswerealteredsuddenlyandsubstantivelyasadirect 1974,p.425)
‘...[a]specializedreproductiveshoot,consistingofanaxis
result of molecular analyses.’ (Soltis et al., 2005, p. ix)
(receptacle)onwhichareinsertedfourdifferentsortsoforgans
[sepals,petals,stamens,carpels]’(PenguinDictionaryofBiology,
‘Occam’s razor? But that’s for circumcision, surely?’
1971,p.102)
(Tom Sharpe, 1995; Grantchester Grind, London: ‘...anaxis(orreceptacle)bearing,initscompleteform,fourzonesof
Macmillan, p. 82) appendagesthatareconsideredthehomologsofleaves’(Bierhorst,
1971,p.511)
In this review we address one of the most popular ‘...ahighlymodifiedshootbearingspecializedappendages(modified
leaves)’(Juddetal.,1999,p.53)
discussiontopicsinevolutionarybiology:theoriginofthe
‘...anamphisporangiatestrobilusofdeterminategrowthandwithan
D
flower, and, by implication, the origin of the flowering involucrumofmodifiedbracts’(ArberandParkin,1907) o
w
plants (note that we pay much less attention to the better ‘...thereproductivestructureofAnthophyta,derivedevolutionarily n
fromaleafyshootinwhichleaveshavebecomemodifiedintopetals, lo
documented subsequent, family-level radiation of the sepalsandintothecarpelsandstamensinwhichthegametesare ad
e
angiosperms; cf. Friis et al., 2006). We do not attempt to formed’(DictionaryofBiologicalTerms,1995;Harlow:Longman, d
offer definitive answers on the fundamental nature of the p.206) fro
‘...ashootbesetwithsporophylls:...leavesbearingsporangia’ m
flowerbut,instead,aimtoestablishamorerigorouscontext h
for future research. Our main objective is to review and, ‘..(.Gaoesebcetli,o1n9o0f5,aps.h4o6o9t,)orbranchresemblingashortshoot,which ttps
where possible, clarify several of the key issues in this bearsleaforganswhichserveforsexualreproductionandwhichare ://a
broad field. ‘..t.raanspfaorrtmicuedlaractycopredoinfgdlye’te(rWmienbaetrelisnpgo,r1o9g8en9o,ups.2sh)oot...oneofits cade
We willingly follow the modern convention of placing mostdefinitiveorgansisthecarpel;...amegasporophyll[thatis] m
our discussion in the context of an explicit phylogenetic morphologicallydistinctivebecausetheovules(i.e.megasporangia) ic.o
areusuallyenclosedwithinahollowbasalportiondesignatedasthe u
framework, but within this framework we have chosen to ovary’(FosterandGifford,1971,pp.593–594) p.c
o
exercise certain prejudices. Unlike most recent contri- ‘...acompressed,bisexualreproductiveaxiscomposedofstamens, m
butions to this debate, we do not focus on a single phylo- carpelsandasterileperianth’(BaumandHileman,2006) /jx
genetic hypothesis, preferring instead to consider the ‘...abisexualstructurewithmoreorlessrecognizablecarpels;that b/a
implications of a variety of matrix-based phylogenies i(sa,dlaaxmiailn)asrusrtfrauccet,uarensdlawrigtehlymeonrecloosrinlegssthreecoovgunliezsabolnetshtaemireunpsp’er rticle
derivedfromhighlycontrastingkindsofdata.Inassessing (Frohlich,2006) -a
b
‘...adeterminateaxisterminatinginmegasporangiathatare s
the relative merits of these hypotheses, we emphasize surroundedbymicrosporangiaandarecollectivelysubtendedbyat tra
conceptual rigour over statistical robustness. We are leastonesterilelaminarorgan’(presentstudy) ct/5
especially concerned with identifying the optimal roles 7/1
fordifferentcategoriesofdatathatpertaineitherdirectlyor 3/3
4
indirectly to phylogeny reconstruction. We therefore dis- racted from this aggregate of definitions: form, function, 7
1
cuss both molecular and morphological data sources, homology, and taxonomy. /4
7
7
including fossils, are discussed. This issue is explored in Themajorityofdefinitionsidentify(sexual)reproduction 5
3
the context of two contrasting perspectives on land-plant as the primary function of the flower; an uncontentious 9 b
phylogeny: the top-down perspective looks backward statement, but one that does not in itself separate a flower y g
u
through evolutionary time from the present, whereas the from the reproductive structure(s) of any other land plant. e
s
bottom-upperspectivelooksforward throughevolutionary Two definitions include explicit references to taxonom- t o
n
time from the deep past. ically (and phylogenetically) delimited groups of organ- 2
9
isms: the clade of groups that bear flower-like structures M
a
(Anthophytes) and, more controversially, the more inc- rc
h
lusive clade of groups that bear seeds (Spermatophytes). 2
What is a flower? 0
The definition from Chambers Dictionary encompasses 19
Giventhevasttractsofpublishedtextdevotedtotheorigin gymnosperms as well as angiosperms. It therefore begs
of the flower (and of the flowering plants), this is one immediate rejection by all knowledgeable botanists, who
question that could reasonably be assumed to have been routinely equate the structure ‘flower’ with the (mono-
unequivocally answered. Indeed, most glossaries included phyletic) group ‘angiosperms’, and who consequently
in textbooks and reviews omit the term ‘flower’, pres- attempt to define gymnosperms in part by their absence
umably on the assumption that the term is universally of flowers.
understood and the underlying concept is familiar to even With regard to form, several definitions describe the
the most lackadaisical student of botany. However, comp- flower as a composite structure, referring explicitly to the
arisonof the definitions assembledin Table 1reveals little presence of four differentiable categories of (usually
unanimity. Four ostensibly distinct elements can be ext- physically discrete) organs that are organized in a specific
Phylogeneticcontextoftheflower 3473
and reliable linear sequence along the axis towards its various degrees of completeness, cannot adequately be
apex:sepals,petals,stamens,andcarpels.Afewdefinitions definedinwords’.This,andmanyothersimilarstatements
emphasized the function of stamens and carpels for in the literature, refer primarily to the considerable floral
generating gametes (male and female, respectively), and variationrevealedbysurveysofextantangiosperms.Some
Foster and Gifford (1971) and Frohlich (2006) further species lack sepals, petals, or both. For example, in one
identifiedthecarpelasbeingthemostdiagnosticfeatureof clade(SaururaceaeplusPiperaceae)ofthemagnoliidorder
a flower. Foster and Gifford also noted that definitions of Piperales,flowersareentirelyperianth-less,afeaturethatis
afloweromittingreferencetothesestructures,suchasthat closely correlated (at least, in this group) with possession
of Goebel (1905), implicitly encompass not only all of indeterminate inflorescences (Remizowa et al., 2005).
angiosperms but also all gymnosperms and even some of Manyotherspeciesshowmorphologicalgradationbetween
the more derived, reproductively complex, pteridophyte sepals, petals, and/or stamens. Yet others contravene the
groups (thereby rendering less bizarre the aforementioned requirement for bisexuality by bearing functional stamens
D
Chambers Dictionary definition: Table 1). and carpels on separate flowers, either on the same o
w
Crucial to the majority of definitions is an almost individual (monoecy) or on separate individuals (dioecy); n
lo
universally accepted interpretation of the flower as a det- organs of the opposite gender have either been rendered ad
e
erminate reproductive shoot with a defined number of sterile or completely suppressed. For example, the much- d
floral organs, typically (but not universally) arranged in at discussed ‘primitive’ extant angiosperm Amborella (Fig. fro
m
least four distinct whorls: sepals, petals, stamens, and 1A, B) is commonly (though not reliably) dioecious, h
carpels. Specifically, the flower is homologized with an bearing either exclusively male flowers (lacking even ttps
axis bearing sporophylls (i.e. evolutionarily modified sterile carpels) or female flowers that occasionally bear ://a
c
sporangium-bearing leaves or, perhaps more conserva- staminodes, sometimes with a developmental transition ad
e
tively, leaf-like structures). Implicit in these definitions is between stamens and carpels (Buzgo et al., 2004). Many m
ic
the dynamic concept of a transition from an earlier, other early-divergent extant angiosperms even lack carpel .o
u
profoundly different (and thus recognizable) ancestral closurebytissuefusion(EndressandIgersheim,2000a,b; p.c
o
form of reproductive organ. The identity of the ancestral Endress, 2001a) or double fertilization (Williams and m
group is almost universally accepted, namely the gymno- Friedman,2002).Thecolumellarinfratectuminthemicro- /jx
b
sperms; an origin among the (by definition) seed-less sporewallofangiosperms,functionallylinkedtochemical /a
pteridophytes requires a set of morphological transitions recognition systems on the stigmatic surface, is also un- rticle
that seemingly is too radical for any modern botanist to reliably present in basally divergent extant angiosperms -a
b
s
seriously contemplate. (Sampson, 2000; Doyle, 2006; Frohlich, 2006). tra
Thus, we can draw on several categories of information Thus,severalsupposedlydefinitivefeaturesoftheflower ct/5
in order to define a flower. And, having defined a flower, are frequently absent from angiosperms, especially early- 7/1
surely we have by default defined a flowering plant—an divergent angiosperms: these include hermaphroditism, 3/3
4
angiosperm.Whythenhavethemoreinformedamongthe fully closed carpels, and a distinctly whorledarrangement. 7
1
land-plantmorphologistsappendedfirmcautionarynotesto Admittedly, the presence on the ovule of a second (outer) /4
7
7
their preferred definitions? For example, according to integument, and the remarkably conservative structure of 5
3
Foster and Gifford (1971, p. 593), ‘As angiosperms are the stamen (two thecae joined by a connective, each con- 9 b
commonly designated the flowering plants, it might be sistingoftwoembeddedmicrosporangia),aremoreconsis- y g
u
assumed that there is rather general agreement about the tent features of the angiosperm flower, though they too e
s
scientificconceptofaflower.Unfortunately,thisisnotthe have potential homologues in some gymnosperm groups t o
n
case, and the literature on floral organography, ontogeny, (Doyle, 2006). 2
9
and structure displays widely divergent viewpoints of the Furthermore, the flower-subtending bract, normally M
a
fundamental nature of the flower as well as on the regarded as extra-floral, can strongly influence, or even rc
h
interpretation of its component organs (sepals, petals, be considered part of, the flower (e.g. Remizowa et al., 2
0
stamens, and carpels). One of the basic difficulties lies in 2005; Buzgo et al., 2006). Bringing a palaeobotanical 19
ourcomplete[sic]ignoranceoftheevolutionaryhistoryof perspective to bear, Stewart and Rothwell (1993, p. 440)
the flower ...; it becomes largely a matter of conjecture coined the phrase ‘accessory reproductive structures’ to
whether it is justifiable to draw comparisons between encompass sterile organs associated with reproductive
modern angiospermous flowers and the spore-producing functions—not only perianth members (tepals, or petals
structures of other tracheophytes [vascular land-plants]. If plussepals)butalsoanother(lesswell-researched)leaf-like
suchcomparisonsareattempted,itisquitepossibletoreach organ,theflower-subtendingbract.Bracts(andbracteoles)
eitheraverybroadoraveryrestrictedconceptordefinition share with perianthmembersthe characteristics of exhibit-
of a flower.’ ingmanyleaf-likefeaturesbutbeingpositionallyfixedwith
Bierhorst (1971, p. 511) preceded his definition by respect to either a flower or an inflorescence. In some
stating that a flower is ‘a structure that, because of its angiosperms, the bract has become intimately integrated
3474 Batemanetal.
D
o
w
n
lo
a
d
e
d
fro
m
h
ttp
s
://a
c
a
d
e
m
ic
.o
u
p
.c
o
m
/jx
b
/a
rtic
le
-a
b
s
tra
c
t/5
7
/1
3
/3
4
7
Fig. 1. (A)Femaleand(B)male flowersoftheputative basallydivergentextant angiosperm Amborellatrichopoda.(C)Male (above)andfemale 1
/4
(below)reproductivestructuresofthemuch-discussedJurassicfossilgymnospermCaytonia(Caytoniales)and(D)hermaphroditereproductivestructure 7
7
of the Jurassic fossil gymnosperm Williamsoniella (Bennettitales); bothare candidate sister-groups to theangiosperms. (C and Dreproduced, with 5
3
permission,fromfigs1.8and1.11,respectively,ofSoltisetal.,2005,andreprintedfromCrane,1985.)Scalebar=100lm. 9
b
y
g
u
e
s
into the flower; for example, Amborella (Fig. 1A, B) and ‘anthostrobilus’ for the modern angiosperm flower, and t o
n
several other early-divergent angiosperms (e.g. Austro- ‘pro-anthostrobilus’ for the type of cone manifested not 2
9
baileya, Trimenia) exhibit a morphological continuum onlybytheMesozoicbennettiteWilliamsoniella(Fig.1D) M
a
between bracts and perianth (Endress, 2001b; Buzgo but also by a hypothetical group of extinct stem-group rc
h
et al., 2004). In others (e.g. Araceae, Cornaceae, Saurur- angiosperms that they termed ‘Hemiangiospermae’. They 2
0
aceae) the inflorescence bracts are conspicuously petaloid noted that the bennettite cone possessed a series of sterile 19
and hence perform at least one of the functions (pollinator leaf-likeorgansthattheyinterpretedasan‘undifferentiated
attraction)thataremoretypicallyperformedbypetals.For primitive perianth’. They also insightfully defined a major
example, in two genera (Houttuynia and Anemopsis) of subsetof(derived)angiospermsaspossessing‘aeuanthos-
the perianth-less family Saururaceae, the inflorescence trobilus, of which the distinctive features are the presence
bracts form a pseudocorolla, and the entire inflorescence ofthespecialtypeofmicrosporophylltermedastamen,and
has a flower-like appearance (Tucker, 1981). of closed carpels’ (Arber and Parkin, 1907, p. 75). Thus,
Moresignificantly,manyofthegymnospermousgroups, if flowers are at least partly defined not by possession of
bothextantandextinct,whicharebydefinitionconsidered sepals and/or petals but by possession of the broader
to lack a differentiated perianth, unequivocally possess category of leaf-like accessory reproductive structures, the
bract-likeorgans.ArberandParkin(1907)coinedtheterms flowerisnolongerseenasaunique(anddefining)attribute
Phylogeneticcontextoftheflower 3475
of the angiosperms, and when using the term ‘flowering (carpels) (Arber and Parkin, 1907; Arber, 1937). By
plants’ certain gymnospermous groups, especially extinct contrast, the more diverse pseudanthial theories all per-
taxa such as Bennettitales, should strictly be included (cf. ceive the flower as having condensed from a multiaxial
Crane, 1988). This conclusion is implicit in our own structure (Wettstein, 1907; Melville, 1960; Eames, 1961;
(inevitablyimperfect)definitionofaflower:adeterminate Meeuse, 1975, 1987, Stuessy, 2004). Thus, the difference
axis bearing megasporangia that are surrounded by betweenthetwoconflictinghypothesesrelatesmoretothe
microsporangia and are collectively subtended by at least nature of individual organs than to the flower as a whole.
one sterile laminar organ. In formulating this definition, Several authors have postulated secondary derivation of
the orientation of our discussion has, in practice, switched flower-like structures from inflorescences (i.e. a secondary
from top-down to bottom-up. We note that, although we pseudanthial origin in certain phylogenetic groups), based
do not necessarily exclude multiple origins of flower-like on ontogenetic evidence. These studies relate to both
structures(seebelow),ourdefinitionhasconvergedonthat gymnosperms (Gnetales: Mundry and Stu¨tzel, 2004) and
D
of Arber and Parkin (1907, p. 75), who characterized the angiosperms (e.g. alismatid monocots: many authors, o
w
anthostrobilusas‘aspecialformofamphisporangiatecone, reviewed by Sokoloff et al., 2006). Multiple origins of n
lo
distinguishedbythepeculiarjuxtapositionofthemega-and flower-like structures, both within angiosperms and other ad
e
microsporophylls, and by possessing a well-marked peri- seed plants, are implicit in these pseudanthial hypotheses. d
anth’. We also note that Arber and Parkin primarily (and Studies of flowers prompted by the various insights fro
m
almost uniquely) employed a bottom-up approach in what outlinedabove were, to varying degrees, comparative,and h
becameabenchmarkstudyinfloralevolutionthatremained in some cases hypothesis-testing. However, they often (i) ttps
influential throughout the ensuing century. focused on limited suites of morphological characters ://a
c
consideredaprioritobeofparticularimportance,(ii)failed ad
e
to determine whether the characters of interest were ever m
ic
specifiedbyasinglegenome(i.e.wereobservedinasingle .o
Which are the (other) benchmark studies in u
floral evolution? individual), and/or (iii) unjustifiably equated extreme div- p.co
ersityofformwithincreasedlikelihoodofmultipleorigins m
Certain conceptual thresholds can readily be identified in of the feature in question. More importantly, they lacked /jx
b
thestudyofflowersandfloweringplants.Inhisbenchmark aunifyingconceptualframework.Thiscrucialprerequisite /a
classification of flowering plants, Linnaeus (Linne´, 1735) has been provided, first by phylogeny reconstruction and rticle
emphasizedthesignificanceofthenumberandarrangement then by evolutionary-developmental genetics, epitomized -a
b
s
ofstamensandcarpels.Thepossibleequivalenceofsepals, by the ABC model of floral whorl control (e.g. Coen and tra
petals, and stamen filaments (though not explicitly the Meyerowitz, 1991; Irish and Kramer, 1998; Kramer and ct/5
carpels) to modified leaves was greatly elaborated by Irish, 1999; Lawton-Rauh et al., 2000; Cronk et al., 2002; 7/1
Goethe (1790) in an essentialist essay partly stimulated Theissenetal.,2002). 3/3
4
by studying plant teratologies. However, Goethe’s ideas 7
1
about simple equivalence, summarized by the famous /4
7
7
phrase ‘all is leaf’, had no phylogenetic implications 5
Which are the benchmark studies in 3
(Lo¨nnig, 1994). As noted by adherents of Zimmerman’s reconstructing seed-plant phylogeny? 9 b
telome theory (Zimmerman, 1930, 1938), such as Wilson y g
u
(1937), spore-bearing organs (i.e. sporangia) evolved The now ubiquitous tree motif routinely used to repre- e
s
before leaves in early land plants. By contrast, Darwin’s sent the supposed sequence of divergence of evolutionary t o
n
(1859) developmentof a credibleevolutionary mechanism lineages was famously employed by Darwin (1859) and 2
9
to explain (albeit in a uniformly gradualistic manner) such elaborated by authors such as Haeckel (1894). However, M
a
radical morphological transitions was followed by the onlywiththeearlycladisticworksofentomologistsHennig rc
h
harnessing of Mendelian genetics to address the control (1966) and Brundin (1972) were we provided with the 2
0
ofsuchtransitionsbyauthorssuchasDeVries(1906)and rigorous conceptual framework needed to generate such 19
later Wardlaw (1965). trees with a large degree of objectivity from matrices of
These new data sources informed some increasingly coded taxa, each explicitly scored for competing states of
sharp exchanges between proponents of the two main sets a broad suite of characters. Central to this approach is the
of theories competing to explain the origin of the flower. conceptofthecongruencetestofhomology(e.g.Patterson,
The morecommon euanthial theory that waspioneered by 1988). The preferred phylogeny for the scored taxa is the
Goethe(andisimplicitinmostofthedefinitionsofaflower one requiring fewest transitions between character-states.
collated in Table 1) postulates derivation from an un- Wecanthenobserveintheresultingtree(s)whichcharacter-
branched,uniaxialstructure,andhenceinterpretstheflower states delimit which taxonomic groups—in other words,
as a condensed sporophyll-bearing single axis with prox- whichpriorstatementsofhomologybetween species have
imalmicrosporophylls(stamens)anddistalmegasporophylls been upheld in the most-parsimonious tree(s). The
3476 Batemanetal.
particulate nature of the numerically scored character stickfor comparative studies that focus simultaneously on
states is preserved in the resulting trees, thereby simpli- changes in key developmental genes and the phenotypic
fying various kinds of post hoc character analysis. charactersthatthey‘control’—agrowingdisciplinetermed
The cladistic approach was first used to compare the evolutionary-developmental genetics (e.g. Cronk et al.,
majorgroupsofseed-plantsbyHillandCrane(1982).With 2002). This approach is especially attractive, as it repre-
theassistanceofrapidlyimprovingcomputerhardwareand sents an alternative, and potentially more informative, test
software, it spawned an increasing number of morpholog- of a priori homology statements that also explicitly links
ical phylogenies based on parsimony analyses during the genotype with the resulting phenotype, and thence ulti-
1980s and early 1990s. If we consider the development matelywithspecificbiologicalfunction(s).Thus,themajor
of taxonomically broad morphological matrices for seed- phenotypic transitions first explored morphologically by
plants, four main lineages can be recognized, nucleating Goethe (1790), functionally by Darwin (e.g. 1859), and
around PR Crane (e.g. Hill and Crane, 1982; Crane, 1985, genetically by De Vries (1906) can at last begin to be
D
1988), JA Doyle (e.g. Doyle and Donoghue, 1986, 1987, examined for fundamental causation (e.g. Frohlich and o
w
1992; Doyle, 1996, 1998a, b, 2006; Hilton and Bateman, Parker, 2000; Bateman and DiMichele, 2002; Vergara- n
lo
2006), GW Rothwell (e.g. Rothwell and Serbet, 1994; Silva, 2003; Baum and Hileman, 2006; Hintz et al., 2006; ad
e
Rothwell and Nixon, 2006), and DW Stevenson (e.g. Theissen, 2006). d
Loconte and Stevenson, 1990, 1991; Nixon et al., 1994). Despite their conflicting topologies (Fig. 2), these fro
m
Thesestudiesplacedasclosestrelativesofangiosperms,in different approaches share some common elements with h
various paraphyletic combinations, several groups (most respecttoseed-plantrelationships.Angiospermmonophyly ttps
of them wholly extinct) that possess reproductive organs is routinely inferred. Nonetheless, a few brave authors ://a
c
showing some flower-like properties, notably Gnetales, generally viewed as mavericks, notably Meeuse (1976, ad
e
Bennettitales (Fig. 1D), Pentoxylon, Caytonia (Fig. 1C), 1987), have penned morphologically based (though not m
ic
andglossopterids.Sucharrangementssoonbecameknown matrix-based)argumentsforpolyphyly.Cladisticanalyses, .o
u
collectively as the anthophyte hypothesis, implying that both morphological and molecular, consistently place the p.c
o
the flower evolved only once and, hence, that all flowers clade containing all extant angiosperm lineages (the m
are fundamentally homologous. ‘crown-group’ angiosperms: Fig. 3) on a long branch /jx
b
By1990,these morphologicalanalyses werecompeting with respect to the remaining seed plants. ‘Missing links’ /a
with(andsoonlargelysupersededby)phylogeneticstudies (i.e. stem-group) angiosperms have been postulated from rticle
employing as characters the sequences of nucleotides (and the fossil record but have not hitherto obtained universal -a
b
s
insertion–deletion mutations) in specific regions of the acceptance (e.g. Archaefructus, discussed below). Further- tra
three plant genomes: nuclear-chromosomal, plastid, and more,relationshipsamongextantgymnosperms,andhence ct/5
mitochondrial.Asthenumberofreadilysequencedregions homologies among their reproductive structures, remain 7/1
(and hence the number of usable characters) increased contentious. These issues can only be partly circumvented 3/3
4
exponentially, driven by advances in sequencing technol- by adopting a top-down approach but are fundamental to 7
1
ogy, maximum parsimony was supplemented with more any bottom-up approach. /4
7
7
mathematically complex methods of generating trees, 5
3
notably maximum likelihood and, latterly, Bayesian ap- 9 b
proaches (e.g. Page and Holmes, 1998). The current spe- y g
u
cies richness and ecological pre-eminence of angiosperms Which kinds of phylogeny are of greatest value? e
s
have caused them to be preferentially sampled for phylo- t o
Why do tensions remain within the phylogenetic com- n
genetic studies. The result is a framework of relationships 2
munity regarding molecular versus morphological char- 9
that is generally viewed as well-sampled and increasingly M
acters and extant versus extinct taxa? Over the past a
(though by no means universally) as reliable (e.g. Soltis rc
two decades, molecular phylogenies have in practice h
et al., 2004, 2005), and underpins higher classifications 2
largely superseded morphological phylogenies. This shift 0
thatarebasedon‘natural’monophyleticgroups(e.g.APGII: 19
of emphasis has had profound consequences, yet it is
Angiosperm Phylogeny Group, 2003).
now rarely discussed. Arguments most commonly adv-
At present, increasing numbers of species have been
anced against morphological rather than molecular phylo-
sequenced for the complete plastid and/or mitochondrial
genetic analyses are:
genomes (e.g. Goremykin et al., 2005), and a few for the
entire nuclear genome (e.g. rice, Arabidopsis; Arabidopsis (i) the limited number of characters available (and the
GenomeInitiative,2000;Bennetzen2002;Yuetal.,2002; existence of an asymptote of the number of phyloge-
Yamadaetal.,2003;InternationalRiceGenomeSequenc- netically informative character states available as taxa
ing Project, 2005). These sequence-based data not only are successively added to a matrix: Bateman, 1992);
allow character-rich (if presently species-poor) phylogeny (ii) the high cost in time expended per unit character
reconstructions of extant species but also provide a yard- coded(bothintermsofactualcharacterscoringandthe
Phylogeneticcontextoftheflower 3477
D
o
w
n
lo
a
d
e
d
fro
m
h
ttp
s
://a
c
a
d
e
m
Fig. 2. Crudeconsensusofphylogeneticrelationshipsrecognizedbymorphologicalanalysesusingextantandextincttaxa(A)andsequence-based ic
analysesusingonlyextanttaxa(B),illustratingthedominanceofparaphylyintheformerandmonophylyinthelatter. .o
u
p
.c
o
m
‘informalapprenticeship’thatmustfirstbeundertaken deletion events (in other words, in assigning some /jx
b
in order to describe and code characters correctly); states of some taxa to the correct character); /a
(iii) an inevitable degree of subjectivity involved in (iv) the availability of the same restricted set of a maxi- rtic
le
making a priori homology assessments (in other mumoffour(orarguablyfive:A,C,G,T,andabsent) -a
b
words, in delimiting characters before each is in turn states for each position/character artificially masks s
tra
divided into alternative character states); much of the actual homoplasy (e.g. when a particular c
(iv) the supposed comparatively high level of homoplasy A mutates to a T but later reverts to an A); t/57
/1
(this is caused by conflicting character states, which (v) the potential in expressed regions of the genome for 3
/3
are considered to mainly reflect similar responses to particular categories of nucleotide to behave differ- 4
7
similar pressures of directional or disruptive selection ently (e.g. contrasting mutational rates in different 1/4
7
in lineages that are in fact only distantly related: compartments and regions of the genome, first and 7
5
Scotland et al., 2003), and; second versus third bases within codons, and radical 39
(v) the potential developmental correlation of apparently contrasts in the GC:AT ratio). by
g
unrelated characters (e.g. pleiotropic expression of u
e
a single critical mutation). Amdodrpithioonloaglicpahleannodmmenoalectuhlaatr mcaantrinceesgainticvleuldye:affect both st on
2
9
Atleastsomeobserversperceivecorrespondingconstraints (i) strong heterogeneity of rates of change within the M
a
on molecular phylogenies: study group, encouraging the most rapidly changing rc
h
branches to falsely coalesce; this now thoroughly 2
0
(i) the limited number, and sporadic phylogenetic distri- researched, but still problematic, phenomenon is 19
bution, of taxa available, due to the inability to allow termed long-branch attraction (e.g. Sanderson et al.,
molecularly recalcitrant extinct taxa to participate in 2000; Felsenstein, 2004);
the tree-building procedure (a central issue of this (ii) migration of genes (and the phenotypic characters
paper); that they underpin) between lineages through hybrid-
(ii) the fact that routinely sequenced regions of the three ization, lateral gene transfer, and organelle capture,
genomes do not participate directly in the phenotypic and;
transitions that signal a macroevolutionary event (iii) the necessity of rooting the tree, typically through
(Bateman,1999); outgroup choice, in order to polarize characters and
(iii) an inevitable degree of subjectivity in aligning thereby study character evolution (another central
nucleotides in matrices that are rich in insertion– issue of this paper: see below).
3478 Batemanetal.
andthenmapped,therebyeffectivelygeneratingtautologous
interpretationsofevolution(Bateman,1999).
How significant are optimization and outgroups
choice?
The concept of character mapping leads naturally into
discussionofanotherareaofphylogeneticsthatispivotalto
thequestionsaddressedbythispaper,namelyoptimization.
If we wish to know what the first flower looked like, but
have not found it in the fossil record (such a discovery is
D
highly improbable, given the undoubted patchiness of the o
w
fossil record of land-plants), we need to reconstruct that n
lo
flower conceptually. This is achieved using combinations ad
e
ofcharactersfoundinspecieswhosemorphologyhasbeen d
carefully described and whose phylogenetic relationships fro
m
have been rigorously inferred (this phrase is not oxymor- h
onic; it is important to remember that even the most ttps
rigorously reconstructed phylogeny remains wholly infer- ://a
cFoign.ce3p.tsHoyfpocrthoewtincaglrpohuyploagnednysteilmlusgtrraotuinpg, athnedrethlaetisviigsntiicficnaantucreeoofftthhee ential).Providedwehaveaccesstoamatrixthatthoroughly cade
subtendingnodesinanyattempttoreconstructthenatureofhypothetical describesthemorphologyofallorgansofthespeciesunder m
ancestors. The diagram also contrasts the node-based and apomorphy- scrutiny, then we can reconstruct the morphology of the ic.o
basedapproaches to delimiting monophyletic groups. Dashed branches u
subtendextincttaxa,cross-strikesonthebranchimmediatelybelowthean- hypothetical ancestors that lie on each node of the clado- p.c
o
giospermcrown-groupnodeindicateindividualcharacter-statetransitions. gram. Optimization is most readily achieved by a simple m
logical protocol that works downward through the tree /jx
b
In addition, the main source of operator bias influencing from the terminal taxa toward the outgroup node (Fig. 3) /a
the resulting topology is a priori homology assessment (e.g. Maddison and Maddison, 2001), though other, more rticle
for morphological data, whereas for molecular data it is complex, models can be applied, sometimes with advanta- -a
b
s
selectivity among (i) available characters and (ii) tree- geousresults(OakleyandCunningham,2000;Polly,2001; tra
buildingalgorithms. Moreover,bothcategories ofanalysis Webster and Purvis, 2002; Crisp and Cook, 2005). ct/5
are prone to culling of ‘troublesome’ taxa from matrices, Amost-parsimonioustree,derivedfromaparticulardata 7/1
and revised outgroup choices made in search of more matrix, yields similar information about the relationships 3/3
4
‘intuitively acceptable’ topologies. of the coded taxa and the relative lengths of the various 7
1
Many phylogeneticists advocate a compromise approach branches within the tree, irrespective of whether it is /4
7
7
to the relative treatment of morphological and molecular unrooted or rooted. However, an unrooted tree lacks 5
3
characters. In one frequently used protocol, morphological polarity; it cannot be read in terms of transitions from 9 b
charactersarecombinedwiththemolecularcharactersprior ancestral character states (plesiomorphies) to derived y g
u
totreebuilding,butonlyaftertheinitialmorphologicalmatrix character states (apomorphies). This in turn means that it e
s
hasbeenreducedtoits bareessentials by cullingcharacters is not possible to optimize character states in order to t o
n
thatevokesuspicion,mostcommonlyduetothedifficultyof reconstructthehypotheticalancestorsoccupyingthenodes 2
9
dividing a complex of continuous character into discrete ofthetree.Inearlier(andmoreexperimental)phasesinthe M
a
states. However, a recent detailed analysis by Wortley and evolution of methods of phylogenetic analysis, several rc
h
Scotland(2006)convincinglyrefutesthisapproach,demon- conceptually distinct approaches to rooting were explored, 2
0
strating that the culled characters contain the same average but most proved impractical and all appeared at least 19
strength of phylogenetic signal as the supposedly superior partially tautologous.
characters that survive the cull. The alternative, and more Over the last two decades a single method, termed
commonly used, approach is generally termed ‘mapping’. outgroupcomparison(e.g.NixonandCarpenter,1993),has
Here,themorphologicalcharactersareplacedalongthetips become ubiquitous, to the point where it is routinely
of the phylogeny after it has been constructed. Mapping adopted without serious thought by almost all practising
prevents morphological characters from contributing in any phylogeneticists. It requires the analyst to select a priori
waytothetree-buildingexercise.Mappingcanalsodissuade a set of comparable taxa (preferably species) that are of
researchersfromseeking(andthenoccasionallydiscovering) interest and are suspected of being an inclusive, mono-
newphylogeneticallyvaluablecharacters.Alltoooften,well- phyleticgroup(inpractice,thestrongestguidetoperceived
known phenotypic characters of a priori interest are scored monophyly is a previous, and taxonomically broader,
Phylogeneticcontextoftheflower 3479
phylogenetic analysis; thus, the outgroup method is offerthemostobviousmeansofmaximizingthenumberof
vulnerable to accusations of logical tautology). Then, one characters per taxon.
or more additional taxa, thought to lie phylogenetically Given the continuing uncertainties regarding relation-
outside the chosen study group, and hence operationally ships among the major groups of land plants (discussed
termed outgroups, are chosen to simultaneously root the below), and their potential influence on optimizations of
tree and polarize the scored characters. If multiple out- nodes,severalbranch-pointsdistantfromthetargetnode(s),
groups are chosen they constitute a test (albeit flawed) of we sought phylogenies that encompassed all such major
themonophylyoftheingroup,sinceanoutgrouptaxonthat groups. In the case of morphological phylogenies, no
is placed phylogenetically within the ingroup contradicts existing study convincingly stretched from the bryophytes
ingroup monophyly. Clearly, the larger the numbers of to the more derived angiosperms (eudicots sensu APGII:
ingroup and outgroup taxa, the stronger is this test. AngiospermPhylogenyGroup,2003).Wethereforetookthe
Outgroup choice leaves the analyst with a further controversial step of grafting a selected most-parsimonious
D
quandary. In most cases, the ingroup is chosen because it tree from our recent analysis of 48 lignophytes (angio- o
w
is a morphologically and/or molecularly distinct aggregate sperms plus gymnosperms, plus progymnospermous pter- n
lo
ofspecies,andsoislikelytodifferconsiderablyfromeven idophytes as operational outgroups: Hilton and Bateman, ad
e
itsphylogeneticallyclosestoutgroups(hence,intheresult- 2006, fig. 10) onto the product of a recent parsimony d
ingtrees,amonophyleticingroupislikelytobesubtended analysisof52codedtaxathatfocusedonpteridophytesbut fro
m
by a relatively long branch). These differences make used the fossil non-vascular polysporangiophyte Aglao- h
characterdelimitationmoredifficultforbothmorphological phytonasoutgroupandjustfivetaxa(oneprogymnosperm ttps
andmolecularanalysts.Forthemorphologist,theoutgroup plus three primitive pteridospermous gymnosperms and ://a
c
is likely to lack structures found in the ingroup, possess the extant Pinus) as representative ‘placeholders’ for the ad
e
structuresnotfoundintheingroup,orthetwogroupsmay monophyleticlignophytes(RothwellandNixon,2006,fig. m
ic
possess structures that are broadly similar yet sufficiently 3). The resulting composite phylogeny contains 95 coded .o
u
different that their homology cannot be adequately asses- taxa:47whollyextinctand48containingatleastoneextant p.c
o
sed. For the molecular researcher, sequence alignment species (Fig. 4). m
becomes more challenging for many genic regions, and Selecting among the many broad-brush molecular trees /jx
b
base saturation becomes an ever-increasing risk. These in order to produce Fig. 5 proved even more problematic. /a
problems of strong ingroup–outgroup divergence can, in The controversial analysis by Goremykin et al. (2003, rticle
theory, be averted by identifying one or more members of 2004;seealsoSoltisetal.,2004;Martinetal.,2005)stood -a
b
s
theingroupasoperationaloutgroups,buttheanalystisthen outasbeingespeciallycharacter-rich,sinceitwasbasedon tra
imposing a strong subjective overprint on the topology of nearly completely sequenced plastid genomes. The trade- ct/5
the resulting tree(s). Certainly, the polarity of characters, offisthattheanalysislackeddatafromtheothertwoplant 7/1
and thus the optimizations that allow reconstruction of the genomes (nucleus and mitochondrion), and was restricted 3/3
4
properties of the hypothetical ancestor of the ingroup, to <20 coded taxa; moreover, the taxa were selected as 7
1
would be strongly influenced by such a decision. much for their economic importance as their likely /4
7
7
But before we contemplate implementing of an optim- phylogenetic significance. This left the trees open to 5
3
ization procedure, we must first select our preferred accusations of distortion by long-branch attraction (Soltis 9 b
topology from among the very broad spectrum of land- et al., 2004; Stefanovic et al., 2004; Leebens-Mack et al., y g
u
plant phylogenies generated since 1982. 2005),thoughamorerecentstudysuggeststhatlikelihood e
s
model mis-specification is a more probable source of t o
n
systematic error in the trees (Goremykin and Hellwig, 2
9
2006). The matrix of Soltis et al. (2002) encompassed M
What is the best way to choose among the a
a similar number of coded taxa, albeit more evenly rc
plethora of phylogenies? h
distributed across the extant land-plant phylogeny, and 2
0
Oneofourprimaryobjectiveshereistoexploretherelative sequenced eight genic regions distributed among the 19
merits of maximizing sampling of taxa versus that of plastid (rbcL, atpB, psaA, psbB), mitochondrion (mtSSU,
characters per taxon in taxonomically broad (i.e. at least cox1, atpA), and nucleus (18S rDNA).
Class level) phylogenetic analyses. Morphological studies Attheotherendofthespectrumofmolecularcharacters
that include the best-understood fossils (i.e. conceptual per coded taxon are analyses based on a single region but
whole plants reconstructed from their component parts: with substantial taxon sampling. A good example is the
Chaloner,1986;Bateman,1992)offerthebestopportunity 2538-taxon rbcL study of land plants by Ka¨llersjo¨ et al.
to maximize sampling of taxa that provide strongly (1998, 1999), subsequently reduced to an illuminating
contrasting combinations of character states. By contrast, series of analyses of various combinations of 80 taxa
recentmoleculartrees,someofwhichcompareentireplas- sampled evenly across the phylogenyof extantland plants
tid genomes (e.g. Goremykin et al., 2003, 2004, 2005), by Rydin and Ka¨llersjo¨ (2002). Matrices of intermediate
3480 Batemanetal.
D
o
w
n
lo
a
d
e
d
fro
m
h
ttp
s
://a
c
a
d
e
m
ic
.o
u
p
.c
o
m
/jx
b
/a
rtic
le
-a
b
s
tra
c
t/5
7
/1
3
/3
4
7
1
/4
7
7
5
3
9
b
y
g
u
e
s
t o
n
2
9
M
a
rc
h
2
0
1
9
Fig. 4. Compositemorphologicalphylogenyof95taxa(47fossil,inboldface)obtainedbygrafting(atarrow)aseed-planttreeofHiltonandBateman
(2006,fig.10)ontoapteridophytetreeofRothwellandNixon(2006,fig.3a).
Description:a growth comprising the reproductive organs of seed plants'. (Chambers Dictionary infuriating, study on 'resolving incongruence' in phyloge-.