分類が困難な種群に対する統合的種識別フレームワークを開発（Researchers Develop Integrative Framework for Species Identification in Taxonomically Complex Groups）

2026-04-24

2026-04-20 中国科学院（CAS）

中国科学院西双版納熱帯植物園（XTBG）の研究チームは、多倍体化や交雑により種の判別が困難な植物群に対し、統合的な種同定フレームワークを開発した。形態的特徴の安定性評価、標本の再検討、計算モデリングを組み合わせ、東アジアのアブラナ科植物群を対象に5,000点以上を解析。その結果、従来の標本の12～50％が誤同定されていたことが判明した。誤りは環境で変化する形質への依存に起因し、種子配置や花弁数、ゲノムサイズが有効な識別指標と特定された。再同定後の分布モデルでは生息域予測が大きく変化し、気候変動下での生息域減少も明確化した。本手法は生物多様性保全や農業・医薬分野に重要な基盤を提供する。

＜関連情報＞

ロリッパ属（アブラナ科）における倍数体複合体を特定するためのフレームワーク：形質進化、標本記録、機械学習の組み合わせ A framework for identifying the polyploid complex in Rorippa (Brassicaceae): combining trait evolution, herbarium records and machine learning

Ting-Shen Han ,Jun-Xian Lv ,Yao-Wu Xing
Annals of Botany Published:05 March 2026
DOI:https://doi.org/10.1093/aob/mcag050

Abstract

Background and Aims

Species identification in polyploid plants remains challenging owing to morphological continuity and genomic redundancy. Such taxonomic uncertainties obscure evolutionary or ecological inference. A critical solution involves the reassessment of polyploid collections using stable diagnostic traits and integrative approaches. Here, we examined the Rorippa dubia–indica complex (Brassicaceae), a morphologically overlapping tetraploid–hexaploid lineage with a native distribution in East Asia.

Methods

We developed a framework that integrates experimental phenotyping, herbarium reassessment and computational modelling for secondary species assessment of polyploid plants. The framework incorporates spatiotemporal data from 3136 field-collected (2017–2020) and 2015 herbarium (1893–2021) specimens. Species were circumscribed using experimental assessments of anatomical, cytological and morphological traits, interpreted within a phylogenetically informed evolutionary context. Stable diagnostic traits were then applied to re-identify specimens for improved species distribution models. Finally, curated trait and species data were used to train machine learning classification models to reconstruct the diagnostic rationale underlying specimen identification.

Key Results

Seed arrangement, number of petals and genome size exhibited clear interspecific differentiation. Phylogenomic analyses based on chloroplast genomes further resolved species circumscription consistent with these traits. According to the revision of specimens and classification models defined by machine learning, we found that initial misidentification rates reached 12–50 % across virtual or physical specimens, largely owing to reliance on plastic traits, such as leaf shape. These errors substantially distorted spatial distribution models and future climate projections.

Conclusions

Our findings underscore the need for secondary specimen evaluation. The framework demonstrates the importance of integrating morphological and phylogenetic inference with machine learning tools to resolve taxonomically difficult polyploid complexes. This approach offers direct applications for biodiversity assessment, evolutionary research and conservation planning.

月	火	水	木	金	土	日
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31