「ブラックボックス」から「透明」なエージェントAIへの技術的ブレークスルー(Technical Breakthrough From “Black Box” to “Transparent” Agentic AI)

ad

2026-02-27 上海交通大学(SJTU)

上海交通大学人工知能学院と上海交通大学医学院附属新華医院の研究チームは、希少疾患の誤診・診断困難という課題に対し、エビデンス循環型AI診断システム「DeepRare」を開発した。中央ハブと分散エージェント構造により、医学文献と臨床データを統合し、仮説検証と自己反省を繰り返す“スロースキンキング”型推論を実装。根拠を明示するホワイトボックス型で信頼性を高めた。表現型情報のみで診断精度57.18%を達成し、従来最良モデルを大幅に上回り、専門医超えも確認。遺伝情報統合では70.6%超に向上した。既にオンライン展開と院内品質管理、産業連携を進め、国際的「1万症例」検証も開始する。

<関連情報>

追跡可能な推論を備えた希少疾患診断のためのエージェントシステム An agentic system for rare disease diagnosis with traceable reasoning

Weike Zhao,Chaoyi Wu,Yanjie Fan,Pengcheng Qiu,Xiaoman Zhang,Yuze Sun,Xiao Zhou,Shuju Zhang,Yu Peng,Yanfeng Wang,Xin Sun,Ya Zhang,Yongguo Yu,Kun Sun & Weidi Xie
Nature  Published:18 February 2026
DOI:https://doi.org/10.1038/s41586-025-10097-9

figure 1

Abstract

Rare diseases affect more than 300 million people worldwide1,2,3, yet timely and accurate diagnosis remains an urgent challenge1,3,4,5. Patients often endure a prolonged ‘diagnostic odyssey’ exceeding 5 years, marked by repeated referrals, misdiagnoses and unnecessary interventions, leading to delayed treatment and substantial emotional and economic burden4,5. Here we present DeepRare—a multi-agent system for rare disease differential diagnosis decision support6,7,8 powered by large language models, integrating more than 40 specialized tools and up-to-date knowledge sources. DeepRare processes heterogeneous clinical inputs, including free-text descriptions, structured human phenotype ontology terms and genetic testing results to generate ranked diagnostic hypotheses with transparent reasoning linked to verifiable medical evidence. Evaluated across nine datasets from literature, case reports and clinical centres across Asia, North America and Europe spanning 14 medical specialties, DeepRare demonstrates exceptional performance on 2,919 diseases. In human-phenotype-ontology-based tasks, it achieves an average Recall@1 of 57.18%, outperforming the next best method by 23.79%; in multi-modal tests, it reaches 69.1% compared with Exomiser’s 55.9% on 168 cases. Expert review achieved 95.4% agreement on its reasoning chains, confirming their validity and traceability. Our work not only advances rare disease diagnosis but also demonstrates how the latest powerful large-language-model-driven agentic systems can reshape current clinical workflows.

 

医療・健康
ad
ad
Follow
ad
タイトルとURLをコピーしました