B細胞受容体の特徴を解析するAIモデルを開発(Researchers Develop AI Model to Decode B-Cell Receptor “Fingerprints”)

ad

2026-05-09 合肥物質科学研究院(HFIPS)

中国科学院合肥物質科学研究院の研究チームは、B細胞受容体(BCR)の「指紋」を解析するAIモデル「BCRInsight」を開発した。BCRは抗原認識だけでなく、B細胞の活性化や分化、クローン進化の履歴を保持しているが、その複雑な関係性の解析は従来困難だった。研究では、大規模配列データに自己教師あり学習を適用し、アミノ酸配列、遺伝子注釈、メタデータを統合するTransformerベースのAIを構築した。BCRInsightは、単一細胞解析より低コストでB細胞サブセット構成を高精度推定でき、抗体結合部位予測ではAUROC 0.962を達成し既存手法を上回った。また、3次元構造情報を学習していないにもかかわらず、抗原結合に重要なHCDR3領域へ注意機構が集中し、配列情報のみから構造・機能を推定できることが示された。研究は、個別化ワクチン、抗体医薬設計、免疫療法開発への応用が期待されている。

B細胞受容体の特徴を解析するAIモデルを開発(Researchers Develop AI Model to Decode B-Cell Receptor “Fingerprints”)
BCRInsight Model Framework (Image by ZHAO Hailong)

<関連情報>

BCRInsight:BCR配列から生物学的シグナルを解読するための抗体言語モデル BCRInsight: an antibody language model to decode biological signals from BCR sequences

Hailong Zhao,Shang Lou,Xuhua Li,Yiyang Gao,Wenjing Cao,Hongcang Gu,Fan Zhang
Briefings in Bioinformatics  Published:14 April 2026
DOI:https://doi.org/10.1093/bib/bbag154

Abstract

The B-cell receptor (BCR) repertoire encodes not only antigen-binding specificity but also intrinsic signatures reflecting B-cell functional states and differentiation trajectories. Deciphering the intricate sequence semantics embedded within these repertoires is pivotal for elucidating immune dynamics and expediting antibody discovery. Although single-cell sequencing provides high-resolution insights, its scalability and cost remain major obstacles, leaving population-level repertoire data underexploited. Furthermore, conventional bioinformatics approaches struggle to model the high-order, non-linear semantic dependencies inherent in antibody sequences. To address these challenges, we present BCRInsight, an antibody-specific pretrained language model that integrates a Transformer architecture with phenotype-aware contrastive learning. Pretrained on 80 million human BCR sequences, BCRInsight learns biologically meaningful contextual representations that encode subtle signatures of B-cell activation, maturation, and clonal evolution. Extensive benchmarking demonstrates that BCRInsight achieves state-of-the-art performance across multiple downstream tasks, particularly in paratope prediction. Further evaluation on diverse single-cell immune cohorts, including healthy, neoplastic, and viral infection states, reveals cross-scenario robustness and superior generalization relative to existing methods. Notably, attention-based analyses show that high-attention regions correspond closely to physical antigen-contact residues, highlighting emergent structural interpretability derived solely from self-supervised learning. Collectively, BCRInsight establishes a new paradigm for decoding the “language” of antibodies, offering a scalable and interpretable framework for computational immunology and rational antibody engineering.

細胞遺伝子工学
ad
ad
Follow
ad
タイトルとURLをコピーしました