Realistic Cell Type Annotation and Discovery for Single-cell RNA-seq Data
被引:0
作者:
Zhai, Yuyao
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing, Peoples R China
Zhai, Yuyao
[1
]
Chen, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Huawei Technol Co Ltd, Shenzhen, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing, Peoples R China
Chen, Liang
[4
]
Deng, Minghua
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing, Peoples R China
Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
Peking Univ, Ctr Quantitat Biol, Beijing, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing, Peoples R China
Deng, Minghua
[1
,2
,3
]
机构:
[1] Peking Univ, Sch Math Sci, Beijing, Peoples R China
[2] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[3] Peking Univ, Ctr Quantitat Biol, Beijing, Peoples R China
[4] Huawei Technol Co Ltd, Shenzhen, Peoples R China
来源:
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023
|
2023年
关键词:
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
The rapid development of single-cell RNA sequencing (scRNA-seq) technologies allows us to explore tissue heterogeneity at the cellular level. Cell annotation plays an essential role in the substantial downstream analysis of scRNA-seq data. Existing methods usually classify the novel cells in target data as an "unassigned" group and rarely discover the fine-grained cell type structure among them. Besides, these methods carry risks, such as susceptibility to batch effect between reference and target data, thus further compromising of inherent discrimination of target data. Considering these limitations, here we propose a new and practical task called realistic cell type annotation and discovery for scRNA-seq data. In this task, cells from seen cell types are given class labels, while cells from novel cell types are given cluster labels. To tackle this problem, we propose an end-to-end algorithm called scPOT from the perspective of optimal transport ( OT). Specifically, we first design an OT-based prototypical representation learning paradigm to encourage both global discriminations of clusters and local consistency of cells to uncover the intrinsic structure of target data. Then we propose an unbalanced OT-based partial alignment strategy with statistical filling to detect the cells from seen cell types across reference and target data. Notably, scPOT also introduces an easy yet effective solution to automatically estimate the total cell type number in target data. Extensive results on our carefully designed evaluation benchmarks demonstrate the superiority of scPOT over various state-of-the-art clustering and annotation methods.
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Wan, Hui
;
Chen, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Chen, Liang
;
Deng, Minghua
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Wang, Hai-Yun
;
Zhao, Jian-Ping
论文数: 0引用数: 0
h-index: 0
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Xinjiang Univ, Inst Math & Phys, Urumqi 830046, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Zhao, Jian-Ping
;
Zheng, Chun-Hou
论文数: 0引用数: 0
h-index: 0
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Zheng, Chun-Hou
;
Su, Yan-Sen
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
机构:
Univ Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Xu, Chenling
;
Lopez, Romain
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Lopez, Romain
;
Mehlman, Edouard
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Ecole Polytech, Ctr Math Appl, Palaiseau, FranceUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Mehlman, Edouard
;
Regier, Jeffrey
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Dept Stat, Ann Arbor, MI USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Regier, Jeffrey
;
Jordan, Michael, I
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Jordan, Michael, I
;
Yosef, Nir
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
MIT & Harvard, Ragon Inst MGH, Boston, MA 02139 USA
Zuckerberg Biohub Investigator, San Francisco, CA 02139 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Wan, Hui
;
Chen, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Chen, Liang
;
Deng, Minghua
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R ChinaPeking Univ, Sch Math Sci, Beijing 100871, Peoples R China
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Wang, Hai-Yun
;
Zhao, Jian-Ping
论文数: 0引用数: 0
h-index: 0
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Xinjiang Univ, Inst Math & Phys, Urumqi 830046, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Zhao, Jian-Ping
;
Zheng, Chun-Hou
论文数: 0引用数: 0
h-index: 0
机构:
Xinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
Zheng, Chun-Hou
;
Su, Yan-Sen
论文数: 0引用数: 0
h-index: 0
机构:
Anhui Univ, Sch Artificial Intelligence, Hefei 230039, Peoples R ChinaXinjiang Univ, Coll Math & Syst Sci, Urumqi 830046, Peoples R China
机构:
Univ Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Xu, Chenling
;
Lopez, Romain
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Lopez, Romain
;
Mehlman, Edouard
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Ecole Polytech, Ctr Math Appl, Palaiseau, FranceUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Mehlman, Edouard
;
Regier, Jeffrey
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Dept Stat, Ann Arbor, MI USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Regier, Jeffrey
;
Jordan, Michael, I
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Jordan, Michael, I
;
Yosef, Nir
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
MIT & Harvard, Ragon Inst MGH, Boston, MA 02139 USA
Zuckerberg Biohub Investigator, San Francisco, CA 02139 USAUniv Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA