- 生物信息学讨论班

SSNMDI: a novel joint learning model of semi-supervised non-negative matrix factorization and data imputation for clustering of single-cell RNA-seq data

组织者

丘成栋

演讲者

孙楠

时间

2024年01月18日 22:00 至 22:30

地点

Online

摘要

Single-cell RNA sequencing (scRNA-seq) technology attracts extensive attention in the biomedical field. It can be used to measure gene expression and analyze the transcriptome at the single-cell level, enabling the identification of cell types based on unsupervised clustering. Data imputation and dimension reduction are conducted before clustering because scRNA-seq has a high ‘dropout’ rate, noise and linear inseparability. However, independence of dimension reduction, imputation and clustering cannot fully characterize the pattern of the scRNA-seq data, resulting in poor clustering performance. Herein, we propose a novel and accurate algorithm, SSNMDI, that utilizes a joint learning approach to simultaneously perform imputation, dimensionality reduction and cell clustering in a non-negative matrix factorization (NMF) framework. In addition, we integrate the cell annotation as prior information, then transform the joint learning into a semi-supervised NMF model. Through experiments on 14 datasets, we demonstrate that SSNMDI has a faster convergence speed, better dimensionality reduction performance and a more accurate cell clustering performance than previous methods, providing an accurate and robust strategy for analyzing scRNA-seq data. Biological analysis are also conducted to validate the biological significance of our method, including pseudotime analysis, gene ontology and survival analysis. We believe that we are among the first to introduce imputation, partial label information, dimension reduction and clustering to the single-cell field.

演讲者介绍

孙楠目前是北京雁栖湖应用数学研究院的博士后。她的研究方向包括生物信息学、机器学习和应用数学，在The Innovation, Computational and Structural Biotechnology Journal, BMC Bioinformatics, Frontiers in Cellular and Infection Microbiology, Journal of Computational Biology, Genes等期刊发表多篇论文，参与多项国家自然科学基金及北京市自然科学基金项目，主持中国博士后科学基金第78批面上资助。