Loading [a11y]/accessibility-menu.js
Analyzing Large-Scale Single-Cell RNA-Seq Data Using Coreset | IEEE Journals & Magazine | IEEE Xplore

Analyzing Large-Scale Single-Cell RNA-Seq Data Using Coreset


Abstract:

The recent boom in single-cell sequencing technologies provides valuable insights into the transcriptomes of individual cells. Through single-cell data analyses, a number...Show More

Abstract:

The recent boom in single-cell sequencing technologies provides valuable insights into the transcriptomes of individual cells. Through single-cell data analyses, a number of biological discoveries, such as novel cell types, developmental cell lineage trajectories, and gene regulatory networks, have been uncovered. However, the massive and increasingly accumulated single-cell datasets have also posed a seriously computational and analytical challenge for researchers. To address this issue, one typically applies dimensionality reduction approaches to reduce the large-scale datasets. However, these approaches are generally computationally infeasible for tall matrices. In addition, the downstream data analysis tasks such as clustering still take a large time complexity even on the dimension-reduced datasets. We present single-cell Coreset (scCoreset), a data summarization framework that extracts a small weighted subset of cells from a huge sparse single-cell RNA-seq data to facilitate the downstream data analysis tasks. Single-cell data analyses run on the extracted subset yield similar results to those derived from the original uncompressed data. Tests on various single-cell datasets show that scCoreset outperforms the existing data summarization approaches for common downstream tasks such as visualization and clustering. We believe that scCoreset can serve as a useful plug-in tool to improve the efficiency of current single-cell RNA-seq data analyses.
Page(s): 1784 - 1793
Date of Publication: 24 June 2024

ISSN Information:

PubMed ID: 38913513

Funding Agency:

Author image of Khalid Usman
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Khalid Usman received the bachelor's degree from the National University of Computer and Emerging Sciences, Islamabad, Pakistan and the master's degree from the National University of Science and Technology, Islamabad, Pakistan. He is a PhD scholar with the Institute for Interdisciplinary Information Sciences (IIIS) in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), T...Show More
Khalid Usman received the bachelor's degree from the National University of Computer and Emerging Sciences, Islamabad, Pakistan and the master's degree from the National University of Science and Technology, Islamabad, Pakistan. He is a PhD scholar with the Institute for Interdisciplinary Information Sciences (IIIS) in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), T...View more
Author image of Fangping Wan
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
Fangping Wan received the PhD degree in computer science from Tsinghua University. He is a postdoctoral researcher in the University of Pennsylvania. His current research interests include machine learning, deep learning, bioinformatics and drug design.
Fangping Wan received the PhD degree in computer science from Tsinghua University. He is a postdoctoral researcher in the University of Pennsylvania. His current research interests include machine learning, deep learning, bioinformatics and drug design.View more
Author image of Dan Zhao
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Dan Zhao received the PhD degree in biochemistry and molecular biology from Peking University, in 2016. After two years of postdoctoral training advised by Prof. Haitao Li with the School of Medicine, Tsinghua University, she served as a Co-PI in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University. She is currently a research assistant professor with t...Show More
Dan Zhao received the PhD degree in biochemistry and molecular biology from Peking University, in 2016. After two years of postdoctoral training advised by Prof. Haitao Li with the School of Medicine, Tsinghua University, she served as a Co-PI in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University. She is currently a research assistant professor with t...View more
Author image of Jian Peng
Department of Computer Science, University of Illinois, Champaign, IL, USA
Jian Peng is an associate professor with the University of Illinois, Urbana-Champaign, where he conducts pivotal research in computational biology, machine learning, and cheminformatics. His work, which often explores complex biological systems and diseases, has been featured in prominent journals like PLOS Computational Biology and Nature Communications. Beyond research, he teaches bioinformatics and computational biolog...Show More
Jian Peng is an associate professor with the University of Illinois, Urbana-Champaign, where he conducts pivotal research in computational biology, machine learning, and cheminformatics. His work, which often explores complex biological systems and diseases, has been featured in prominent journals like PLOS Computational Biology and Nature Communications. Beyond research, he teaches bioinformatics and computational biolog...View more
Author image of Jianyang Zeng
School of Engineering, School of Life Sciences, Westlake University, Hangzhou, China
Jianyang (Michael) Zeng is a distinguished professor with Westlake University's School of Engineering and adjunct faculty in the School of Life Sciences. His academic journey includes a PhD degree from Duke University and prior tenure, Tsinghua University. Prof. Zeng's research, which has produced more than 80 publications, intersects computational biology with machine learning, focusing on data-driven life sciences. He h...Show More
Jianyang (Michael) Zeng is a distinguished professor with Westlake University's School of Engineering and adjunct faculty in the School of Life Sciences. His academic journey includes a PhD degree from Duke University and prior tenure, Tsinghua University. Prof. Zeng's research, which has produced more than 80 publications, intersects computational biology with machine learning, focusing on data-driven life sciences. He h...View more

Author image of Khalid Usman
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Khalid Usman received the bachelor's degree from the National University of Computer and Emerging Sciences, Islamabad, Pakistan and the master's degree from the National University of Science and Technology, Islamabad, Pakistan. He is a PhD scholar with the Institute for Interdisciplinary Information Sciences (IIIS) in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University.
Khalid Usman received the bachelor's degree from the National University of Computer and Emerging Sciences, Islamabad, Pakistan and the master's degree from the National University of Science and Technology, Islamabad, Pakistan. He is a PhD scholar with the Institute for Interdisciplinary Information Sciences (IIIS) in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University.View more
Author image of Fangping Wan
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
Fangping Wan received the PhD degree in computer science from Tsinghua University. He is a postdoctoral researcher in the University of Pennsylvania. His current research interests include machine learning, deep learning, bioinformatics and drug design.
Fangping Wan received the PhD degree in computer science from Tsinghua University. He is a postdoctoral researcher in the University of Pennsylvania. His current research interests include machine learning, deep learning, bioinformatics and drug design.View more
Author image of Dan Zhao
Institute of Interdisciplinary Information Science, Computer Science, Tsinghua University, Beijing, China
Dan Zhao received the PhD degree in biochemistry and molecular biology from Peking University, in 2016. After two years of postdoctoral training advised by Prof. Haitao Li with the School of Medicine, Tsinghua University, she served as a Co-PI in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University. She is currently a research assistant professor with the Institute for Interdisciplinary Information Sciences (IIIS) at Tsinghua University. Her current research lies with the intersection of artificial intelligence and biomedicine. To date, she has published more than 30 papers and was awarded the Beijing Natural Science Award (Natural Science track, Second Prize), in 2023, National Natural Science Foundation Grants (General Program, in 2022, Young Scholars Program, in 2020), and Postdoctoral Innovation Talents Support Program, in 2016, and the Outstanding Postdoctoral Fellowship, in 2016.
Dan Zhao received the PhD degree in biochemistry and molecular biology from Peking University, in 2016. After two years of postdoctoral training advised by Prof. Haitao Li with the School of Medicine, Tsinghua University, she served as a Co-PI in professor Jianyang Zeng's group (now a professor with the School of Engineering, Westlake University), Tsinghua University. She is currently a research assistant professor with the Institute for Interdisciplinary Information Sciences (IIIS) at Tsinghua University. Her current research lies with the intersection of artificial intelligence and biomedicine. To date, she has published more than 30 papers and was awarded the Beijing Natural Science Award (Natural Science track, Second Prize), in 2023, National Natural Science Foundation Grants (General Program, in 2022, Young Scholars Program, in 2020), and Postdoctoral Innovation Talents Support Program, in 2016, and the Outstanding Postdoctoral Fellowship, in 2016.View more
Author image of Jian Peng
Department of Computer Science, University of Illinois, Champaign, IL, USA
Jian Peng is an associate professor with the University of Illinois, Urbana-Champaign, where he conducts pivotal research in computational biology, machine learning, and cheminformatics. His work, which often explores complex biological systems and diseases, has been featured in prominent journals like PLOS Computational Biology and Nature Communications. Beyond research, he teaches bioinformatics and computational biology, mentoring students who have achieved prestigious roles in their respective fields.
Jian Peng is an associate professor with the University of Illinois, Urbana-Champaign, where he conducts pivotal research in computational biology, machine learning, and cheminformatics. His work, which often explores complex biological systems and diseases, has been featured in prominent journals like PLOS Computational Biology and Nature Communications. Beyond research, he teaches bioinformatics and computational biology, mentoring students who have achieved prestigious roles in their respective fields.View more
Author image of Jianyang Zeng
School of Engineering, School of Life Sciences, Westlake University, Hangzhou, China
Jianyang (Michael) Zeng is a distinguished professor with Westlake University's School of Engineering and adjunct faculty in the School of Life Sciences. His academic journey includes a PhD degree from Duke University and prior tenure, Tsinghua University. Prof. Zeng's research, which has produced more than 80 publications, intersects computational biology with machine learning, focusing on data-driven life sciences. He has earned accolades such as the 2023 XPLORER PRIZE and the 2021 National Science Fund for Distinguished Young Scholars, in China. Zeng serves on editorial and advisory boards, contributing to top academic journals and conferences.
Jianyang (Michael) Zeng is a distinguished professor with Westlake University's School of Engineering and adjunct faculty in the School of Life Sciences. His academic journey includes a PhD degree from Duke University and prior tenure, Tsinghua University. Prof. Zeng's research, which has produced more than 80 publications, intersects computational biology with machine learning, focusing on data-driven life sciences. He has earned accolades such as the 2023 XPLORER PRIZE and the 2021 National Science Fund for Distinguished Young Scholars, in China. Zeng serves on editorial and advisory boards, contributing to top academic journals and conferences.View more

Contact IEEE to Subscribe

References

References is not available for this document.