In a groundbreaking move for the scientific community, the Chan Zuckerberg Initiative (CZI) has launched the billion cells project, an ambitious effort to generate an unprecedented dataset of one billion single cells. This massive dataset will fuel advancements in AI-driven biological research, enabling scientists to gain deeper insights into cellular behavior and gene function.
A collaboration to transform biology
CZI has partnered with 10x Genomics, Ultima Genomics, and leading researchers to make this vision a reality. By leveraging cutting-edge technologies and optimizing cost efficiencies, this initiative aims to provide a comprehensive and standardized resource for researchers worldwide. The scale and quality of data generated will surpass previous efforts, facilitating the development of AI models that can drive transformative discoveries in precision medicine, functional genomics, and disease research.
Why does this matter?
The billion cells project is poised to impact multiple domains of biology by:
- Mapping genetic perturbations across diverse cell types and tissues.
- Enabling AI models to uncover hidden patterns in cellular behavior.
- Accelerating research in disease mechanisms and potential therapeutic targets.
- Standardizing large-scale single-cell datasets for global accessibility.
“Biology not only needs more data – it needs data faster and in interoperable formats to support AI models that address specific problems,” said Jonah Cool, Cell Science Senior Science Program Officer at CZI. “This project represents a unique approach to scaling and standardizing scientific outputs for AI and more.”
Leveraging state-of-the-art technologies
The project will utilize 10x Genomics’ Chromium GEM-X technology, which delivers higher sensitivity, improved throughput, and superior data quality for large-scale single-cell analysis. Sequencing will be conducted using Ultima Genomics’ UG 100™ platform, an ultra-high throughput next-generation sequencing system designed to enable massive-scale genomic data generation at lower costs.
By integrating these advanced technologies, the initiative will generate datasets with unmatched depth, resolution, and accuracy, offering researchers new opportunities to explore gene regulation, immune response, and the fundamental mechanisms driving human biology.
Open science for global impact
CZI remains committed to the principles of open science, ensuring that the billion cells project’s results will be made freely available to the global scientific community. The generated data will be used to train virtual cell models on CZI’s powerful computing infrastructure, enhancing the ability of researchers worldwide to unlock new discoveries in human health and disease.
“This project will provide the necessary scale of data to understand the functional effects of human genetic variants and characterize the genetic drivers of disease,” said Alexander Marson, Director of the Gladstone-UCSF Institute of Genomic Immunology. “Ultimately, the billion cells project will serve as a functional roadmap for drug development, identifying key targets to restore diseased cells to health.”
The future of AI-powered biology
The billion cells project represents a significant leap forward in the convergence of artificial intelligence and biology. As additional industry and academic collaborations emerge, this initiative will continue to expand the possibilities for single-cell research, precision medicine, and functional genomics.
With the power of AI, scalable genomics, and open-access data, the billion cells project is set to redefine the boundaries of what’s possible in life sciences research. The future of biology is data-driven—and this project is paving the way for groundbreaking discoveries.
Stay tuned for more updates as this transformative initiative unfolds!