Horizontal abstract line or gradient strip.

Rapid Release: BICAN-2024-10-RR-SCO

Release

  • Release Title: Rapid Release Inventory
  • Release Date: Oct. 1, 2024
  • Release Version: BICAN-2024-10-RR-SCO

About

The BICAN is committed to rapid data sharing to increase the accessibility and impact of the data generated by constituent members. We have enabled sharing of data within a calendar quarter of its generation, utilizing a federated pipeline for metadata collection, sequencing, and data processing. The initial Rapid Release Inventory provides early search and access capabilities for single cell transcriptomic and epigenomic data.

This release features three core components:

  1. Data ingest pipelines integrated with specimen/sequencing data management at NIMP & NeMO archive.
  2. Dedicated BICAN program page in the Data Catalog with dashboard overview of BICAN data ecosystem
  3. BICAN rapid release project page and specimen viewer

The first public release of data from the BRAIN Initiative Cell Atlas Network (BICAN) is now available in the Data Catalog. BICAN is a continuation of the BRAIN Initiative Cell Census Network (BICCN) and features unprecedented diversity in cross-species specimens and data.

Release Organization

The first Rapid Release Inventory project includes single cell transcriptomic and epigenomic data generated by participating BICAN awardees. The project-level organization provides search across common features of all single cell data. Data is further packaged into 20 “collections” of files that are available individually from the Neuroscience Multiomic Archive (NeMO); each collection is grouped based upon technique, species, grant and participating laboratory.

Navigating the Release

A new dedicated BICAN program page serves as the visual entry point to BICAN data within the Data Catalog. It features a general description of the consortium effort and goals and interactive dashboards that give an overview of the BICAN data ecosystem, as well as featured projects and links to other highlighted web resources.

Scientists can explore data from 6 labs, totalling 255 donors, 1461 specimens(library aliquots), and 20 data collections. They are accessible via a dedicated project page and specimen browser. The latter features a multi-level viewing experience broken down by library aliquot or donor. Scientists can download specimen metadata and file manifests that match the filters selected in the user interface.

General Licensing and Usage Guidelines

Data are provided under different licenses. Please check the Data Catalog collection descriptions and the README file available for download alongside specimen and file manifests for the applicable license for each dataset. Generally non-human datasets are made available under a CC-BY-4.0 license. Human data derived from tissue consented for open access are provided under BICAN-BY-NR. The initial release does not include controlled access human data.

When data is reused, please provide attribution to the data generators by citing the data citation. Data citations can be found with the collection at the NeMO archive or in the Data Catalog collection description.

Release Documentation

Contributors

Data in the rapid release were generated by multiple laboratories as part of the BRAIN Initiative Cell Atlas Network, including developmental mouse data from the Allen Institute for Brain Science (Zeng) & University of California, San Francisco (Nowakowski), cross-species data from the Allen Institute for Brain Science (Lein), marmoset data from Princeton University (Krienen), as well as multi-modal human data from Broad Institute (McCarroll) & Salk Institute for Biological Studies (Ecker). The NIH grant awards contributing to data in this initial release are shown below.

Award Principal Investigators Title
U01MH130962 Paola Arlotta, Tomasz Nowakowski, Hongkui Zeng Comprehensive single-cell atlas of the developing mouse brain
UM1MH130966 Steven McCarroll, Evan Macosko An Atlas of Human Brain Cell Variation
UM1MH130994 Joseph Ecker, Margarita Behrens, Bing Ren, Ting Wang, Xiangmin Xu Center for Multiomic Human Brain Cell Atlas
UM1MH130981 Ed Lein, Hongkui Zeng Functionally guided adult whole brain cell atlas in human and NHP
UM1MH130991 Arnold Kriegstein, Aparna Bhaduri, Hao Huang, Jon Levin, Tomasz Nowakowski, Alexander Pollen, Nenad Sestan A Multidisciplinary Center for Developing Human and Non-human Primate Brain Cell Atlases
U01MH130995 Chongyuan Luo Spatiotemporal epigenomic and chromosomal architectural cell atlas of developing human brains
U01MH130907 Anton Arkhipov, Marina Garrett Bridging Function, Connectivity, and Transcriptomics of Mouse Cortical Neurons

The data ecosystem that supports the initial product release includes three platforms: Brain Knowledge Platform (BKP; the brain science accelerator at Allen Institute), the Neuroanatomy-anchored Information Management Platform for Collaborative BICAN Data Generation (NIMP, RRID:SCR_024684; The University of Texas Health Science Center at Houston) and the fastq file storage at NeMO archive. These integrated platforms will continue to enable quarterly cross-consortium data releases going forward.

Metadata and resource identifiers (ID) for specimens and sequencing data are captured, managed, and cross-linked through the Neuroanatomy-anchored Information Management Platform (NIMP, RRID:SCR_024684) for Collaborative BICAN Data Generation, codifying critical BICAN data standards and standard operating processes to ensure trackable experimental workflow and data integrity for down-stream data archives of the entire BICAN consortium.

Single cell omics data processing pipelines were developed by the Broad Data Sciences Platform in partnership with the BICAN community. Pipelines are available on GitHub (RRID:SCR_002630), Dockstore, and the cloud workbench Terra (RRID:SCR_021648) Data operations (including data ingestion, storage, and release) are performed by the Neuroscience Multi-omic (NeMO) Archive team at the University of Maryland.

The following NIH Awards provided infrastructure support for the BICAN Data Ecosystem and initial Rapid Release.

Award Principal Investigators Title
U24MH130919 Michael Hawrylycz, Carol Thompson A Community Resource for Single Cell Data in the Brain
U24MH130918 Shoaib Mufti, Satrajit Ghosh, Michael Hawrylycz, Lydia Ng An extensible brain knowledge base and toolset spanning modalities for multi-species data-driven cell types
U24MH130968 Timothy Tickle, Jesse Gillis, Owen White Scalable Molecular Pipelines for FAIR and Reusable BICAN Molecular Data
U24MH130988 Guo-Qiang Zhang, Hua Xu, Wenjin Jim Zheng Engagement and outreach to achieve a FAIR data ecosystem for the BICAN
R24MH114788 Owen White A BRAIN Initiative resource: The neuroscience multi-omic data archive