Overture: an open-source genomics data platform

被引:0
作者
Shiell, Mitchell [1 ]
Bajari, Rosi [1 ]
Andric, Dusan [1 ]
Eubank, Jon [1 ]
Chan, Brandon F. [1 ]
Richardsson, Anders J. [1 ]
Ali, Azher [1 ]
Allabadi, Bashar [1 ]
Alturmessov, Yelizar [1 ]
Baker, Jared [1 ]
Catton, Ann [1 ]
Cullion, Kim [1 ]
DeMaria, Daniel [1 ]
Dos Santos, Patrick [1 ]
Feher, Henrich [1 ]
Gerthoffert, Francois [1 ]
Ha, Minh [1 ]
Haw, Robin A. [1 ]
Kachru, Atul [1 ]
Lepsa, Alexandru [1 ]
Li, Alexis [1 ]
Mistry, Rakesh N. [1 ]
Nahal-Bose, Hardeep K. [1 ]
Pejovic, Aleksandra [1 ]
Rich, Samantha [1 ]
Rivera, Leonardo [1 ]
Schuette, Ciaran [1 ]
Su, Edmund [1 ]
Tisma, Robert [1 ]
Uddin, Jaser [1 ]
Wang, Chang [1 ]
Wilmer, Alex N. [1 ]
Xiang, Linda [1 ]
Zhang, Junjun [1 ]
Stein, Lincoln D. [1 ,2 ]
Ferretti, Vincent [1 ,3 ]
Courtot, Melanie [1 ,4 ]
Yung, Christina K. [1 ]
机构
[1] Ontario Inst Canc Res OICR, Toronto, ON M5G 1M1, Canada
[2] Univ Toronto, Dept Mol Genet, Toronto, ON M5S 3K3, Canada
[3] Univ Montreal, Res Ctr CHU Sainte Justine, Montreal, PQ H3T 1C5, Canada
[4] Univ Toronto, Dept Med Biophys, Toronto, ON M5G 2C4, Canada
基金
加拿大创新基金会; 加拿大健康研究院;
关键词
research software; data management; genomics; open-source; open-science;
D O I
10.1093/gigascience/giaf038
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Next-generation sequencing has created many new technological challenges in organizing and distributing genomics datasets, which now can routinely reach petabyte scales. Coupled with data-hungry artificial intelligence and machine learning applications, findable, accessible, interoperable, and reusable genomics datasets have never been more valuable. While major archives like the Genomics Data Commons, Sequence Reads Archive, and European Genome-Phenome Archive have improved researchers' ability to share and reuse data, and general-purpose repositories such as Zenodo and Figshare provide valuable platforms for research data publication, the diversity of genomics research precludes any one-size-fits-all approach. In many cases, bespoke solutions are required, and despite funding agencies and journals increasingly mandating reusable data practices, researchers still lack the technical support needed to meet the multifaceted challenges of data reuse.Findings Overture bridges this gap by providing open-source software for building and deploying customizable genomics data platforms. Its architecture consists of modular microservices, each of which is generalized with narrow responsibilities that together combine to create complete data management systems. These systems enable researchers to organize, share, and explore their genomics data at any scale. Through Overture, researchers can connect their data to both humans and machines, fostering reproducibility and enabling new insights through controlled data sharing and reuse.Conclusions By making these tools freely available, we can accelerate the development of reliable genomic data management across the research community quickly, flexibly, and at multiple scales. Overture is an open-source project licensed under AGPLv3.0 with all source code publicly available from https://github.com/overture-stack and documentation on development, deployment, and usage available from www.overture.bio.
引用
收藏
页数:10
相关论文
共 48 条
[1]   TIGER: The gene expression regulatory variation landscape of human pancreatic islets [J].
Alonso, Lorena ;
Piron, Anthony ;
Moran, Ignasi ;
Guindo-Martinez, Marta ;
Bonas-Guarch, Silvia ;
Atla, Goutham ;
Miguel-Escalada, Irene ;
Royo, Romina ;
Puiggros, Montserrat ;
Garcia-Hurtado, Xavier ;
Suleiman, Mara ;
Marselli, Lorella ;
Esguerra, Jonathan L. S. ;
Turatsinze, Jean-Valery ;
Torres, Jason M. ;
Nylander, Vibe ;
Chen, Ji ;
Eliasson, Lena ;
Defrance, Matthieu ;
Amela, Ramon ;
Mulder, Hindrik ;
Gloyn, Anna L. ;
Groop, Leif ;
Marchetti, Piero ;
Eizirik, Decio L. ;
Ferrer, Jorge ;
Mercader, Josep M. ;
Cnop, Miriam ;
Torrents, David .
CELL REPORTS, 2021, 37 (02)
[2]  
[Anonymous], Terraform
[3]  
[Anonymous], Genomic Data Commons
[4]  
[Anonymous], European Genome
[5]  
[Anonymous], Helm
[6]  
[Anonymous], 2020, The EU General Data Protection Regulation (GDPR): a commentary, DOI [10.1093/oso/9780198826491.001.0001, DOI 10.1093/OSO/9780198826491.001.0001]
[7]  
[Anonymous], 2020, About us
[8]  
[Anonymous], African Pathogen Data Sharing and Archive Platform
[9]   Responsible, practical genomic data sharing that accelerates research [J].
Byrd, James Brian ;
Greene, Anna C. ;
Prasad, Deepashree Venkatesh ;
Jiang, Xiaoqian ;
Greene, Casey S. .
NATURE REVIEWS GENETICS, 2020, 21 (10) :615-629
[10]   A pan-African pathogen genomics data sharing platform to support disease outbreaks [J].
Christoffels, Alan ;
Mboowa, Gerald ;
van Heusden, Peter ;
Makhubela, Sello ;
Githinji, George ;
Mwangi, Sarah ;
Onywera, Harris ;
Nnaemeka, Ndodo ;
Amoako, Daniel Gyamfi ;
Olawoye, Idowu ;
Diallo, Amadou ;
Mbala-Kingebeni, Placide ;
Oyola, Samuel O. ;
Adu, Bright ;
Mvelase, Christopher ;
Ondoa, Pascale ;
Dratibi, Fred Athanasius ;
Sow, Abdourahmane ;
Gumede, Nicksy ;
Tessema, Sofonias K. ;
Ouma, Ahmed Ogwell ;
Tebeje, Yenew Kebede .
NATURE MEDICINE, 2023, 29 (05) :1052-1055