Cambridge, 24 July 2017 ... New research promoted at the European Conference for Computational Biology (ECCB) in Prague shows how the Repositive platform can be used to find and create an open access human genotype data collection, based on online available 23andMe data contributed to the public domain.
Manuel Corpas and Richard Shaw, from Repositive, the company that created the world’s largest portal for accessing human genomic research data, used publicly available data from 23andMe to create an Open Access dataset of genotype information for more than 2,280 individuals.
The preprint was published in bioRxiv, a public archive specialising in providing preprints for biology.
Manuel Corpas, Scientific Lead at Repositive, said:
“Currently, aggregated genome datasets with unrestricted public use are unusually rare, but this approach shows how through using the Repositive platform, it is possible to create an Open Access reference dataset for human genotypes which can be used for research, and most importantly, complies with all data consent.”
Fiona Nielsen, CEO of Repositive added:
“This research shows the potential of using the Repositive platform to make the available data sources from around the world easily findable and accessible. With this new reference data set based on publicly available 23andme data we have shown how data from direct-to-consumer genetic testing can be leveraged to create resources for further research.”
Unlocking the potiential of DTC genomic tests
Watch our second vlog on the 23andMe collection
CORRECTION: the total number of individuals represented in this collection is 2,280, not 2,400
21-25 July 2017: Repositive presents cutting-edge developments in Prague
23andMe Data Collection