This blog follows on from The discover.repositive.io launch - one year on (Part 1)
To celebrate 1 year of discover.repositive.io, I wanted to take a look at what is trending on the Repositive platform since launch... What are the most searched terms? What are the most viewed datasets? The most popular collections? What success stories have we been privileged to witness? All very good questions...
Let's dive in!
Having 52 sources on the platform gives our users a much more diverse portal to query. At the start, we indexed the most popular repositories like SRA, GEO and Array Express, but it became clear that some users want something very specific and there is a lot of value in some of the smaller, more niche repositories.
So, are people exploring new sources? The small and the niche? Or are they sticking with what they know?
1. Sequence Read Archive (SRA) - with 101 unique views
2. Gene Expression Omnibus (GEO) - with 59 unique views
3. Personal Genome Project - with 55 unique views
Although it is disappointing to see that the top 3 most viewed data sources are some of the most well known, it is not that surprising. Before our users fully adopt Repositive, they will want to explore what they already know. When you go to any social media site or online listing, your first instinct is to think 'oh, I wonder if so-and-so is there', expecting to see them. Finding that person there is a test, something to give you reassurance. If they are not there, you would think differently. That is one of the reasons why getting all and up-to-date data is a priority for Repositive. It's about giving you what you can already get, and then more!
Users with viewers
Now, I had to take quite a few Repositive employees out of this list. When users ask questions on data, it historically has been us who try to help them. We believe if we do, the next user will pay it forward. Chain reactions are a powerful thing. Disappointingly, I didn't appear with my colleagues, but to drown my sorrows I looked at the users who have had more views than others. And why?
Stephen Bradley - with 32 unique views
Sam Smits - with 10 unique views
Shamith Samarajiwa - with 9 unique views
Stephen Bradley came out top, with 32 unique views. Although dashing as the frog avatar, the more likely reason is his 3 registered datasets. Not only have these registered datasets given Stephen 36 reputation points, but the datasets themselves have appeared on the discover home page, and collectively have received 140 views. At Repositive HQ, we thought "who is this amazing guy registering datasets?", and I am sure many of our users were just as curious.
Although Sam Smits has fewer reputation points, he has added himself as a contributor to one of the most looked at data sets; 1000 Genomes Project. So users do not need to register data sets to contribute and get some added visibility, users can help improve metadata, add tags, answer questions and much more.
The Best Request
HCC1143 cell-line - 44 unique views
Data requests are still taking some effort to get off the ground, but our team are always looking to help our users. We see requests getting views as other users explore the platform, but it is usually Repositive who comment and begin to provide information. Rest assured that if you make a request, our team are looking at how we can help. We hope that by helping our users, they will in return pay if forward.
When Ines de Santiago took to Twitter and told the world she found a new dataset for her research! #awesome
When we met Edoardo Giacopuzzi at ESHG 2017 and he wrote a blog for Repositive about his data eureka moment finding 40 WES and 1,600 WGS, all open access datasets for "NA12878 reference sample".
When Arindam Basu found a data source he never knew existed and allowed us to publicise this rather amazing testimonial:
“ I greatly appreciate the flexibility and ease with which Repositive.io enables search for gene and phenotype data and make both types of data accessible. Further, Repositive also makes it easy to access restricted data and even connect to teams who share data. I recently used Repositive site to access the Chinese Kadoori Project dataset. This not only enabled me to access their data sets and learn their protocols of accessing the data, it also made it easy for me to apply for and establish mutual collaboration with the researchers. I would not have known of this opportunity had I not used Repositive.io. Best, Arin."
To boldly go where no researcher has gone before
One year ago, we saw the stars align. A lot of hard work went into launching out of beta testing. We had lots of new data, features and PR. We saw a massive spike in referrals and web visits. It was a chance to look back and marvel at our achievements. But product launches are like solar eclipses, they don't happen too often! So what is next for Repositive?
The USA may have won the moon race! But they are not the only ones to do it. As I am sure we are not the only ones to address the data access bottleneck. But we've had a massive headstart. We launched and now have seen the stars and learned about our strengths and weaknesses. We see people searching for data, but often struggling to find data or unlocking the power of search. We see users who are actively contributing to the core platform getting more profile views, but not everyone is catching on yet. We see people checking the sources they know of, but not searching something new.
But this is just the beginning! Did NASA stop there? No! They went back to the drawing board. They iterated. They innovated. They improved. They also brought back mystery from space, they captured imaginations and they inspired the next generation of explorers. That's what we are doing. We are improving our rocket! We hope to inspire researchers to the possibilities of efficient and ethical access to genomic data.
"That's one small step for man, one giant leap for mankind."
Repositive Launches Full Version of its Platform
The discover.repositive.io launch - one year on (Part 1)