For some time the BMJ has been watching other journals’ efforts to encourage authors to make raw research data available. Now we are taking part too, by asking authors to include a data sharing statement at the end of each original research article. The statement will explain which additional data—if any—are available, to whom, and how. Those data could range from additional explanatory material to the complete dataset. People allowed access to the data might range from fellow researchers to everyone. And data might be available only on request, accessible online with a password, or openly accessible to all on the web with a link on bmj.com.
We understand that many authors wish to guard data until they have published all their own papers, and we know that data sharing is hard to do. But we hope that authors will, increasingly, set the data free, perhaps after a set period of personal use.
Data sharing means more than the open access publication of articles and the posting in online registries of study protocols and main results. Sharing allows other researchers—and perhaps scientists, clinicians, and patients—access to raw numbers, analyses, facts, ideas, and images that do not make it into published articles and registries. At its fullest extent, data sharing means free access for everyone. Many people would call this a moral obligation because most research is publicly funded and involves the public as participants. Other potential benefits include quicker scientific discovery and learning, better understanding of research methods and results, more transparency about the quality of research, and greater ability to confirm or refute research through replication.
Such sharing raises important questions about who owns the data, who gives permission to release the data..., where and how the data should be stored..., who should have access and when, and what limits may be needed to prevent misuse and mishandling of data. Yet, despite these and other complexities, the movement to free the world’s vast swathes of untapped research data is gathering speed....
Researchers lack the incentives and the means to analyse all the data that they generate, to manage data after funded projects have ended, and to share data other than informally with certain collaborators. A recent UKRDS study on the logistics and costs of developing and maintaining a national shared digital research data service concluded that such a service is feasible and worth funding, and that it could greatly increase UK universities’ potential for research and innovation and their global competitiveness. Meanwhile, many other countries are already on the case....
Data sharing is hardly a new idea. Physicists, environmentalists, and researchers in the basic biomedical sciences have been doing this for years. Funders such as the UK Medical Research Council, US National Institutes of Health, and Wellcome Trust already mandate sharing of data from research in basic science and genetics. The US National Heart, Lung, and Blood Institute has opened up to researchers worldwide its collection of genetic and clinical data from three asthma research networks and the Framingham Heart Study. Even GlaxoSmithKline has opened up its "patent pool" so that data relevant to finding drugs for neglected diseases can be explored by other researchers.
Numerous science journals mandate data sharing too. For example, a condition of publication in a Nature journal is that "authors are required to make materials, data and associated protocols promptly available to others without preconditions....
For most medical journals, however, sharing of clinical research data is a new and difficult concept. Last week the editors of the open access BioMed Central journal Trials spelled out the main ethical and editorial barriers to data sharing in medicine and, partly drawing on discussions with scientists and other editors (including TG), proposed some solutions (box). The maintenance of patient confidentiality is a major challenge....
Since 2007, the Annals of Internal Medicine has been asking authors to make a "reproducible research statement" at the end of each research paper. Authors state whether and within what limits they will share the original study protocol, the dataset used for the analysis, and the computer code used to produce the results. We gladly acknowledge that we are emulating this policy in introducing data sharing statements for BMJ research articles and bringing data sharing to authors’ and readers’ attention. We hope authors and readers will now join us in this debate and will help journals to set data free.
Peter Suber at 4/04/2009 04:12:00 PM.
The open access movement:
Putting peer-reviewed scientific and scholarly literature
on the internet. Making it available free of charge and
free of most copyright and licensing restrictions.
Removing the barriers to serious research.