Open Access News

News from the open access movement


Tuesday, May 19, 2009

Articulating principles for open data

A group of open data advocates, including Peter Murray-Rust, Cameron Neylon, and Rufus Pollock, recently met at the Panton Arms pub in Cambridge and articulated a set of principles for open data. Here's Neylon's version of what Murray-Rust calls the "Panton Principles":
Where a decision has been taken to publish data deriving from public science research, best practice to enable the re-use and re-purposing of that data, is to place it explicitly in the public domain via {one of a small set of protocols e.g. cc0 or PDDL}.
From Neylon's comments:

... [W]e focused on what we could agree on with the aim of seeing whether it was possible to find a common position statement on the limited area of best practice for the publication of data that arises from public science. I believe such a statement is important because there is a window of opportunity to influence funder positions. ...

The purpose of publishing public scientific data and collections of data ... is to enable re-use and re-purposing of that data. Non-commercial terms prevent this in an unpredictable and unhelpful way. Share-alike and copyleft provisions have the potential to do the same under some circumstances. ...

The advantage of this statement is that it focuses purely on what should be done once a decision to publish has been made, leaving the issue of what should be published to a separate policy statement. This also sidesteps issues of which data should not be made public. ...

From Murray-Rust's comments:

... Data itself must be completely free. The question is how to ensure that it is. ...

For us Data are born Open. The question is how to state that. ...

The biggest danger is not making the assertion that the data is Open. There may be second-order problems from CC0 or PPDL but they are nothing compared to the uncertainty of not making this simple assertion. ...

Also see John Wilbanks' comments.