Open Access News

News from the open access movement


Monday, March 16, 2009

Automated identification of copyright status of a work

Rufus Pollock, Computing Copyright (or Public Domain) Status of Cultural Works, miscellaneous factZ, March 12, 2009.

I’m working on an EU-funded project to look at the size and value of the Public Domain. This involves getting large datasets about cultural material and trying to answer questions like: How many of these items are in the public domain? What’s the difference in price and availability of public domain versus non-public-domain items? ...

Suppose we have data on cultural items such as books and recordings. For a given item we wish to:

  1. Identify the underlying work(s) that item contains.
  2. Identify the copyright status of that work, in particular whether it is in the Public Domain (PD).

Putting 1 and 2 together allows us to assign a ‘copyright status’ to a given item. ...

[D]etermining copyright status is, in theory, simple:

  1. Given information on an item, match it to a work (or works).
  2. For each work, obtain relevant information such as the date the work was first published (as an item) and the death dates of the author(s).
  3. Compute copyright status based on the copyright laws for your jurisdiction.

While copyright law is not always simple, step three is generally fairly straightforward ...
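To make step three concrete, here is a minimal sketch in Python. It is not taken from Pollock's post: it assumes a simplified "life plus 70 years" rule with protection running to the end of the calendar year, and it ignores publication-date rules, anonymous works, and jurisdiction-specific exceptions. The names `Work`, `is_public_domain`, and `item_status` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Work:
    title: str
    # One entry per author; None means the death year is unknown
    # (or the author is still living).
    author_death_years: List[Optional[int]]


def is_public_domain(work: Work, current_year: int,
                     term_years: int = 70) -> Optional[bool]:
    """Return True/False, or None when the status cannot be determined.

    ASSUMPTION: a simplified life-plus-term rule, not any real statute.
    """
    if not work.author_death_years:
        return None
    if any(year is None for year in work.author_death_years):
        return None
    # The term is counted from the death of the last surviving author.
    last_death = max(work.author_death_years)
    # Protection is assumed to last through the end of the year
    # last_death + term_years, so the work is PD from the following year.
    return current_year > last_death + term_years


def item_status(works: List[Work], current_year: int) -> Optional[bool]:
    """An item is treated as PD only if every work it contains is PD."""
    statuses = [is_public_domain(w, current_year) for w in works]
    if any(s is False for s in statuses):
        return False
    if any(s is None for s in statuses):
        return None
    return True
```

Returning `None` rather than guessing keeps undetermined cases visible, which matters when the inputs from steps 1 and 2 are incomplete.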

What is not so straightforward are the first two steps, especially step 1. This is because most datasets give only a limited amount of information on the items they contain. ...
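As an illustration of why step 1 is harder, here is a hedged sketch of matching an item to a work by title alone, using fuzzy string similarity from Python's standard `difflib` module. This is one possible approach, not the method used in the project; the function names and the 0.8 threshold are assumptions chosen for the example.

```python
import difflib
import re
from typing import Iterable, Optional


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    text = text.lower()
    text = re.sub(r"[^a-z0-9 ]+", " ", text)
    return " ".join(text.split())


def best_match(item_title: str, work_titles: Iterable[str],
               threshold: float = 0.8) -> Optional[str]:
    """Return the most similar work title, or None if nothing is close.

    ASSUMPTION: the title is the only field available; the threshold is
    an arbitrary illustrative value, not a tuned one.
    """
    target = normalize(item_title)
    best, best_score = None, 0.0
    for title in work_titles:
        score = difflib.SequenceMatcher(
            None, target, normalize(title)).ratio()
        if score > best_score:
            best, best_score = title, score
    return best if best_score >= threshold else None
```

With only a title to go on, different works can share a name and the same work can appear under several titles, so a real pipeline would likely also need author names, dates, or identifiers where the dataset provides them.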