Skip to main content

Open Access and content mining

We've previously blogged about the British Library Electronic Theses Online Service (EThOS) that stores theses metadata and, where possible, the full text of digitised theses. BL Labs now wants to explore EThOS metadata for content mining or analysis of trends and is currently inviting research questions that could be answered using this approach. The move follows on from its project providing data to Virginia Tech to develop algorithms for automated subject tagging of theses.

This is another example of an overlay project underpinned by large-scale data harvesting such as the successful Mechanical Curator project that released one million out of copyright images into the public domain for researcher use and re-use.

ChemSpider is an earlier project that brings together chemical structures from a variety of sources into a free database including data from St Andrews theses. This publishing platform provides opportunities to make good quality data public, re-use and preserve known compound data and related information to advance research, develop services and surface the data on the wider internet.

It's an exciting area of Open Access and Open data and there are likely to be further developments as efforts are made digitising older theses and other sources.


Image captured by the Mechanical Curator project.  No known copyright restrictions.


Comments

Popular posts from this blog

Untangling Academic Publishing: Scottish launch for OA Week

St Andrews University Library is delighted to host the Scottish Launch of Untangling Academic Publishing during Open Access Week - the event is open to all, discussion encouraged!

>Please contact libraryoffice@st-andrews.ac.uk if you wish to attend.

Untangling Academic Publishing: Launch and Discussion about the past and future of academic publishingA University Library event for Open Access Week

Tuesday 24 October, 16.00-18.30 - Arts Lecture Theatre (No.31 on the map)

Presentation: Professor Aileen Fyfe, School of History, lead author of the briefing paper ‘Untangling Academic Publishing’, will explain some of the biggest changes in academic publishing over the last 60 years.

Panel Discussion: the talk will be followed by a discussion of possible futures.
Professor Fyfe will be in conversation with Professor Stephen Curry,  Imperial College London and Professor Martin Kretschmer, University of Glasgow.

Presentation and panel discussion will be followed by a wine reception.



Untangling…

Your Open Access - statistics and usage

It's Open Access Week again, and this year the theme is 'Open in order to...' This year's theme is designed to shift discussion away from wider issues of 'openness', and instead direct attention to the tangible benefits of open access. This week we will be publishing a series of posts aimed at  highlighting some of these benefits. In this post we will look at some of the statistics we gather about the open access content in our Repository, and specifically the statistics that we've chosen to highlight in our new Infographic.
Given the theme of this year's Open Access Week, the subject of this post could be appropriately described as 'Open in order to boost downloads' For years we have been collecting usage statistics about the content held in our repository. Up until now this data has been collected and, for the most part, discussed internally; but not any more. Now we want to show the academic community here in St Andrews, whose work populates …

Knowledge Exchange on the costs of Open Access

The cost of Open Access isn't a late-breaking field. In 2014 a cost of £9.2m for UK research organisations to achieve RCUK Open Access compliance was quoted [1]. This is in addition to the millions paid to publishers for article processing charges.  Because the market in scholarly publications is constantly adapting and costs for Open Access and library journal subscriptions are inexorably rising, it's incumbent on institutions to monitor not just the cost of the product, but the cost of managing it.  Open Access and open data have been identified as strategic for Librarians and university senior management [2].


The Knowledge Exchange partnership works at an international level to develop the infrastructure of open scholarship and promote common standards.  It regularly publishes reports on its activities. Its consensus report on monitoring Open Access publications and cost data published April last year makes recommendations based on the work and feedback from stakeholders at…