Showing posts with label institutional identifiers. Show all posts
Showing posts with label institutional identifiers. Show all posts

Tuesday, 13 October 2015

Standard Identifiers, Metrics and Processes in Journal Publishing: Mark Hester asks 'Aren't they a bit...dull?'

Why should we use standards? Identifiers, transaction processes, schemas, metrics and many other things in scholarly publishing have standards, or are developing them. Isn’t this a rather arduous and bureaucratic way of handling things? Are these things really there to make life easier or just another way of overcomplicating an already complex market, taking time away from the efforts of actually producing high quality content?

Here Mark Hester of Aries Systems delves into why we should care.

Aren’t standards a bit….dull?'


Standards? Just a bunch of numbers, right? With tedious documentation on how and where to use them? Why would I bother with those?

It’s not hard to see why you might think that, but also easy to see how this is misguided. Jumping straight into a document to read about standards is a little bit like reading the telephone directory when you have no intention of calling someone, or leafing through a Haynes manual when you’re not repairing a car.

An example of a standard from outside publishing might help – EAN-13. What is EAN-13 you might ask? You see examples of it daily – it is the standard for the barcodes we see on everything we buy in the supermarket. Retail staff don’t need to know how EAN-13 works, it is unlikely that they’ve read documentation on it, but they are all grateful that it does work when checking stocks, pricing items and working on the till and, in turn, so are their customers.

So I ignore standards: what’s the worst that can happen?


When I was a student in the early nineties, the departmental librarian had been using his own classification system for many years. Back then, it didn’t matter much – students got used to its quirks, visitors from other departments were rare, from other universities much rarer still. The people using the service understood it, and that was enough.

Imagine taking this approach in the online world - it would mean that your content would be less discoverable and also less usable. Online library catalogues wouldn’t work if everyone took the librarian from my alma mater’s approach! Not using DOIs means frustration for researchers who can’t click on the references and go straight to the articles, and a simple change to a URL means a broken link. If your content isn’t seen it affects your reputation, and in the case of a commercial publisher, your profits.

The benefit of standards will only increase as the ‘digital natives’ used to touch screen technology enter academia and the workplace – having to click more than once or search for more than a minute will lead them to go elsewhere.

How can standards enhance my working life and be good for my organization?


Rapid changes in scholarly publishing means that new applications are found for standards once they are in place. Adopting standards can ‘future proof’ your content and processes against changes that occur in the future.

A great example of this is the relentless adoption of gold open access. The publishing standards which enable Copyright Clearance Center’s RightsLink for OA to display different article processing charge policies to different users on the fly developed separately from one another – Ringgold for institutions, ORCID for identifying authors, and FundRef for funder identification. Brought together, however, their machine readability allows flexible APC pricing models and automated billing and payment processing, making life easier and saving time and money for both publishers and institutions.

The advantages can be psychological as well as practical – if authors, researchers and librarians see the ORCID or CrossRef logos displayed on your website, they will know that your organization is a serious player, one which will help them, one they can trust.

So what's next?


By now, I hope I’ve convinced you of the importance of standards. But if the prospect of researching the topic still fills you with a sense of dread, there's an upcoming seminar from ALPSP I'm helping to coordinate called Setting the Standard. It's being held in London on Wednesday 11 November and includes speakers from CrossRef, Ringgold, ORCID, COUNTER, Thomson Reuters, EDItEUR, Jisc and an institution. Everything you ever wanted to know about standards, but were too scared to ask.

I hope to see you there.

Wednesday, 22 January 2014

Laura Cox on institutional identifiers internally and throughout the supply chain

Laura Cox: do a data health check

Laura Cox, Chief Financial and Operating Officer at Ringgold kicked off her talk with a focus on what constitutes healthy data. Good quality, reliable and consistent data helps make good decisions. You gain insight into customers and business relationships as well as support strategic planning, decision making and ongoing business operations.

Poor data has real consequences. It is hard to get a true picture of relationships with institutions, can lead to a lack of quality author (and affiliation) data and an inability to see overlap between authors, members and customers. It can drive inaccurate holdings and revenue reports leading to protracted time and effort which can cost your business time and money. Healthy records are complete, accurate, free of duplications, current, consistent and conform with standard identifiers. 

What are unique identifiers and how can they help? 

They are numeric or alpha-numeric designations which are associated with a single entity. Entities can be institutions, persons or pieces of content. They enable the disambiguation of each entity and provide a proper understanding of customer, author, reader or institution as well as a proper identification of content object, article, product or package. They can also be used internally or in conjunction with external partners.

Why should we worry about data now? 

Cox cited the 2012 STM Report (Ware, M and Mabe, M. The STM Report, 2012) which stated that the number of researchers and the number of article are both increasing by 3% per annum. The number of journals is increasing by 3.5% per annum and growth in China has been in double digits for over 15 years. At the same time there is increased demand for anytime/anywhere access while library budgets are frozen or being cut, less money for more content.

Institutional Identifiers can be used for disambiguation (e.g. which UCL?), consolidating different versions (many ways of describing the University of Oxford and its institutions). They provide a hierarchy view (institute within an institution) and reinforce uniqueness. This means you can use them for a gap analysis. 

The Kafka-esque 'Identifiers identified' slide
The main challenge is around multiple data sources. There are system data silos, multiple locations - geographic data silos, data entered by different people for different purposes, data from 3rd parties in the supply chain and data from bought in sources. These things aren’t integrated. Typical publisher systems include financial, CRM or sales databases, authentication system, fulfilment, usage statistics, submissions systems and so on.

Cox advised that the first thing to do is think about your data and implement a data governance plan. What data is held, where and how is it accessed? How can it be used to benefit business and work across silos? But always bear in mind where are you now and where do you want to go?

Another recommendation was to improve data capture. If you can, use web forms as they minimise variance in data input. Implement required fields, use date validation and at a minimum use naming conventions. There are a number of tools such as address validation, postcode look-up, institution validation/lookup. Avoid free-text fields and make institutional identifiers a requirement.

You can use an institutional identifier as a lynch pin to link internal systems for better data integration. It can prevent duplicate account creations, help keep data up-to-date and systems synchronized. It also enables staff to use data more effectively, break down silos, simplify data transmission and provide more insight and power to analyse and understand the business.

So what can you do now?
  • Engage with the problems
  • Think about resources. Time? Money? Systems?
  • How do you want it to work – look at priorities
  • Have a data governance policy
  • Appoint a data champion and document everything
  • Create some basic rules for data entry
  • Use universal identifiers to clean and link your data
  • Work with suppliers and customers to use institutional identifiers ot strengthen the supply chain.