{"id":40,"date":"2016-08-04T10:15:35","date_gmt":"2016-08-04T09:15:35","guid":{"rendered":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/?p=40"},"modified":"2018-09-18T11:31:12","modified_gmt":"2018-09-18T10:31:12","slug":"the-long-tail","status":"publish","type":"post","link":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/2016\/08\/04\/the-long-tail\/","title":{"rendered":"The long tail"},"content":{"rendered":"<figure id=\"attachment_42\" aria-describedby=\"caption-attachment-42\" style=\"width: 515px\" class=\"wp-caption aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"42\" data-permalink=\"http:\/\/wp.lancs.ac.uk\/highly-relevant\/2016\/08\/04\/the-long-tail\/pexels-photo-1\/\" data-orig-file=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/pexels-photo-1.jpg?fit=640%2C426\" data-orig-size=\"640,426\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"pexels-photo (1)\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Source: http:\/\/negativespace.co\/photos\/coding\/ CC0&lt;\/p&gt;\n\" data-large-file=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/pexels-photo-1.jpg?fit=640%2C426\" class=\"wp-image-42 \" src=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/pexels-photo-1.jpg?resize=515%2C343\" alt=\"Source: http:\/\/negativespace.co\/photos\/coding\/ CC0\" width=\"515\" height=\"343\" srcset=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/pexels-photo-1.jpg?resize=300%2C200 300w, https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/pexels-photo-1.jpg?w=640 640w\" sizes=\"auto, (max-width: 515px) 100vw, 515px\" \/><figcaption id=\"caption-attachment-42\" class=\"wp-caption-text\">Source: http:\/\/negativespace.co\/photos\/coding\/ (CC0)<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p>We have been doing some thinking around how to improve the research data management services we offer here at Lancaster.\u00a0 We&#8217;re keen to move away from the idea of the role of research data management as purely for compliance purposes &#8211; we want to really push the idea of open data and data reuse and develop the idea that the research data produced by the university are valuable assets. We know that researchers at the university are working on interesting, valuable and important work.\u00a0 Look at Derek Gatherer&#8217;s work on the\u00a0<a href=\"https:\/\/theconversation.com\/zika-a-rare-benign-virus-suddenly-turns-nasty-and-heads-for-the-us-52792\" target=\"_blank\" rel=\"noopener\">Zika virus <\/a>or Maggie Mort&#8217;s project looking at <a href=\"http:\/\/wp.lancs.ac.uk\/cyp-floodrecovery\/\" target=\"_blank\" rel=\"noopener\">disaster planning and children<\/a>\u00a0and <a href=\"http:\/\/www.research.lancs.ac.uk\/portal\/en\/datasets\/search.html\" target=\"_blank\" rel=\"noopener\">a host of other<\/a> more specialized datasets supporting research right across the sciences and the humanities.\u00a0 Each dataset will have its own context, background and requirements for it to be properly interpreted and understood.<\/p>\n<p><!--more--><\/p>\n<p>Capturing high quality data means capturing high quality metadata; the structure which supports the data.\u00a0 The metadata explains the research data and supports discovery and (re)interpretation.\u00a0 Archivists are well used to supplying metadata for collections (or cataloguing it as it is more familiarly known!) and also know that the richest metadata is that which is supplied by the creator of the collection.\u00a0 This will be the person who knows most about the data, who fully understands the context and can supply additional information which will help with the later re-use and re-interpretation of the data.<\/p>\n<p>The ideal set up would be one where each dataset came with full and rich descriptive metadata with keywords taken from relevant subject specific vocabularies but the reality is always going to fall very far short of this.<\/p>\n<p>Research data is often seen as something of a by-product of the research process and this can reinforce the idea that action is only necessary because the research councils demand it, running the risk of creating a compliance culture.<\/p>\n<figure id=\"attachment_43\" aria-describedby=\"caption-attachment-43\" style=\"width: 554px\" class=\"wp-caption aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"43\" data-permalink=\"http:\/\/wp.lancs.ac.uk\/highly-relevant\/2016\/08\/04\/the-long-tail\/startup-photos-1\/\" data-orig-file=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/startup-photos-1.jpg?fit=640%2C426\" data-orig-size=\"640,426\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;http:\/\/startupstockphotos.com\/post\/123128014991\/at-barrel-soho-nyc  CC0&lt;\/p&gt;\n\" data-large-file=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/startup-photos-1.jpg?fit=640%2C426\" class=\"wp-image-43 \" src=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/startup-photos-1.jpg?resize=554%2C369\" alt=\"http:\/\/startupstockphotos.com\/post\/123128014991\/at-barrel-soho-nyc CC0\" width=\"554\" height=\"369\" srcset=\"https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/startup-photos-1.jpg?resize=300%2C200 300w, https:\/\/i0.wp.com\/wp.lancs.ac.uk\/highly-relevant\/files\/2016\/08\/startup-photos-1.jpg?w=640 640w\" sizes=\"auto, (max-width: 554px) 100vw, 554px\" \/><figcaption id=\"caption-attachment-43\" class=\"wp-caption-text\">http:\/\/startupstockphotos.com\/post\/123128014991\/at-barrel-soho-nyc (CC0)<\/figcaption><\/figure>\n<p>The truth of the matter is that researchers have little spare time or resource to devote to creating detailed and complex descriptions of their data (often having done so in the related published article).\u00a0 Even worse is when it comes to capturing the data in a format which is likely to promote its chances of being accessible and reusable well into the future.\u00a0 From Art through to Women&#8217;s Studies via Engineering, Linguistics, Physics and Creative Writing and everything in between there is a dizzying array of software and file types supporting everything from spreadsheets, to videos, to models to graphs.<\/p>\n<p>To what extent might it be possible to expect and demand rich metadata and standardised file formats?\u00a0 In terms of current practices at data repositories there is wide variety.\u00a0 Some repositories are extremely prescriptive about what can be deposited.\u00a0 The UK Data Archive for example which is a repository for &#8220;large collections of high quality data&#8221; for the Social Sciences.\u00a0 With a reputation for high quality reliable data the UK Data Archive service is in a position to demand <a href=\"http:\/\/www.data-archive.ac.uk\/media\/54785\/cd078-documentationingestprocessingprocedures_08_00w.pdf\" target=\"_blank\" rel=\"noopener\">specific file formats<\/a> and detailed metadata.\u00a0 Because they have a high institutional reputation researchers immediately see the value of investing time in producing data in the format required and to some extent competing for the privilege of having work deposited in this repository.\u00a0 However the majority of institutional repositories are catering for the long tail of research &#8211; datasets which have no &#8220;natural&#8221; home and do not meet the requirements of repositories such as the UK Data Archive.\u00a0 This puts institutional repositories on the backfoot &#8211; the starting position is of the repository of last resort so rather than researchers competing for the privilege of depositing they are using the repository as a filing cabinet to clear away the papers at the end of the project.<\/p>\n<p>So what to do about this?\u00a0 Again there are a variety of approaches which range from the prescriptive to the permissive.\u00a0 Some repositories &#8211; ourselves included in this &#8211; put no restrictions on the format of data and ask for the minimum amount of metadata as required by their institutional system (in our case Pure).\u00a0 We ask for keywords, geographic locations and covering dates but these are not required fields.\u00a0 We make no restriction on the format of the digital files deposited although we ask, where possible, for some explanatory notes to help future users of the data.\u00a0 We are, however, at the mercy of our depositors.\u00a0 This can mean anything from extremely rich and well described datasets to ones where lack of time and resources (and possibly engagement) provide scant metadata and risk having datasets which are hard for others to interpret, especially where data managers have had to add in metadata and descriptions later.\u00a0 At best we end up with uneven and patchy descriptions and at worst data which are unusable by anyone other than the creator right from the outset.<\/p>\n<p>There are several improvements we can make. \u00a0We should advocate and educate so that researchers understand the need for high quality data and metadata.\u00a0 We should be better at getting across the message of why it is important to make data openly available for transparency and reuse.<\/p>\n<p>We should also be looking at ways to refine the automation of data discovery and there are various interesting initiatives around although they would require rich metadata to allow for this kind of detailed analysis.<\/p>\n<p>Each institution will find itself in a different position with regards to the level of engagement but clearly collaborative approaches will work well both in raising the profile of data management and also in looking for shared solutions to data discovery and sharing.\u00a0 It will be interesting to see how the forthcoming JISC sponsored project for shared <a href=\"https:\/\/www.jisc.ac.uk\/rd\/projects\/research-data-shared-service\" target=\"_blank\" rel=\"noopener\">Research Data Services<\/a> will affect these current issues. \u00a0Hopefully it will promote more consistency and a stronger voice, especially for smaller institutions who don&#8217;t have the resources to develop a complex repository.<\/p>\n<p>There is a lot happening right now in data management with the emphasis on making it discoverable and reusable and we are keen to be a part of that conversation.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>&nbsp; We have been doing some thinking around how to improve the research data management services we offer here at Lancaster.\u00a0 We&#8217;re keen to move away from the idea of the role of research data management as purely for compliance purposes &#8211; we want to really push the idea of open data and data reuse &hellip; <a href=\"http:\/\/wp.lancs.ac.uk\/highly-relevant\/2016\/08\/04\/the-long-tail\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">The long tail<\/span><\/a><\/p>\n","protected":false},"author":521,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[1,6],"tags":[7,9,8],"class_list":["post-40","post","type-post","status-publish","format-standard","hentry","category-digital-preservation","category-rdm","tag-digital-preservation","tag-metadata","tag-rdm"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p81NIC-E","jetpack-related-posts":[],"_links":{"self":[{"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/posts\/40","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/users\/521"}],"replies":[{"embeddable":true,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/comments?post=40"}],"version-history":[{"count":7,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/posts\/40\/revisions"}],"predecessor-version":[{"id":801,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/posts\/40\/revisions\/801"}],"wp:attachment":[{"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/media?parent=40"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/categories?post=40"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/wp.lancs.ac.uk\/highly-relevant\/wp-json\/wp\/v2\/tags?post=40"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}