8th Data Conversations: How Digital Humanities Impacts Research Data Management

Recently, Lancaster University Library held the 8th Data Conversations: How Digital Humanities Impacts Research Data Management. This year was different – no free pizza, but the delight of inviting external attendees to listen and contribute (and send gifs over chat!) was a welcome addition. We were joined by four of our own PhD students from a range of humanities subjects. Read on to find out more.

As MS Teams filled up, we were delighted to welcome the first of our speakers, Ben Willis-Eve. Ben talks us through his experience with social media data and “how this is managed and the effects of using Digital Humanities (DH) methods and trying to navigate what is quite a grey area with how you can collect and store this data“ – with focus on the first two points of the below slide. This was all a lot more complicated than Ben thought before starting his PhD – one particular stand out consideration surrounds ethics and what Twitter expect as per their guidelines. The challenges of using hashtags is also discussed, as these are not ‘standard words.’

Next was a look at Bibliometrics for the social sciences from Anoud Abusalim, focussing on how bibliometrics can be used to inform policy in social sciences and humanities research. Anoud looks at second language writing (SLW), which she finds is heavily Americanised. With data collection procedures, organisation is key! You can see several helpful visualisations on the YouTube slides.


To round off the presentations, we next had Ellen Roberts and Samuel Oliver. Reviewing Shakespeare, Ellen focuses on linguistic make up of genre, while Sam focuses on meta language of politeness and impoliteness.

So, what is Corpus Linguistics? Well, corpus=body in Latin. It is an established digital method of studying large a large body of text(s) using specialist software. Ellen explores some of the challenges faced looking at tags in the Enhanced Shakespearean Corpus. Since play texts include stage directions, dialogue and character interaction, how can you meaningfully digitise this for study? Ellen uses a great example of name shortening too, from Theseus to ‘The’ – such a commonly used word!

After introducing Corpus Linguistics and tagging, Ellen seamlessly hands to Sam who discusses his PhD research “(im)politeness metalanguage in Shakespeare’s plays.” This presentation brought up interesting issues – how can text be meaningfully represented? With plays – how can it be translated to machine readable data e.g. stage directions, and the challenge of shortened character names? We must also consider the historical dimensions of words to ensure we understand the context of use.

As we reflected over the amazing work of all four speakers across the three DH presentations, we began the panel Q&A with an array of audience questions.

Anoud was unable to join for this part due to teaching commitment – thank you Anoud for your amazing multi-tasking!

Thank you to all attendees, we were delighted to have you! Did you miss out? Well not to worry, you can watch the full event via our YouTube page. On Twitter, use #ludatacon to catch up on comments and add to the discussion.

We can’t wait to see you all again in person, but for now we are planning another Data Conversations online and will share further details in due course. Happy Data Managing!



6th Data Conversations: Keep it, throw it… or lock it in the vault?

As the doors opened on our sixth Lancaster based Data Conversations and the smell of pizza drifted out, new and old faces joined our conversation about real life research data stories. We were lucky enough to have four engaging speakers, all of whom explained their experience of using data in different fields, and explored the long term value of their data which led to the question: ‘Keep it, throw it… or lock it in the vault?’

Continue reading 6th Data Conversations: Keep it, throw it… or lock it in the vault?

5th Data Conversations – Stories from the Field

We recently held our fifth Data Conversations here at Lancaster University Library. These events bring researchers together and act as a forum to share their experiences of using and sharing data. The vibe’s informal and we provide our attendees with complementary coffee, cake and pizza…

Continue reading 5th Data Conversations – Stories from the Field

Data Interview with Andrew Moore

Andrew Moore (@apmoore94) is a 2nd year PhD student at Lancaster University within the School of Computing and Communications. He is studying how sentiment analysis can be improved through world knowledge using finance as his specialised domain. His research interests are across Natural Language Processing, Machine Learning, and Reproducibility.

We talked to Andrew after he presented at the 3rd Data Conversations.

Continue reading Data Interview with Andrew Moore

3rd Data Conversation – Software as data: summary and slides

We had our third Data Conversation here at Lancaster University again with the aim of bringing together researchers to share their data stories and discuss issues and exchange ideas in a  friendly and informal setting.

Data Conversations Agenda

We had a bit of a change this time, however, as we had a special guest speaker, Neil Chue-Hong of the Software Sustainability Institute talking about Software as “a different kind of research object“.

Continue reading 3rd Data Conversation – Software as data: summary and slides

2nd Data Conversations 4 May 2017 – Data Security and Confidentiality

The 2nd Data Conversations had the theme of Data Security and Confidentiality. More than 20 Lancaster researcher attended. It was nice to start with a slice of pizza and a brew.

Always nice to start an event with food!

As at the 1st Data Conversations we had five lightning talks:

Continue reading 2nd Data Conversations 4 May 2017 – Data Security and Confidentiality

First Data Conversations 30 January 2017 – Summary of event & slides

The first Data Conversations happened on Monday, 31st of January 2017. Below is a quick overview of the action. You can find slides of four talks below.

Data Conversations Opening

Adrian Friday opening Data Conversations

The event was opened by Professor Adrian Friday from the Data Science Institute (DSI) who emphasised that the DSI is all about collaboration between disciplines which is also the spirit of Data Conversations. In fact the 25 attendees came from  a range of Departments: Biological and Life Sciences, Chemistry, Computing, Educational Research, History, Law, Lancaster Environment Centre, Politics, Psychology and others.

Continue reading First Data Conversations 30 January 2017 – Summary of event & slides