2nd Data Conversations 4 May 2017 – Data Security and Confidentiality

The 2nd Data Conversations had the theme of Data Security and Confidentiality. More than 20 Lancaster researcher attended. It was nice to start with a slice of pizza and a brew.

Always nice to start an event with food!

As at the 1st Data Conversations we had five lightning talks. You can see the agenda below.

You can find a short summary of the event, the slides and some photos below.

Denes Csala – The sensor cloud around us: collecting, mining and visualizing the energy and building management data of the campus

Dr Denes Csala is a newly appointed lecturer in Energy Storage Systems Dynamics with Energy Lancaster.

There are 30,000 sensors on campus capturing all sorts of data about energy and energy consumption.  This has the potential for us to understand a huge amount about the way energy is managed and used but at the same time throws up the issue of managing extremely sensitive commercial and personal data.  Access to the data is strictly controlled but Energy Lancaster are very excited about the possibilities of what could be done with the data.

You can see an animated visualization of the campus energy metering system sensor data here:

Kopo Ramokapane – Could computing: When is Deletion Deletion

Kopo Ramokapane is a PhD student in the School of Computing and Communications. Kopo gave an overview about the growing importance of “the cloud”. But do we also see the implications that cloud computing has on security and privacy of our data?

Kopo reported that when you delete data in the cloud there is no way to be sure that all copies or all versions have been deleted from the cloud provider. This issue isn’t new but doesn’t get as much attention as it should be.  Because of the way Cloud storage operates it is almost impossible even for the service providers to be certain that all the data has been deleted.  Avoid storing confidential data in the Cloud and learn more about how the systems work! Lancaster University has a contract with cloud service Box which ensures that compliance issues are dealt with in relation to storage of confidential or sensitive data.

Karen Broadhurst and Stuart Bedston – Better data for better justice: Towards data-driven analyses of Family Court policy and practice

Professor Karen Broadhurst and Stuart Bedston from the Sociology Department reported on concerns about transparency in family court-decision-making.  Greater transparency and “open data” would have a positive impact in many ways but is hard to achieve looking at the security requirements and potential risks.

Karen and Stu presenting on “Better data for better justice”

Karen and Stu highlighted the changes that would be needed in order to strengthen interdisciplinary research using controlled-data here at Lancaster University but also the difficulties that stand in the way.

John Couzins – Security Overview at Lancaster University

Next on was John Couzins, the IT Security Manager of Lancaster University. John who works for the institutional IT service ISS reported on the certifications that are necessary to fulfil requirements of certain providers of confidential data. Current examples are Cyber Essentials Plus and the IG Toolkit (Information Governance Toolkit) which is used by the NHS.

Mateusz Mikusz – Running Research as a Service. Implications for Privacy Policies and Ethics

Mateusz Mikusz is working on his PhD in the School of Computing and Communications. He is working on a project that develops pervasive displays where students can get personalised content on public screens on campus if they use an app or iLancaster.

The issue regarding the data is that is used for two purposes:

  • To make the app and its use cases work
  • To create research data of usage and other properties that can be analysed by the project team

Mateusz explained that he is working hard to bring both things together in an ethical way that still allows innovative research.

Mateusz presenting the project

It was a great showcase for a lot of fantastic research that is taking place at Lancaster University and the way in which handling sensitive data and tackling data security is at the forefront of this.  There were probably as many questions raised as there were answers given but it was a great opportunity to share approaches to handling data securely and ethically.

Want to know more?  Get in touch with the RDM team rdm@lancaster.ac.uk

3rd Data Conversation – 19th September 2017

Join us for our next Data Conversation on 19th September on Software as Data with a special guest speaker Neil Chue Hong from the Software Sustainability Institute.

First Data Conversations 30 January 2017 – Summary of event & slides

The first Data Conversations happened on Monday, 31st of January 2017. Below is a quick overview of the action. You can find slides of four talks below.

Data Conversations Opening

Adrian Friday opening Data Conversations

The event was opened by Professor Adrian Friday from the Data Science Institute (DSI) who emphasised that the DSI is all about collaboration between disciplines which is also the spirit of Data Conversations. In fact the 25 attendees came from  a range of Departments: Biological and Life Sciences, Chemistry, Computing, Educational Research, History, Law, Lancaster Environment Centre, Politics, Psychology and others.

Data Conversations Talks

Unfortunately, Dr Chris Jewell from the Medical School had to cancel his talk. You can see an overview of the agenda below.

Leif Isaksen – Does Linked Data Have to be Open?

Leif Isaksen from the History Department (Leif is also involved in the Data Science Institute) presented the Pelagios Commons project which provides online resources for using open data methods to link and explore historical places.

Leif Isaksen

Leif stressed that linking data is a social process which is built on open partnerships.

You can see Leif’s presentation below:

Jude Towers – Is Violent Crime Increasing or Decreasing?

Dr Jude Towers from Lancaster’s Sociology Department discussed crime rates, especially the rate of domestic violence over time through the Crime Survey for England and Wales. A current ESRC project is looking at how changing survey methodologies alter the underlying data of crime statistics.

Alison Scott-Baumann – Protecting participants and their data on a sensitive topic

Next up was Alison Scott-Baumann who is a Professor of Society and Belief in the Centre of Islamic Studies in the Near and Middle East Department at SOAS. Alison is the Project lead on (Re)presenting Islam on Campus. Lancaster is a project partner and Dr Shuruq Naguib added to Alison’s presentation.

Alison Scott Baumann

Alison and Shuruq explained how difficult it is to get the balance right between confidentiality and data security required to manage often highly sensitive data, and to meet the expectations of data sharing. They stressed how much effort they spend on explaining the terms of the consent forms to project participants.

David Ellis – Building interactive data visualisations to support publications

Dr David Ellis showed the audience an example of dynamic data visualisation using a dataset he published on Lancaster University’s Research Registry. (http://dx.doi.org/10.17635/lancaster/researchdata/58). David explained how he used the R package Shiny Apps to achieve this.

David explained that the visualisation helps not only other researchers but also enables the interested public to query his data. One example was interest from journalists into his research into predicting smartphone operating system from personality and individual differences.

Chris Donaldson & James Butler – Mining and mapping places with multiple names

Finally, Dr Christopher Donaldson and Dr James Butler talked about their research using a 1.5 million word corpus of Lake District 18th and 19th century literature. Christopher and James use the Edinburgh Geoparser System to automatically recognise place names in text and disambiguate them with respect to a gazetteer.

James demonstrated how he can deal with name variations (secondary names), it is a lot of work. For example, the lake “Coniston” appears in the corpus as:  Thurstan, Coniston Lake, Coniston Water, Thurston, Conistone, Conistone Lake, Cunnistone Lake, Thurston Lake, Coniston Mere, Lake of Coniston, Conis- ton, Conyngs Tun, Conyngeston, Thorstane’s watter, Turstinus.

Chris Donaldson
James Butler

Feedback so far

The feedback from attendees and presenters so far so far is encouraging.

Enjoyed the presentations. I hope these data conversations will become a nice community for those interested in data. Relaxed and nicely themed but not too prescribed. The venue was good and the cakes and biscuits were very good!

We got some comments on the length of the presentations and question time.

Really enjoyable – perhaps a bit more time for each speaker / questions and discussions.

We will look into amending the format. We do like to keep a balance between time for data stories and discussions and giving a number of Lancaster researchers a forum to talk about their experiences. Thanks for the comments and suggestions so far!

Upcoming: 2nd Data Conversations 4th of May

We hope to report on some of the data presentations in more detail in future blog posts. Meanwhile, we are already preparing for the next Data Conversations event on 4th of May (1.45-4 pm). The theme of the event will be “Data Security and Confidentiality”, and registrations are open: http://bit.ly/ludatacon2. Please come along and if you have any questions get in touch with the RDM Support Team: rdm@lancaster.ac.uk.

First Data Conversations – Speakers confirmed!

Data Conversations on 30 January “Sharing Data – Benefits and Boundaries

We are very excited about the first Data Conversations event at Lancaster University coming up on 30 January 2017, 1.45-4pm. There will be 6 short talks from academics talking about aspects of their research data. We can now publish the agenda.

Detailed agenda

13:45 Registration and Coffee  
14:00 Welcome to Data Conversations Nigel Davies

(Data Science Institute)

14:10 – 14:50 First round of short talks
14:10 – 14:20 1. Does Linked Data have to be Open? Reflections from the Pelagios Commons Leif Isaksen

(History)

14:25 – 14:35 2. The Politics of Counting: Is Violent Crime Increasing or Decreasing? Jude Towers

(Sociology)

14:40 – 14:50 3. Protecting participants and their data on a sensitive topic Alison Scott-Baumann

(PPR)

14:55 – 15:10 Tea and coffee
15:10 – 15:55 Second round of short talks  
15:10 – 15:20 4. Building interactive data visualizations to support publications David Ellis

(Psychology)

15:15 – 15:30 5. Efficient sharing of numerical output Chris Jewell

(Medical School / CHICAS)

15:35 – 15:45 6. Mining and mapping places with multiple names’ Chris Donaldson & James Butler (History)
15:55 Close Hardy Schwamm (Library)

We are very happy that we get speaker from a range of disciplines! We are looking forward to the first Data Conversations and will report on how it went. Watch this space!