Over recent years, research into biomedical data using corpora and corpus methods has grown from a small-scale effort confined to isolated pockets into a much larger, very active field, with work advancing rapidly on many fronts in both corpus and computational linguistics.

In many areas of academic publishing, the literature is exploding and fields are subdividing into subfields, leading to stove-piping, where sub-communities of expertise become disconnected from each other. This is especially true of the genetics literature over the last 10 years, where researchers are no longer able to maintain knowledge of previously related areas.

We invite one-page abstract submissions on topics that include, but are not limited to, how techniques developed in Natural Language Processing (NLP) and Corpus Linguistics (CL) can help close this knowledge gap and lead to better hypothesis generation. This multidisciplinary effort aims to harness the power of NLP and CL to build methods and techniques that will provide new clues to disease aetiology.

The HG2BTM workshop aims to create a venue where different activities in corpus research into biomedical data can be brought together to explore progress in the field, by inviting renowned speakers working in NLP and CL towards advancing biomedical and gene ontology research.


