What is #LancsBox?
#LancsBox is a new-generation software package for the analysis of language data and corpora, developed at Lancaster University. It can be downloaded free of charge from http://corpora.lancs.ac.uk/lancsbox/index.php. The page also contains "How to…" videos that guide users through the various features of the tool.
Main features of #LancsBox:
- Works with your own data or existing corpora. It provides access, for example, to a large sample of the written and spoken BNC 2014, to British and American corpora, and to collections of writing by Shakespeare and Austen.
- Can be used by linguists, language teachers, historians, sociologists, educators and anyone interested in language.
- Visualizes language data.
- Analyses data in any language. Find out more details about language support.
- Automatically annotates data for part-of-speech.
- Works with any major operating system (Windows, Mac, Linux).
Acknowledgements: The development of #LancsBox was supported by ESRC grants ES/K002155/1 and EP/P001559/1. #LancsBox uses the following third-party tools and libraries: Apache Tika, Gluegen, Groovy, JOGL, minlog, QuestDB, RSyntaxTextArea, smallseg and TreeTagger.