{"id":600,"date":"2019-09-17T17:17:42","date_gmt":"2019-09-17T17:17:42","guid":{"rendered":"http:\/\/wp.lancs.ac.uk\/cfie\/?page_id=600"},"modified":"2019-10-03T14:53:37","modified_gmt":"2019-10-03T14:53:37","slug":"tamaf2018","status":"publish","type":"page","link":"https:\/\/wp.lancs.ac.uk\/cfie\/tamaf2018\/","title":{"rendered":"TAMAF1"},"content":{"rendered":"<h2 style=\"text-align: center\"><span style=\"font-family: helvetica;font-size: 18pt\"><strong><span style=\"color: #ff6600\">1<sup>st<\/sup> Workshop on Textual Analysis Methods in Accounting and Finance<\/span><\/strong><\/span><\/h2>\n<h5 style=\"text-align: center\"><span style=\"font-family: helvetica\"><strong>Lancaster University Management School<\/strong><\/span><\/h5>\n<h5 style=\"text-align: center\"><span style=\"font-family: helvetica\"><strong>12-14 September 2018<\/strong><\/span><\/h5>\n<p>&nbsp;<\/p>\n<p><strong>Day 1: 12 September<\/strong><\/p>\n<table style=\"width: 98.1887%\" width=\"595\">\n<tbody>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">11.00-11.15<\/td>\n<td style=\"width: 102.41%\" width=\"510\">Welcome and introduction<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">11.15-12.30<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong>Session 1 <em>Overview of textual analysis literature in accounting and finance<\/em><\/strong><\/p>\n<p>The aim of this session is to provide participants with an overview of extant research on textual analysis in the accounting and finance literature. We will focus on the proposed benefits of automated analysis of text and evaluate extant research against these perceived advantages. A key conclusion that will emerge from the review is that prior research is limited in scope and fails to deliver many of the suggested benefits. 
A critical theme informing the remainder of the workshop is that automated analysis is not a \u201cquick fix\u201d replacement for close manual reading by domain experts: most advanced applications of computational methods rely on significant manual reading for training and validation.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">12.30-13.15<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong><span style=\"color: #ff6600\">&lt;Lunch&gt;<\/span><\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">13.15-15.15<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong>Session 2 <em>Text extraction: Methods and pitfalls<\/em><\/strong><\/p>\n<p>Automated text retrieval is the starting point for most large-sample applications of textual analysis in accounting and finance. This session will provide general guidelines on the text retrieval process, as well as hands-on experience with retrieving: 10-K annual report text (including harvesting documents from EDGAR) using Python and R scripts; U.K. annual report narratives published as PDF files using the CFIE\u2019s Java-based annual report tool; and U.K. earnings announcement narratives using the CFIE\u2019s Java-based PEA tool.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">15.15-15.30<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong><span style=\"color: #ff6600\">&lt;Break&gt;<\/span><\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">15.30-17.30<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong>Session 3 <em>Readability and tone: Methods and critique<\/em><\/strong><\/p>\n<p>Readability and tone (sentiment) are the two most commonly analysed features of financial market text. This session will review and critique methods used in the extant literature to measure readability and tone. 
We will demonstrate the problems of relying on standard readability metrics such as Fog to capture sophisticated narrative features such as complexity and understandability. We will also review the various approaches for measuring tone, ranging from simple wordlists to more advanced machine learning methods. A key conclusion that will emerge from this review is that simple measures of readability and tone provide limited scope for generating significant new insights in the literature.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2429%\" width=\"85\">18.00-19.30<\/td>\n<td style=\"width: 102.41%\" width=\"510\"><strong>Dinner &amp; research presentation: <em>Classifying Tone and Attribution<\/em><\/strong><\/p>\n<p>A buffet dinner followed by a discussion of ongoing research assessing the relative accuracy of wordlists and machine learning for measuring tone and managerial self-attribution bias in performance sentences from earnings announcements.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2><\/h2>\n<p><strong>Day 2: 13 September<\/strong><\/p>\n<table style=\"width: 98.5855%\" width=\"595\">\n<tbody>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">09.00-10.30<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><strong>Session 4 <em>Constructing and using wordlists <\/em><\/strong><\/p>\n<p>Wordlists are the most common approach to analysing financial text in the accounting and finance literature. This session discusses the advantages and weaknesses of using a wordlist approach to study financial text, reviews the most common wordlists employed in the literature, and considers some of the methods used in conjunction with wordlists to improve their classification performance. 
The session will also explain the different approaches to constructing wordlists.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">10.30-11.00<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><span style=\"color: #ff6600\"><strong>&lt;Break&gt;<\/strong><\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">11.00-12.30<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><strong>Session 5 <em>Introduction to machine learning<\/em><\/strong><\/p>\n<p>While machine learning forms the basis for a large proportion of research in the field of natural language processing, its uptake in accounting and finance is more limited. This session provides a broad introduction to machine learning methods, including both supervised and unsupervised approaches. Different aspects of machine learning and the relationships between them will be explained, including classification, named entity recognition, summarization, and topic modelling.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">12.30-13.30<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><span style=\"color: #ff6600\"><strong>&lt;Lunch&gt;<\/strong><\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">13.30-15.00<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><strong>Session 6 <em>Machine learning applications: Classification<\/em><\/strong><\/p>\n<p>This session provides a hands-on introduction to classification using machine learning methods. Participants will use the Weka toolkit (<a href=\"https:\/\/www.cs.waikato.ac.nz\/~ml\/weka\/downloading.html\">https:\/\/www.cs.waikato.ac.nz\/~ml\/weka\/downloading.html<\/a>) to construct and evaluate a model for identifying fraudulent financial reporting using 10-K filings. 
Results and insights from the analysis will be used to highlight weaknesses in the extant literature and identify opportunities for future research.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">15.00-15.15<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><span style=\"color: #ff6600\"><strong>&lt;Break&gt;<\/strong><\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">15.15-17.15<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><strong>Session 7 <em>Machine learning applications: Topic modelling<\/em><\/strong><\/p>\n<p>Several recent papers in the accounting literature have employed topic modelling methods such as Latent Dirichlet Allocation (LDA) to identify topics in financial text. This session provides a hands-on introduction to topic modelling. Participants will use MALLET (<a href=\"http:\/\/mallet.cs.umass.edu\/index.php\">http:\/\/mallet.cs.umass.edu\/index.php<\/a>) to extract topics from an annual report corpus. In addition to walking participants through the practicalities of the modelling process, the session will highlight the many problems associated with topic modelling and discuss alternative approaches to the content analysis problem.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.909%\" width=\"85\">18.00-19.30<\/td>\n<td style=\"width: 102.137%\" width=\"510\"><strong>Dinner &amp; research presentation: <em>Characteristics of Award Winning Annual Reports<\/em><\/strong><\/p>\n<p>A buffet dinner followed by a discussion of ongoing research that employs corpus methods to identify topics and linguistic styles that characterize high-quality annual report narratives (proxied by reports shortlisted for a reporting award).<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><strong>Day 3: 14 September<\/strong><\/p>\n<table style=\"width: 98.4708%\">\n<tbody>\n<tr>\n<td style=\"width: 16.8301%\" width=\"85\">09.00-10.30<\/td>\n<td style=\"width: 99.1987%\" width=\"510\"><strong>Session 8 <em>Introduction to corpus 
linguistics<\/em><\/strong><\/p>\n<p>This session provides an introduction to the theory and core methods underpinning the systematic analysis of a large body of text (i.e., a corpus). The session will cover the following themes: introduction to basic corpus linguistic concepts; presentation of different corpora types and examples; methodology for corpus design, compilation, and processing; corpus annotation; basic resources and corpus analysis tools; and examples from the literature of using corpus methods to analyse financial discourse.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 16.8301%\" width=\"85\">10.30-11.00<\/td>\n<td style=\"width: 99.1987%\" width=\"510\"><span style=\"color: #ff6600\"><strong>&lt;Break&gt;<\/strong><\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 16.8301%\" width=\"85\">11.00-13.00<\/td>\n<td style=\"width: 99.1987%\" width=\"510\"><strong>Session 9 <em>Applied corpus methods: Tools and techniques<\/em><\/strong><\/p>\n<p>This session provides hands-on experience of corpus analysis. The session will consist of two parts. Part 1 will introduce the corpus that will form the basis of our analysis (Brexit narratives in annual reports of U.K. financial firms), along with the #LancsBox software for corpus analysis. In Part 2, participants will use #LancsBox to analyse a small dataset and perform corpus tasks including: extracting word lists; finding collocates; and searching for n-grams and keywords. 
The session will conclude with a discussion of the insights gained from analysing the corpus.<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 16.8301%\" width=\"85\">13.00-14.00<\/td>\n<td style=\"width: 99.1987%\" width=\"510\"><span style=\"color: #ff6600\"><strong>&lt;Lunch&gt; and workshop ends<\/strong><\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 16.8301%\" width=\"85\">14.00-15.30<\/td>\n<td style=\"width: 99.1987%\" width=\"510\">Optional surgery session for PhD students seeking feedback on research proposals and ongoing work involving analysis of text<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>1st Workshop on Textual Analysis Methods in Accounting and Finance Lancaster University Management School 12-14 September 2018 &nbsp; Day 1: 12 September 11.00-11.15 Welcome and introduction 11.15-12.30 Session 1 Overview&hellip; <a href=\"https:\/\/wp.lancs.ac.uk\/cfie\/tamaf2018\/\" class=\"more-link\">Continue reading <span 
class=\"screen-reader-text\">TAMAF1<\/span><\/a><\/p>\n","protected":false},"author":660,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-600","page","type-page","status-publish","hentry","without-featured-image"],"_links":{"self":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/600","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/users\/660"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/comments?post=600"}],"version-history":[{"count":2,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/600\/revisions"}],"predecessor-version":[{"id":610,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/600\/revisions\/610"}],"wp:attachment":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/media?parent=600"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}