FinCausal 2020

FinCausal-2020 Shared Task:

“Financial Document Causality Detection”

 

Participation Form: Register a new team

Provisional Key Dates:

  • Trial data set released on the 1st of February 2020
  • Training data released on the 1st of March 2020
  • Blind test dataset released on the 1st of April 2020
  • Contributions from participants are expected on the 20th of April 2020
  • Release of results are provided by organizers on the 1st of May 2020

Introduction

Financial analysis needs factual data, but also explanation on the variability of these data. Data state facts, but provide little to no knowledge regarding how these facts materialised. The Financial Document Causality Detection Task aims to develop an ability to explain, from external sources, the reasons why a transformation occurs in the financial landscape, as a preamble to generating accurate and meaningful financial narrative summaries. Its goal is to evaluate which events or which chain of events can cause a financial object to be modified or an event to occur, regarding a given external context. This context is available in the financial news, but due to the high volatility of such information, mapping an external cause to a given consequence is not trivial.

The task dataset has been extracted from different 2019 financial news kindly provided by Qwam, and additional SEC data from the Edgar Database, and has been normalised for the research task.

Participants will be asked to evaluate whether a sentence is causal or not (Task 1), then to detect, in causal sentences, which elements of the sentence relate to the cause and which relate to the effect (Task 2).

This paper details the data processing and the labelling scheme, the expected results and the metrics used for evaluation. It will be updated on release of the training data.


Task

As part of the Financial Narrative workshop, we propose the FinCausal Task, focusing on detecting if an object, an event or a chain of events is considered a cause for a prior event.  This shared task focuses on determining causality associated to a quantified fact. An event is defined as the arising or emergence of a new object or context in regard of a previous situation. So the task will emphasise the detection of causality associated to transformation of financial objects embedded in quantified facts.

Participants will be provided with a sample of text blocks extracted from financial news and SEC data, labelled through inter annotator agreement.

 The Shared Task contains two sub-tasks:

Task 1: Sentence Classification

This task is a binary classification task. The goal of this subtask is to filter sentences which display causal meanings (1) from the sentences that are noise in regard of causality (0)

Table 1: Sentence Classification Sample

Text

Gold

As customer expectations continuously evolve, customers expect immediacy and simplicity.

0

Thomas Cook’s subsidiary in Germany is still technically operating as of Monday afternoon but has stopped taking bookings. More than 140,000 German holidaymakers have been impacted and tens of thousands of future travel bookings may not be honored

1

According to Gran , the company has no plans to move all production to Russia , although that is where the company is growing

0

 

Task 2: Cause and Effect Detection

This task is a relation detection task. The aim is to identify, in a causal sentence or text block, the causal elements and the consequential ones.

Table 2: Cause and Effect Detection Sample

Text

Cause

Effect

Boussard Gavaudan Investment Management LLP bought a new position in shares of GENFIT S A/ADR in the second quarter worth about $199,000. Morgan Stanley increased its stake in shares of GENFIT S A/ADR by 24.4% in the second quarter.Morgan Stanley now owns 10,700 shares of the company’s stock worth $211,000 after purchasing an additional 2,100 shares during the period

Morgan Stanley increased its stake in shares of GENFIT S A/ADR by 24.4% in the second quarter

Morgan Stanley now owns 10,700 shares of the company’s stock worth $211,000 after purchasing an additional 2,100 shares during the period.

Zhao found himself 60 million yuan indebted after losing 9,000 BTC in a single day (February 10, 2014)

losing 9,000 BTC in a single day (February 10, 2014)

Zhao found himself 60 million yuan indebted

 

Participants are free to use any method they see fit (regex, corpus linguistics, entity relationship models, deep learning methods) to identify the causes and effects.

 

Shared Task Organisers

  • Dominique Mariko – Yseop Lab
  • Anubhav Gupta – Yseop Lab
  • Hanna Abi Akl – Yseop Lab
  • Hugues de Mazancourt – Yseop Lab
  • Yagmur Ozturk – Yseop Lab

 

Shared Task Contact 

The participants to this task will access the data after registering, and thereby pledge to contribute to the workshop by submitting an experiment paper.

Participant can register to this shared task by filling this form, and get access to the datasets.

For any question please contact the organisers at fin.causal.task@gmail.com