{"id":596,"date":"2019-09-16T17:03:02","date_gmt":"2019-09-16T17:03:02","guid":{"rendered":"http:\/\/wp.lancs.ac.uk\/cfie\/?page_id=596"},"modified":"2020-12-07T20:06:24","modified_gmt":"2020-12-07T20:06:24","slug":"fintoc2020","status":"publish","type":"page","link":"https:\/\/wp.lancs.ac.uk\/cfie\/fintoc2020\/","title":{"rendered":"FinTOC 2020"},"content":{"rendered":"<p><span style=\"font-size: 14pt;color: #993300\">The 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation (FNP-FNS 2020)<\/span><\/p>\n<h1><span style=\"font-size: 12pt\"><strong>FinTOC-2020 Shared Task: <\/strong><strong>&#8220;Financial Document Structure Extraction&#8221;<\/strong><\/span><\/h1>\n<p>To be held at <a href=\"https:\/\/coling2020.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">The 28th International Conference on Computational Linguistics (COLING&#8217;2020)<\/a>, Barcelona, Spain [online] on <strong>12 December 2020<\/strong>.<\/p>\n<p><span style=\"color: #ff0000\"><strong>FNP-FNS Online Running Instructions: <\/strong><a href=\"http:\/\/wp.lancs.ac.uk\/cfie\/fnpfns-instructions\/\">http:\/\/wp.lancs.ac.uk\/cfie\/fnpfns-instructions\/<\/a><strong><br \/>\n<\/strong><\/span><\/p>\n<p><span style=\"color: #ff0000\"><strong>Workshop Program:<\/strong><\/span> <a href=\"http:\/\/wp.lancs.ac.uk\/cfie\/files\/2020\/11\/fnp-fns2020-program.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Click here to see the workshop schedule<\/a><\/p>\n<p><strong><span style=\"color: #ff0000\">Keynote speaker:<\/span><\/strong> <strong>Dr Ana Gisbert<\/strong>, to join the talk: <a href=\"http:\/\/wp.lancs.ac.uk\/cfie\/keynote\/\">http:\/\/wp.lancs.ac.uk\/cfie\/keynote\/<\/a><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>FinTOC shared task results: <a 
href=\"https:\/\/eur02.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fdocs.google.com%2Fspreadsheets%2Fd%2F1TgJ1sUKifNYXmbuDnp7CnFuN6QSRV5N7FwEGAqvS8cs%2Fedit%3Fusp%3Dsharing&amp;data=02%7C01%7Cm.el-haj%40lancaster.ac.uk%7C41b8f448fdd34609745808d8549e661f%7C9c9bcd11977a4e9ca9a0bc734090164a%7C0%7C1%7C637352386741017729&amp;sdata=gLu7mO5bCSF%2FE13U7t8dEqh%2FXBJhVTwWmkkv1J%2BKSEg%3D&amp;reserved=0\" target=\"_blank\" rel=\"noopener noreferrer\">FinTOC Ranking 2020<\/a><\/p>\n<p><del><span style=\"color: #ff0000\"><strong>NEW:<\/strong><\/span> Submission guidelines: <span style=\"color: #ff0000\"><strong><a href=\"http:\/\/wp.lancs.ac.uk\/cfie\/fnp2020\/guidelines\/\">http:\/\/wp.lancs.ac.uk\/cfie\/fnp2020\/guidelines\/<\/a><\/strong><\/span><\/del><\/p>\n<p><del>Participation Form: <a href=\"https:\/\/forms.gle\/LFsVaw6DqYikhKHx9\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/forms.gle\/LFsVaw6DqYikhKHx9<\/a><\/del><\/p>\n<hr \/>\n<p><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Important Dates:<\/span><\/span><\/strong><\/p>\n<ul>\n<li><del>December 1st, 2019 Registration opens.<\/del><\/li>\n<li><del>February 17th, 2020: Release of training set.<\/del><\/li>\n<li><del>March 23rd, 2020: Release of test set.<\/del><\/li>\n<li><del>registration deadline May 30, 2020<\/del><\/li>\n<li><del>result submission deadline June 30, 2020<\/del><\/li>\n<li><del>release of results July 30, 2020<\/del><\/li>\n<li><del>Shared task papers due <strong>September 1, 2020.<\/strong><strong> Extended to September 6, 2020<\/strong><\/del><\/li>\n<li><del>Notification of acceptance October 1, 2020<\/del><\/li>\n<li><del>Camera-ready papers due November 1, 2020<\/del><\/li>\n<li>Workshop and shared task dates December 12, 2020<\/li>\n<\/ul>\n<hr \/>\n<p style=\"text-align: justify\"><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Introduction:<\/span><\/span><\/strong><\/p>\n<p><span style=\"font-family: helvetica\">A vast 
number of financial documents are created and published constantly in machine-readable formats (generally PDF), with only minimal structural information. Firms use such documents, typically corporate annual reports containing detailed financial and operational information, to report their activities, financial situation or potential investment plans to shareholders, investors and the financial markets.<\/span><\/p>\n<p><span style=\"font-family: helvetica\">In some countries, such as the US or France, regulators such as the SEC (EDGAR) or the AMF require firms to follow a certain template when reporting their financial results, to ensure standardisation and consistency across firms\u2019 disclosures. In other European countries, on the other hand, management usually has more discretion over what, where and how to report, resulting in a lack of standardisation between financial documents published within the same market.<\/span><\/p>\n<p><span style=\"font-family: helvetica\">In this shared task, we focus on analysing financial prospectuses: official PDF documents in which investment funds precisely describe their characteristics and investment modalities. Although the content they must include is often regulated, their format is not standardised and displays a great deal of variability, ranging from plain text to more graphical and tabular presentations of data and information. The majority of prospectuses are published without a table of contents (TOC), which is usually needed to help readers navigate within the document by following a simple outline of headers and page numbers, and to assist legal teams in checking that all the required content is included. 
Thus, automatic analysis of prospectuses to extract their structure is becoming increasingly vital to many firms across the world.<\/span><\/p>\n<hr \/>\n<p style=\"text-align: justify\"><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Task:<\/span><\/span><\/strong><\/p>\n<p>The second edition of the FinTOC shared task offers two tracks, one for English documents and another for French documents, and scores systems on both title detection and TOC generation performance. We have revised the task and greatly simplified the data formats to make it as easy as possible for every interested researcher to participate and submit their systems\u2019 outputs at FinTOC\u20192.<\/p>\n<p style=\"margin: 0cm;margin-bottom: .0001pt\"><span style=\"font-family: 'Helvetica',sans-serif;color: black\">Participants need to register. Once registered, all participating teams will be provided with a common training dataset containing PDF documents and the associated TOC annotation. <\/span><\/p>\n<hr \/>\n<p style=\"text-align: justify\"><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Background:<\/span><\/span><\/strong><\/p>\n<p><span style=\"font-family: helvetica\">Almost all existing work on book and document table of contents (TOC) recognition has been on small, application-dependent, and domain-specific datasets. However, the TOCs of documents from different domains differ significantly in their visual layout and style, making TOC recognition a challenging problem for large-scale collections of heterogeneous documents and books. 
Compared to regular books (mostly provided in a full text format with limited structural information such as pages and paragraphs), financial documents, containing textual and non-textual content, have a more sophisticated structure, including parts, sections, sub-sections and sub-sub-sections.<\/span><\/p>\n<hr \/>\n<p style=\"text-align: justify\"><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\"><b>Data Format and <\/b><span style=\"font-size: 18.6667px\"><b>Evaluation<\/b><\/span><b>:<\/b><\/span><\/span><\/p>\n<p>The following document describes the data format and evaluation metric used in the shared task: <a href=\"https:\/\/docs.google.com\/document\/d\/1tpxHeaUJuyiSzQNQRIYUYIWd4-Ullzk1RTnmWr0OEnE\/edit?usp=sharing\" target=\"_blank\" rel=\"noopener noreferrer\">Data Format Details<\/a><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"color: #3366ff\">Each team should write a short paper describing their methods. The paper will be published in the ACL Anthology in the FNP 2020 proceedings as part of COLING 2020.<\/span><\/p>\n<hr \/>\n<h1><span style=\"color: #ff6600;font-size: 14pt\"><b>Shared Task Paper Submission Instructions:<\/b><\/span><\/h1>\n<p><span style=\"font-family: helvetica\">Submission URL: <a href=\"https:\/\/www.softconf.com\/coling2020\/FNP-FNS\/\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/www.softconf.com\/coling2020\/FNP-FNS\/<\/a><\/span><\/p>\n<p><span style=\"font-family: helvetica\">Detailed submission guidelines can be found here:\u00a0<a href=\"http:\/\/wp.lancs.ac.uk\/cfie\/fnp2020\/guidelines\/\">http:\/\/wp.lancs.ac.uk\/cfie\/fnp2020\/guidelines\/<\/a><\/span><\/p>\n<hr \/>\n<p style=\"text-align: justify\"><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Shared Task Organisers:<\/span><\/span><\/strong><\/p>\n<ul>\n<li><span style=\"font-family: helvetica\"><a href=\"https:\/\/www.linkedin.com\/in\/dvalsamou\/\" target=\"_blank\" rel=\"noopener noreferrer\">Dr Dialekti Valsamou<\/a>, <a 
href=\"http:\/\/fortia.fr\/\">Fortia Financial Solutions<\/a><\/span><\/li>\n<li><a href=\"https:\/\/www.linkedin.com\/in\/isma%C3%AFl-el-maarouf-9b323aa\/\" target=\"_blank\" rel=\"noopener noreferrer\">Dr Ismail El Maarouf<\/a>, <span style=\"font-family: helvetica\"><a href=\"http:\/\/fortia.fr\/\">Fortia Financial Solutions<\/a><\/span><\/li>\n<li><span style=\"font-family: helvetica\"><a href=\"https:\/\/www.linkedin.com\/in\/najah-imane-bentabet-7182b456\/\" target=\"_blank\" rel=\"noopener noreferrer\">Najah-Imane Bentabet<\/a>, <a href=\"http:\/\/fortia.fr\/\">Fortia Financial Solutions<\/a><\/span><\/li>\n<li><a href=\"https:\/\/www.linkedin.com\/in\/jugeremi\/\">R\u00e9mi Juge<\/a>, <span style=\"font-family: helvetica\"><a href=\"http:\/\/fortia.fr\/\">Fortia Financial Solutions<\/a><\/span><\/li>\n<li>Virginie Mouilleron, Fortia Financial Solutions<\/li>\n<\/ul>\n<hr \/>\n<p style=\"text-align: justify\"><strong><span style=\"font-size: 14pt\"><span style=\"color: #ff6600\">Shared Task Contact:<\/span><\/span><\/strong><\/p>\n<p><span style=\"font-family: helvetica\">Questions about FinTOC-2020 shared task can be sent to:<\/span><\/p>\n<p><span style=\"font-family: helvetica\"><a href=\"mailto:fin.toc.task@gmail.com\">fin.toc.task@gmail.com<\/a><\/span><\/p>\n\n<div class=\"twitter-share\"><a href=\"https:\/\/twitter.com\/intent\/tweet?via=FinancialNLP\" class=\"twitter-share-button\" data-size=\"large\">Tweet<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"<p>The 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation (FNP-FNS 2020) FinTOC-2020 Shared Task: &#8220;Financial Document Structure Extraction&#8221; To be held at The 28th International Conference on&hellip; <a href=\"https:\/\/wp.lancs.ac.uk\/cfie\/fintoc2020\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">FinTOC 
2020<\/span><\/a><\/p>\n","protected":false},"author":660,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-596","page","type-page","status-publish","hentry","without-featured-image"],"_links":{"self":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/596","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/users\/660"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/comments?post=596"}],"version-history":[{"count":26,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/596\/revisions"}],"predecessor-version":[{"id":826,"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/pages\/596\/revisions\/826"}],"wp:attachment":[{"href":"https:\/\/wp.lancs.ac.uk\/cfie\/wp-json\/wp\/v2\/media?parent=596"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}