Facilitating the analysis of discourse phenomena in an interoperable NLP platform

Riza Theresa Batista-Navarro, Georgios Kontonatsios, Claudiu Mihǎilǎ, Paul Thompson, Rafal Rak, Raheel Nawaz, Ioannis Korkontzelos, Sophia Ananiadou

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

The analysis of discourse phenomena is essential in many natural language processing (NLP) applications. The growing diversity of available corpora and NLP tools brings a multitude of representation formats. In order to alleviate the problem of incompatible formats when constructing complex text mining pipelines, the Unstructured Information Management Architecture (UIMA) provides a standard means of communication between tools and resources. U-Compare, a text mining workflow construction platform based on UIMA, further enhances interoperability through a shared system of data types, allowing free combination of compliant components into workflows. Although U-Compare and its type system already support syntactic and semantic analyses, support for the analysis of discourse phenomena was previously lacking. In response, we have extended the U-Compare type system with new discourse-level types. We illustrate processing and visualisation of discourse information in U-Compare by providing several new deserialisation components for corpora containing discourse annotations. The new U-Compare is downloadable from http://nactem.ac.uk/ ucompare. © 2013 Springer-Verlag.
Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|Lect. Notes Comput. Sci.
Place of PublicationBerlin, Germany
PublisherSpringer Nature
Pages559-571
Number of pages12
Volume7816
ISBN (Print)9783642372469
DOIs
Publication statusPublished - 2013
Event14th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2013 - Samos
Duration: 1 Jul 2013 → …
http://link.springer.com/chapter/10.1007%2F978-3-642-37247-6_45

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Berlin Heidelberg

Other

Other14th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2013
CitySamos
Period1/07/13 → …
Internet address

Keywords

  • causality
  • coreference
  • discourse
  • interoperabilty
  • meta-knowledge
  • U-Compare
  • UIMA

Fingerprint

Dive into the research topics of 'Facilitating the analysis of discourse phenomena in an interoperable NLP platform'. Together they form a unique fingerprint.

Cite this