Signalling Implicit Relations: A PDTB - RST Comparison

Authors

  • Lucie Poláková Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague
  • Jiří Mírovský Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague
  • Pavlína Synková Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague

DOI:

https://doi.org/10.5087/dad.2017.210

Abstract

Describing implicit phenomena in discourse is known to be a problematic task, from both theoretical and empirical perspectives. The present article contributes to this topic with a novel comparative analysis of two prominent annotation approaches to discourse relations (coherence relations) that were applied to the same texts. We compare the annotation of implicit relations in the Penn Discourse Treebank 2.0, i.e. discourse relations not signalled by an explicit discourse connective, with the recently released analysis of signals of rhetorical relations in the RST Signalling Corpus (RST-SC). The intersection of corresponding pairs of relations is rather small, but it shows a clear tendency: unlike the overall signal distribution in the RST-SC, more than half of the signals in the studied intersection are of the semantic type, formed mostly by loosely defined lexical chains. Our data transformation allows for a simultaneous depiction and detailed study of the two resources.

Published

2017-12-15

Section

Articles