Discours (Dec 2012)
Discourse in Statistical Machine Translation
Abstract
Current approaches to statistical machine translation assume that sentences in a text are independent, ignoring the property of connectedness present in virtually all discourse. We provide an extensive overview of the literature on statistical machine translation that can be related to discourse phenomena, and we present a detailed investigation and discussion of existing research efforts on a particular discourse-related problem, the translation of anaphoric pronouns. Comparing different approaches to discourse in statistical machine translation allows us to identify fundamental problems and draw conclusions from an overarching perspective.
Keywords