Multilingual Supervision of Semantic Annotation

In this paper, we investigate annotation projection of semantic units in a practical setting. Previous approaches have relied on parallel corpora for semantic transfer. We evaluate an alternative approach that uses loosely parallel corpora, which need not be exact translations of each other. We investigate a method that transfers semantic annotations from one language to another using sentences aligned by entities, and we extend it to include alignments by entity-like linguistic units. We conduct our experiments at large scale on the English, Swedish, and French editions of Wikipedia. Our results show that annotation projection using entities in combination with loosely parallel corpora provides a viable way to extend previous approaches. In addition, it enables the generation of proposition banks on which semantic parsers can be trained.
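
To make the idea of entity-based alignment concrete, the following is a minimal sketch and not the pipeline evaluated in this paper. It assumes that entity mentions in both languages have already been linked to shared, language-independent identifiers, and that the source sentence carries role labels on its entity arguments. The names `Mention`, `Sentence`, `project_roles`, and `min_shared`, the identifiers `E1` and `E2`, and the example sentence pair are all hypothetical and chosen for illustration only. Predicate projection is left out for brevity.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Mention:
    entity_id: str  # hypothetical language-independent entity identifier
    start: int      # index of the first token of the mention
    end: int        # index one past the last token of the mention


@dataclass
class Sentence:
    tokens: List[str]
    mentions: List[Mention]


def project_roles(
    source_roles: Dict[str, str],
    target: Sentence,
    min_shared: int = 2,
) -> Dict[Tuple[int, int], str]:
    """Copy role labels from a source sentence to a target sentence.

    source_roles maps an entity identifier to the role label it carries in
    the annotated source sentence. The two sentences count as aligned only
    if they share at least min_shared distinct entities (an assumed
    threshold, not a value taken from the paper).
    """
    shared = [m for m in target.mentions if m.entity_id in source_roles]
    if len({m.entity_id for m in shared}) < min_shared:
        return {}  # too little overlap: do not project anything
    return {(m.start, m.end): source_roles[m.entity_id] for m in shared}


# Hypothetical source annotation for "Marie Curie discovered polonium":
# the discoverer (E1) is labeled A0 and the thing discovered (E2) is labeled A1.
source_roles = {"E1": "A0", "E2": "A1"}

# A loosely parallel Swedish sentence that mentions the same two entities.
target = Sentence(
    tokens=["Polonium", "upptäcktes", "av", "Marie", "Curie"],
    mentions=[Mention("E2", 0, 1), Mention("E1", 3, 5)],
)

projected = project_roles(source_roles, target)
print(projected)
```

Because the alignment depends only on shared entity identifiers, the target sentence may use a different construction from the source (here a passive) and still receive the projected labels.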