Abstract
In recent years, there has been a considerable interest in dependency parsing for many reasons. First, dependency-based syntactic representations seem to be effective in many areas of NLP, such as machine translation, question answering, and relation extraction, thanks to their transparent encoding of predicate-argument structure. Second, dependency parsing is flexible for free word order languages (e.g. Arabic and Czech). Third, and most importantly, the dependency-based approach has led to the development of fast robust reasonably accurate syntactic parsers for a number of languages. In this paper, we investigate the technique of combining multiple data-driven dependency parsers for parsing Arabic. Arabic has a number of characteristics, which will be described through the paper, that make parsing it challenging. Experimental results show that combined parsers can produce more accurate results, even for imperfectly tagged text, than each parser produces by itself for texts with the gold-standard tags.
| Original language | English |
|---|---|
| Title of host publication | Human Language Technology. Challenges for Computer Science and Linguistics |
| Editors | Zygmunt Vetulani |
| Place of Publication | Poland |
| Publisher | Human Language Technologies |
| Pages | 546-550 |
| Number of pages | 5 |
| Publication status | Published - Nov 2011 |
| Event | 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics - Poznań, Poland Duration: 25 Nov 2011 → 27 Nov 2011 |
Conference
| Conference | 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics |
|---|---|
| City | Poznań, Poland |
| Period | 25/11/11 → 27/11/11 |
Keywords
- Dependency Parsing
- MSTParser
- MALTParser
- System combination