TY - GEN
T1 - Paraphrasing with bilingual parallel corpora
AU - Bannard, Colin
AU - Callison-Burch, Chris
PY - 2005
Y1 - 2005
N2 - Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrase-based statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account. We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments.
AB - Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrase-based statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account. We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments.
UR - http://www.scopus.com/inward/record.url?scp=80053267402&partnerID=8YFLogxK
U2 - 10.3115/1219840.1219914
DO - 10.3115/1219840.1219914
M3 - Conference contribution
AN - SCOPUS:80053267402
SN - 1932432515
SN - 9781932432510
T3 - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
SP - 597
EP - 604
BT - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
PB - Association for Computational Linguistics
T2 - 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
Y2 - 25 June 2005 through 30 June 2005
ER -