Iterative Semantic Transformer by Greedy Distillation for Community Question Answering

Jinmeng Wu, Tingting Mu, Jeyarajan Thiyagalingam, Hanyu Hong, Yanbin Hao, Tianxu Zhang, John y. Goulermas

Research output: Contribution to journalArticlepeer-review

38 Downloads (Pure)

Abstract

The semantic matching problem consists of recognizing if the candidate text is relevant to a particular input text. Semantic similarities can be determined from human-curated knowledge, but such knowledge may not be available in every language. Instead, statistical learning techniques have been applied, but these techniques circumvent the need for manual feature engineering by using large datasets to train models to perform semantic similarity scoring between portions of text or words. The pre-trained transformer provides a further mechanism to consolidate the information throughout a sentence into single sentence-level representations, but these representations may not be optimal for the matching task. As an alternative, we propose an interactive semantic transformer based on a greedy layer-wise framework to learn a distributed similarity representation for sentence pairs. The novelty of the architecture lies in an abstract representation of the semantic similarities created by three-stage learning strategies. Model training is accomplished through a greedy layer-wise training scheme, that incorporates both supervised and unsupervised learning. The proposed model is experimentally compared to state-of-the-art approaches on three different dataset types: the library TREC, the Yahoo!, and Stack Exchange community question datasets, and results show the proposed model outperforming other approaches.
Original languageEnglish
Pages (from-to)1-13
JournalIEEE/ACM Transactions on Audio, Speech, and Language Processing
Early online date19 Jun 2024
DOIs
Publication statusE-pub ahead of print - 19 Jun 2024

Keywords

  • semantic representation
  • descriptive clustering
  • feature selection
  • pre-trained model
  • greedy-layer wise
  • sparse models

Fingerprint

Dive into the research topics of 'Iterative Semantic Transformer by Greedy Distillation for Community Question Answering'. Together they form a unique fingerprint.
  • MCAIF: Centre for AI Fundamentals

    Kaski, S. (PI), Alvarez, M. (Researcher), Pan, W. (Researcher), Mu, T. (Researcher), Rivasplata, O. (PI), Sun, M. (Researcher), Mukherjee, A. (Researcher), Caprio, M. (PI), Sonee, A. (Researcher), Leroy, A. (Researcher), Wang, J. (Researcher), Lee, J. (Researcher), Parakkal Unni, M. (Researcher), Sloman, S. (Researcher), Menary, S. (Researcher), Quilter, T. (Researcher), Hosseinzadeh, A. (PGR student), Mousa, A. (PGR student), Glover, E. (PGR student), Das, A. (PGR student), DURSUN, F. (PGR student), Zhu, H. (PGR student), Abdi, H. (PGR student), Dandago, K. (PGR student), Piriyajitakonkij, M. (PGR student), Rachman, R. (PGR student), Shi, X. (PGR student), Keany, T. (PGR student), Liu, X. (PGR student), Jiang, Y. (PGR student), Wan, Z. (PGR student), Evans, I. (Support team), Harrison, M. (Support team) & Machado, M. (PI)

    1/10/2130/09/26

    Project: Research

Cite this