Explaining tourist revisit intention using natural language processing and classification techniques

Andreas Gregoriades, Maria Pampaka, Herodotos Herodotou, Evripides Christodoulou

Research output: Contribution to journalArticlepeer-review


Revisit intention is a key indicator of business performance, studied in many fields including hospitality. This work employs big data analytics to investigate revisit intention patterns from tourists’ electronic word of mouth (eWOM) using text classification, negation detection, and topic modelling. The method is applied on publicly available hotel reviews that are labelled automatically based on consumers’ intention to revisit a hotel or not. Topics discussed in revisit-annotated reviews are automatically extracted and used as features during the training of two Extreme Gradient Boosting models (XGBoost), one for each of two hotel categories (2/3 and 4/5 stars). The emerging patterns from the trained XGBoost models are identified using an explainable machine learning technique, namely SHAP (SHapley Additive exPlanations). Results show how topics discussed by tourists in reviews relate with revisit/non revisit intention. The proposed method can help hoteliers make more informed decisions on how to improve their services and thus increase customer revisit occurrences.

Original languageEnglish
Article number60
JournalJournal of Big Data
Issue number1
Early online date6 May 2023
Publication statusPublished - 1 Dec 2023


  • Explainable machine learning
  • Negation detection
  • Revisit intention
  • Text classification
  • Topic modelling


Dive into the research topics of 'Explaining tourist revisit intention using natural language processing and classification techniques'. Together they form a unique fingerprint.

Cite this