Correlation between human assessment of essays and ROUGE evaluation of essays' summaries

Seemab Latif, Mary McGee Wood, Goran Nenadic

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

    Abstract

    In this paper we address the qualitative (human) and quantitative (ROUGE) evaluation of computer-generated summaries of students' essays. The experimental results show a high positive correlation between ROUGE scores and the human assessment of the essays (human-assigned marks). We also found that human evaluation of the automatic summaries correlates positively with the human assessment of the essays. These correlations can be used to classify students' essays into broad bands of quality. © 2009 IEEE.
    Original language: English
    Title of host publication: 2009 8th International Symposium on Natural Language Processing, SNLP '09 (Int. Symp. Nat. Lang. Process., SNLP)
    Pages: 122-127
    Number of pages: 5
    Publication status: Published - 2009
    Event: 2009 8th International Symposium on Natural Language Processing, SNLP '09, Bangkok
    Duration: 1 Jul 2009 → …

    Conference

    Conference: 2009 8th International Symposium on Natural Language Processing, SNLP '09
    City: Bangkok
    Period: 1/07/09 → …
