Text Mining Workflows for Indexing Archives with Automatically Extracted Semantic Metadata

Research output: Contribution to conferenceOtherpeer-review

Abstract

With the vast amounts of textual data that many digital libraries hold, finding information relevant to users has become a challenge. The unstructured and ambiguous nature of natural language in which documents are written, poses a barrier to the accessibility and discovery of information. This can be alleviated by indexing documents with semantic metadata, e.g., by tagging them with terms that could indicate their “aboutness”. As manually indexing these documents is impracticable, automatic tools capable of generating semantic metadata and building search indexes have become attractive solutions. In this tutorial, we demonstrate how digital library developers and managers can use the Argo text mining platform to develop their own customised, modular workflows for automatic semantic metadata generation and search index construction. In this way, we are providing digital library practitioners with the necessary technical know-how on building semantic search indexes without any programming effort, owing to Argo’s graphical interface for workflow construction and execution. We believe that this in turn will allow various digital libraries to build search systems that will enable their users to find and discover information of interest more efficiently and accurately.
Original languageEnglish
Pages471-473
DOIs
Publication statusPublished - 10 Aug 2016
EventResearch and Advanced Technology for Digital Libraries - 20th International Conference on Theory and Practice of Digital Libraries, TPDL 2016, -
Duration: 8 Sept 20168 Sept 2016

Conference

ConferenceResearch and Advanced Technology for Digital Libraries - 20th International Conference on Theory and Practice of Digital Libraries, TPDL 2016,
Period8/09/168/09/16

Fingerprint

Dive into the research topics of 'Text Mining Workflows for Indexing Archives with Automatically Extracted Semantic Metadata'. Together they form a unique fingerprint.

Cite this