A study on aligning documents using the Circle of Interest technique

Daniel Joseph, César A. Marín

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this paper we present a study on applying a technique called Circle of Interest, together with Formal Concept Analysis and Rough Set Theory, to semantically align documents such as those found in a business domain. When companies engage in business, it becomes crucial to preserve the semantics of the information they exchange, usually in the form of business documents. Typical approaches are either impractical or costly to implement. In contrast, we use the concepts and relationships discovered within an exchanged business document to automatically find an alignment to a local interpretation, known as a document type. We present experimental results on applying Formal Concept Analysis as the ontological representation of documents, the Circle of Interest for selecting the most relevant candidate document types, and Rough Set Theory for discerning among them. The results on a set of business documents show the feasibility of our approach and its direct applicability to a business domain.
Original language: English
Title of host publication: ICSOFT 2010 - Proceedings of the 5th International Conference on Software and Data Technologies (ICSOFT - Proc. Int. Conf. Softw. Data Technol.)
Publisher: Science and Technology Publications Lda
Pages: 374-383
Number of pages: 9
Volume: 2
ISBN (Print): 9789898425225
Publication status: Published - 2010
Event: 5th International Conference on Software and Data Technologies, ICSOFT 2010 - Athens
Duration: 1 Jul 2010 → …

Conference

Conference: 5th International Conference on Software and Data Technologies, ICSOFT 2010
City: Athens
Period: 1/07/10 → …

Keywords

  • Circle of Interest
  • Document alignment
  • Formal Concept Analysis
  • Rough Set Theory
  • Semantic alignment
