On The Stability of Feature Selection in the Presence of Feature Correlations

Konstantinos Sechidis, Konstantinos Papangelou, Sarah Nogueira, James Weatherall, Gavin Brown

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Feature selection is central to modern data science. The `stability' of a feature selection algorithm refers to the sensitivity of its choices to small changes in training data. This is, in effect, the robustness of the chosen features. This paper considers the estimation of stability when we expect strong pairwise correlations, otherwise known as feature redundancy. We demonstrate that existing measures are inappropriate here, as they systematically underestimate the true stability, giving an overly pessimistic view of a feature set. We propose a new statistical measure which overcomes this issue, and generalises previous work.
Original languageEnglish
Title of host publicationEuropean Conference on Machine Learning
Publication statusAccepted/In press - 7 Jun 2019
EventEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - Wurzburg, Germany
Duration: 16 Sept 201920 Sept 2019

Conference

ConferenceEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
Abbreviated titleECMLPKDD
Country/TerritoryGermany
CityWurzburg
Period16/09/1920/09/19

Fingerprint

Dive into the research topics of 'On The Stability of Feature Selection in the Presence of Feature Correlations'. Together they form a unique fingerprint.

Cite this