On the robustness of feature selection with absent and non-observed features

P.L. Geenen, L.C. van der Gaag, W.L.A. Loeffen, A.R.W. Elbers

    Research output: Contribution to journal › Article › Academic › peer-review

    3 Citations (Scopus)

    Abstract

    To improve upon early detection of Classical Swine Fever, we are learning selective Naive Bayesian classifiers from data that were collected during an outbreak of the disease in the Netherlands. The available dataset exhibits a lack of distinction between absence of a clinical symptom and the symptom not having been addressed or observed. Such a lack of distinction is not uncommonly found in biomedical datasets. In this paper, we study the effect that not distinguishing between absent and non-observed features may have on the subset of features that is selected upon learning a selective classifier. We show that while the results from the filter approach to feature selection are quite robust, the results from the wrapper approach are not.
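The conflation described in the abstract can be illustrated with a minimal sketch. It assumes mutual information as the filter criterion and uses a made-up toy dataset; neither is taken from the paper, and the names `mutual_information` and `filter_score` are hypothetical. Dropping non-observed records is also only one of several possible ways to treat them. The sketch shows how a single feature's score can shift when "not observed" is recoded as "absent"; it does not reproduce the paper's experiments, which report that the feature subsets selected by the filter approach remained quite robust.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


def filter_score(feature, labels, missing_as_absent):
    """Score one feature against the class label (assumed filter criterion).

    missing_as_absent=True:  None is recoded to 0, as in a conflated dataset.
    missing_as_absent=False: records where the feature is None are dropped.
    """
    if missing_as_absent:
        pairs = [(0 if f is None else f, y) for f, y in zip(feature, labels)]
    else:
        pairs = [(f, y) for f, y in zip(feature, labels) if f is not None]
    xs, ys = zip(*pairs)
    return mutual_information(xs, ys)


# Hypothetical toy data: 1 = symptom present, 0 = absent, None = not observed.
symptom = [1, 1, 1, None, None, 0, 0, 0, None, 1]
infected = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

conflated = filter_score(symptom, infected, missing_as_absent=True)
separated = filter_score(symptom, infected, missing_as_absent=False)
print(f"score with None recoded as absent: {conflated:.3f}")
print(f"score with None records dropped:   {separated:.3f}")
```

In this toy case the score is lower when non-observed values are recoded as absent, because infected animals whose symptom was never checked are counted as symptom-free. Whether such shifts change the selected subset depends on how the other features are affected, which is the question the paper studies.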
    Original language: English
    Pages (from-to): 148-159
    Journal: Lecture Notes in Computer Science
    Volume: 3337
    Publication status: Published - 2004

    Keywords

    • classical swine-fever
    • epidemic
