To improve upon early detection of Classical Swine Fever, we are learning selective Naive Bayesian classifiers from data that were collected during an outbreak of the disease in the Netherlands. The available dataset exhibits a lack of distinction between absence of a clinical symptom and the symptom not having been addressed or observed. Such a lack of distinction is not uncommonly found in biomedical datasets. In this paper, we study the effect that not distinguishing between absent and non-observed features may have on the subset of features that is selected upon learning a selective classifier. We show that while the results from the filter approach to feature selection are quite robust, the results from the wrapper approach are not.
- classical swine-fever
Geenen, P. L., van der Gaag, L. C., Loeffen, W. L. A., & Elbers, A. R. W. (2004). On the robustness of feature selection with absent and non-observed features. Lecture Notes in Computer Science, 3337, 148-159. https://doi.org/10.1007/978-3-540-30547-7_16