Journal of Integrative Bioinformatics (Dec 2011)
Improving imbalanced scientific text classification using sampling strategies and dictionaries
Abstract
Many real-world applications suffer from the class imbalance problem, in which one class is represented by far fewer examples than the others. Systems for the retrieval and classification of scientific literature are among those affected.