Repository logo
Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Is Group Means Imputation Any Better Than Mean Imputation: A Study Using C5.0 Classifier

dc.contributor.authorFaizan U F Khan
dc.contributor.authorKashan U Z Khan
dc.contributor.authorS K Singh
dc.date.accessioned2019-08-08T10:11:09Z
dc.date.available2019-08-08T10:11:09Z
dc.date.issued2018-07-23
dc.description.abstractSince most data-driven systems including classifiers require large amounts of complete data, the task of handling missing data has garnered much attention. If one of the variables under study in a dataset has some incomplete values, it is treated as a missing data problem. Various methods in the literature exist for dealing with missing data including complete case analysis, listwise deletion, single imputation and multiple imputations. Out of these, mean imputation remains a favourite for researchers due to its simplicity and ease of use, despite some glaring flaws. In this paper, we compare Mean imputation with a similar single imputation method – Group Means imputation – and present our results on nine real-world datasets with respect to classifier accuracy of the C5.0 classifier on the imputed dataset. We show that while Group Means imputation fares better on training data, the test set accuracies fall in favour of Mean Imputation, which deals with novel data in a much better fashionen_US
dc.identifier.issn17426588
dc.identifier.urihttps://idr-sdlib.iitbhu.ac.in/handle/123456789/363
dc.language.isoenen_US
dc.publisherInstitute of Physics Publishingen_US
dc.titleIs Group Means Imputation Any Better Than Mean Imputation: A Study Using C5.0 Classifieren_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Is-Group-Means-Imputation-Any-Better-Than-Mean-Imputation-A-Study-Using-C50-Classifier2018Journal-of-Physics-Conference-Series.pdf
Size:
698.92 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: