dc.contributor.advisor |
Dougherty, John P. |
|
dc.contributor.author |
Lo, Ethan |
|
dc.date.accessioned |
2011-09-30T16:10:45Z |
|
dc.date.available |
2011-09-30T16:10:45Z |
|
dc.date.issued |
2011 |
|
dc.identifier.uri |
http://hdl.handle.net/10066/7516 |
|
dc.description.abstract |
Missing data is found in virtually every large database, and can be a significant challenge for researchers who want to create analyses on the data. Several approaches have been developed to deal with missing data, but many of them are either ineffective or computationally expensive. However, an approach called Multiple Imputation (MI) is widely used due to its simplicity and effectiveness. MI creates several different sets of full databases (with missing data predicted from existing values of other variables), and then analyzes each one. These results are then combined to create one overall analysis. This seems very straightforward, but is not without problems. There are specific assumptions made when dealing with MI, and ignoring these assumptions can lead to invalid conclusions. Assumptions such as not having missing values in the predictors, not normalizing data, and not looking at missing data problems can all contribute to incorrect conclusions. However, if carefully implemented, MI is a powerful and simple tool that can be applied to almost any missing data problem. |
|
dc.description.sponsorship |
Haverford College. Department of Computer Science |
|
dc.language.iso |
eng |
|
dc.rights.uri |
http://creativecommons.org/licenses/by-nc/3.0/us/ |
|
dc.subject.lcsh |
Database searching -- Methodology |
|
dc.subject.lcsh |
Multiple imputation (Statistics) |
|
dc.title |
Multiple Imputation: A Solution for Nonresponse |
|
dc.type |
Thesis |
|
dc.rights.access |
Haverford users only |
|