Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
Heckman_DJ_Updated11018.pdf (1.34 MB)
ETD Abstract Container
Abstract Header
A Comparison of Classification Methods in Predicting the Presence of DNA Profiles in Sexual Assault Kits
Author Info
Heckman, Derek J
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1513703948257233
Abstract Details
Year and Degree
2018, Master of Science (MS), Bowling Green State University, Applied Statistics (Math).
Abstract
In 2014 Ohio began the Sexual Assault Kit Testing Initiative with the goal of analyzing all previously untested sexual assault kits (SAKs). Approximately 13,900 previously untested SAKs were sent for forensic analysis. Of these SAKs, a sample of 2,500 was drawn for statistical analysis. The goal was to gain some general information about the SAKs as well as to answer a variety of specific questions in the hopes of producing cost-saving measures in the future. Questions considered were those such as: which forensic samples most consistently produce Combined DNA Index System (CODIS)-eligible DNA profiles, what factors predict whether or not a kit will contain a DNA profile foreign to the victim, as well as others. The results of the initiative were published in Kerka, Heckman, Maddox, Sprague, & Albert (2018). This thesis expands upon the work in the aforementioned article. In the article, a logistic regression model was constructed to predict whether or not an SAK would contain a CODISeligible DNA profile. It was estimated to have a misclassification rate of 34.2%. This thesis compares three other models to the logistic regression model to determine if any improvements in performance can be made. The models tested were decision trees, bagged trees and random forests. The decision tree had an estimated misclassification rate of 29.7%, thus offering a moderate improvement over the logistic regression model. In addition, the same models were compared for their ability to predict which SAKs would contain duplicate DNA profiles across multiple forensic samples (vaginal sample, anal sample, etc). No model was able to do a satisfactory job of predicting this response.
Committee
Jim Albert, Ph.D. (Advisor)
Arkajyoti Roy, Ph.D. (Committee Member)
Jon Sprague, Ph.D. (Committee Member)
Pages
88 p.
Subject Headings
Criminology
;
Mathematics
;
Statistics
Keywords
Statistics
;
DNA Profiles
;
Sexual Assault
;
Sexual Assault Kits
;
Logistic Regression
;
Decision Trees
;
Bagged Trees
;
Random Forests
;
SAKs
;
Classification
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Heckman, D. J. (2018).
A Comparison of Classification Methods in Predicting the Presence of DNA Profiles in Sexual Assault Kits
[Master's thesis, Bowling Green State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1513703948257233
APA Style (7th edition)
Heckman, Derek.
A Comparison of Classification Methods in Predicting the Presence of DNA Profiles in Sexual Assault Kits.
2018. Bowling Green State University, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1513703948257233.
MLA Style (8th edition)
Heckman, Derek. "A Comparison of Classification Methods in Predicting the Presence of DNA Profiles in Sexual Assault Kits." Master's thesis, Bowling Green State University, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1513703948257233
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
bgsu1513703948257233
Download Count:
484
Copyright Info
© 2017, all rights reserved.
This open access ETD is published by Bowling Green State University and OhioLINK.