Doctor of Philosophy, Case Western Reserve University, 2019, EECS - Computer and Information Sciences
Biomedical ontologies and standardized terminologies play an important role in healthcare information management, extraction, and data integration. The quality of ontologies impacts its usability. One of the quality issues is not conforming lattice property, a generally applicable ontology design principle. Non-lattice structures are often indicative of anomalies in ontological systems and, as such, represent possible areas of focus for subsequent quality assurance work. Quality assurance of ontologies is an indispensable part of the terminology development cycle.
This dissertation presents a non-lattice based ontology quality assurance workflow, along with involved approaches, algorithms, and applications. The general steps of non-lattice based ontology quality assurance include: (1) extracting non-lattice fragments; (2) detecting potential defects and proposing remediation suggestions; (3) reviewing and validating these suggested remediations.
For (1), a general MapReduce pipeline, called MaPLE (MapReduce Pipeline for Lattice-based Evaluation), is developed for extracting non-lattice fragments in large partially ordered sets. Using MaPLE in a 30-node Hadoop local cloud, we systematically extracted non-lattice fragments in 8 SNOMED CT versions from 2009 to 2014, with an average total computing time of less than 3 hours per version. Compared with previous work, which took about 3 months, MaPLE makes it feasible not only to perform exhaustive structural analysis of large ontological hierarchies but also to systematically track structural changes between versions. Our change analysis showed that the average change rates on the non-lattice pairs are up to 38.6 times higher than the change rates of the background structure (concept nodes).
For (2), two methods, NEO and Spark-MCA, are proposed. NEO is a systematic structural approach for embedding of FMA fragments into the Body Structure hierarchy to understand the structural disparity of the subsumption relat (open full item for complete abstract)
Committee: Guo-Qiang Zhang (Advisor); Kenneth Loparo (Committee Chair); Xu Rong (Committee Member); Li Pan (Committee Member)
Subjects: Biomedical Research; Computer Science; Health; Information Science