Files
ADePeroMastersThesis.pdf (12.02 MB)
Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases
Author Info
DePero, Andrew Joseph
ORCID® Identifier
http://orcid.org/0000-0001-5962-8140
Permalink: http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271
Abstract Details
Year and Degree
2022, Master of Computer Science, Miami University, Computer Science and Software Engineering.
Abstract
NoSQL database systems are useful for managing the large and diverse data sets associated with Big Data. Highly diverse data sets contain data with different structures, but often there are no readily available schemas describing those structures. The lack of a uniform structure can make a database difficult to understand and query. Recent research and industry software tools extract some aspects of the structures inherent in a NoSQL database; most tools provide a schema that gives the union of attributes across all objects, termed a union schema. Some also provide sample values for attributes. We present Schemalysis, a tool for analyzing and displaying the sub-schemas of a document NoSQL database along with example instances. The web application implements an algorithm that reads objects and detects the individual sub-schema of each document in a document database, as well as the database’s union schema. We also conduct three case studies to validate the functionality of Schemalysis with real-world data and to compare it with existing tools for extracting schemas.
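The distinction between per-document sub-schemas and the union schema can be illustrated with a minimal Python sketch. This is not the algorithm from the thesis: the function names `attribute_paths` and `summarize` are hypothetical, and treating a sub-schema as the set of dotted attribute paths in a document (ignoring value types and array contents) is a simplifying assumption made here for illustration.

```python
from collections import Counter


def attribute_paths(doc, prefix=""):
    """Return the set of dotted attribute paths in one JSON-like document.

    Assumption: only nested dicts are descended into; arrays are treated
    as leaf values.
    """
    paths = set()
    for key, value in doc.items():
        path = f"{prefix}.{key}" if prefix else key
        paths.add(path)
        if isinstance(value, dict):
            paths |= attribute_paths(value, path)
    return paths


def summarize(documents):
    """Group documents by sub-schema and compute the union schema.

    Returns (sub_schemas, union_schema, examples):
      sub_schemas  - Counter mapping each distinct sub-schema to its frequency
      union_schema - set of every attribute path seen in any document
      examples     - first document encountered for each sub-schema
    """
    sub_schemas = Counter()
    examples = {}
    for doc in documents:
        schema = frozenset(attribute_paths(doc))
        sub_schemas[schema] += 1
        examples.setdefault(schema, doc)
    union_schema = set().union(*sub_schemas) if sub_schemas else set()
    return sub_schemas, union_schema, examples


if __name__ == "__main__":
    docs = [
        {"_id": 1, "name": "Ada", "address": {"city": "Oxford"}},
        {"_id": 2, "name": "Bob"},
        {"_id": 3, "name": "Cy", "email": "cy@example.com"},
        {"_id": 4, "name": "Di"},
    ]
    sub_schemas, union_schema, examples = summarize(docs)
    for schema, count in sub_schemas.most_common():
        print(count, sorted(schema), "e.g.", examples[schema])
    print("union:", sorted(union_schema))
```

In this sketch the union schema lists every attribute that appears anywhere, while the sub-schema counts show which combinations of attributes actually occur together, each paired with one example instance.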
Committee
Karen Davis (Advisor)
Alan Ferrenberg (Committee Member)
James Kiper (Committee Member)
Pages
100 p.
Subject Headings
Computer Science
Keywords
NoSQL; document databases; union schema; sub-schema
Recommended Citations
APA Style (7th edition)
DePero, A. J. (2022). Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases [Master's thesis, Miami University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271
MLA Style (8th edition)
DePero, Andrew. Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases. 2022. Miami University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271.
Chicago Manual of Style (17th edition)
DePero, Andrew. "Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases." Master's thesis, Miami University, 2022. http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271
Document number: miami1670965282027271
Download Count: 126
Copyright Info
© 2022, all rights reserved.
This open access ETD is published by Miami University and OhioLINK.