Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases

DePero, Andrew Joseph

Abstract Details

2022, Master of Computer Science, Miami University, Computer Science and Software Engineering.
NoSQL database systems are useful for managing large and diverse data sets associated with Big Data. Highly diverse data sets contain data with different structures, but often there are no readily available schemas describing the structures. The lack of a uniform structure for data may make it difficult to understand and query a database. Recent research and industry software tools extract some aspects of the structures inherent in a NoSQL database; most tools provide a schema that gives the union of attributes across all objects, termed a union schema. Some provide sample values for attributes. We present Schemalysis, a tool for analyzing and displaying the sub-schemas of a document NoSQL database along with example instances. The web application implements an algorithm that reads objects and detects individual sub-schemas of each document in a document database, as well as the database’s union schema. We also conduct three different case studies to validate the functionality of Schemalysis with real-world data and compare and contrast to existing tools for extracting schemas.
Karen Davis (Advisor)
Alan Ferrenberg (Committee Member)
James Kiper (Committee Member)
100 p.

Recommended Citations

Citations

  • DePero, A. J. (2022). Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases [Master's thesis, Miami University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271

    APA Style (7th edition)

  • DePero, Andrew. Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases. 2022. Miami University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271.

    MLA Style (8th edition)

  • DePero, Andrew. "Schemalysis: Visualization of a Sub-Schemas in Document NoSQL Databases." Master's thesis, Miami University, 2022. http://rave.ohiolink.edu/etdc/view?acc_num=miami1670965282027271

    Chicago Manual of Style (17th edition)