Skip to Main Content

Basic Search

Skip to Search Results
 
 
 

Left Column

Filters

Right Column

Search Results

Search Results

(Total results 10)

Mini-Tools

 
 

Search Report

  • 1. Joshi, Amit Exploiting Alignments in Linked Data for Compression and Query Answering

    Doctor of Philosophy (PhD), Wright State University, 2017, Computer Science and Engineering PhD

    Linked data has experienced accelerated growth in recent years due to its interlinking ability across disparate sources, made possible via machine-processable RDF data. Today, a large number of organizations, including governments and news providers, publish data in RDF format, inviting developers to build useful applications through reuse and integration of structured data. This has led to tremendous increase in the amount of RDF data on the web. Although the growth of RDF data can be viewed as a positive sign for semantic web initiatives, it causes performance bottlenecks for RDF data management systems that store and provide access to data. In addition, a growing number of ontologies and vocabularies make retrieving data a challenging task. The aim of this research is to show how alignments in the Linked Data can be exploited to compress and query the linked datasets. First, we introduce two compression techniques that compress RDF datasets through identification and removal of semantic and contextual redundancies in linked data. Logical Linked Data Compression is a lossless compression technique which compresses a dataset by generating a set of new logical rules from the dataset and removing triples that can be inferred from these rules. Contextual Linked Data Compression is a lossy compression technique which compresses datasets by performing schema alignment and instance matching followed by pruning of alignments based on confidence value and subsequent grouping of equivalent terms. Depending on the structure of the dataset, the first technique was able to prune more than 50% of the triples. Second, we propose an Alignment based Linked Open Data Querying System (ALOQUS) that allows users to write query statements using concepts and properties not present in linked datasets and show that querying does not require a thorough understanding of the individual datasets and interconnecting relationships. Finally, we present LinkGen, a multipurpose synthetic Linke (open full item for complete abstract)

    Committee: Pascal Hitzler Ph.D. (Advisor); Guozhu Dong Ph.D. (Committee Member); Krishnaprasad Thirunaraya Ph.D. (Committee Member); Michelle Cheatham Ph.D. (Committee Member); Subhashini Ganapathy Ph.D. (Committee Member) Subjects: Computer Science
  • 2. Krisnadhi, Adila Ontology Pattern-Based Data Integration

    Doctor of Philosophy (PhD), Wright State University, 2015, Computer Science and Engineering PhD

    Data integration is concerned with providing a unified access to data residing at multiple sources. Such a unified access is realized by having a global schema and a set of mappings between the global schema and the local schemas of each data source, which specify how user queries at the global schema can be translated into queries at the local schemas. Data sources are typically developed and maintained independently, and thus, highly heterogeneous. This causes difficulties in integration because of the lack of interoperability in the aspect of architecture, data format, as well as syntax and semantics of the data. This dissertation represents a study on how small, self-contained ontologies, called ontology design patterns, can be employed to provide semantic interoperability in a cross-repository data integration system. The idea of this so-called ontology pattern- based data integration is that a collection of ontology design patterns can act as the global schema that still contains sufficient semantics, but is also flexible and simple enough to be used by linked data providers. On the one side, this differs from existing ontology-based solutions, which are based on large, monolithic ontologies that provide very rich semantics, but enforce too restrictive ontological choices, hence are shunned by many data providers. On the other side, this also differs from the purely linked data based solutions, which do offer simplicity and flexibility in data publishing, but too little in terms of semantic interoperability. We demonstrate the feasibility of this idea through the actual development of a large scale data integration project involving seven ocean science data repositories from five institutions in the U.S. In addition, we make two contributions as part of this dissertation work, which also play crucial roles in the aforementioned data integration project. First, we develop a collection of more than a dozen ontology design patterns that capture the key noti (open full item for complete abstract)

    Committee: Pascal Hitzler Ph.D. (Advisor); Krzysztof Janowicz Ph.D. (Committee Member); Khrisnaprasad Thirunarayan Ph.D. (Committee Member); Michelle Cheatham Ph.D. (Committee Member) Subjects: Computer Science; Information Systems; Information Technology; Logic
  • 3. Jain, Prateek Linked Open Data Alignment & Querying

    Doctor of Philosophy (PhD), Wright State University, 2012, Computer Science and Engineering PhD

    The recent emergence of the “Linked Data” approach for publishing data represents a major step forward in realizing the original vision of a web that can "understand and satisfy the requests of people and machines to use the web content" i.e. the Semantic Web. This new approach has resulted in the Linked Open Data (LOD) Cloud, which includes more than 295 large datasets contributed by experts belonging to diverse communities such as geography, entertainment, and life sciences. However, the current interlinks between datasets in the LOD Cloud, as we will illustrate,are too shallow to realize much of the benefits promised. If this limitation is left unaddressed, then the LOD Cloud will merely be more data that suffers from the same kinds of problems, which plague the Web of Documents, and hence the vision of the Semantic Web will fall short. This thesis presents a comprehensive solution to address the issue of alignment and relationship identification using a bootstrapping based approach. By alignment we mean the process of determining correspondences between classes and properties of ontologies. We identify subsumption, equivalence and part-of relationship between classes. The work identifies part-of relationship between instances. Between properties we will establish subsumption and equivalence relationship. By bootstrapping we mean the process of being able to utilize the information which is contained within the datasets for improving the data within them. The work showcases use of bootstrapping based methods to identify and create richer relationships between LOD datasets. The BLOOMS project (http://wiki.knoesis.org/index.php/BLOOMS) and the PLATO project, both built as part of this research, have provided evidence to the feasibility and the applicability of the solution.

    Committee: Amit Sheth PhD (Advisor); Pascal Hitzler PhD (Committee Member); Krishnaprasad Thirunarayan PhD (Committee Member); Kunal Verma PhD (Committee Member); Peter Yeh PhD (Committee Member) Subjects: Computer Science
  • 4. Pschorr, Joshua SemSOS : an Architecture for Query, Insertion, and Discovery for Semantic Sensor Networks

    Master of Science (MS), Wright State University, 2013, Computer Science

    With sensors, storage, and bandwidth becoming ever cheaper, there has been a drive recently to make sensor data accessible on the Web. However, because of the vast number of sensors collecting data about our environment, finding relevant sensors on the Web and then interpreting their observations is a non-trivial challenge. The Open Geospatial Consortium (OGC) defines a web service specification known as the Sensor Observation Service (SOS) that is designed to standardize the way sensors and sensor data are discovered and accessed on the Web. Though this standard goes a long way in providing interoperability between sensor data producers and consumers, it is predicated on the idea that the consuming application is equipped to handle raw sensor data. Sensor data consuming end-points are generally interested in not just the raw data itself, but rather actionable information regarding their environment. The approaches for dealing with this are either to make each individual consuming application smarter or to make the data served to them smarter. This thesis presents an application of the latter approach, which is accomplished by providing a more meaningful representation of sensor data by leveraging semantic web technologies. Specifically, this thesis describes an approach to sensor data modeling, reasoning, discovery, and query over richer semantic data derived from raw sensor descriptions and observations. The artifacts resulting from this research include: - an implementation of an SOS service which hews to both Sensor Web and Semantic Web standards in order to bridge the gap between syntactic and semantic sensor data consumers and that has been proven by use in a number of research applications storing large amounts of data, which serves as - an example of an approach for designing applications which integrate syntactic services over semantic models and allow for interactions with external reasoning systems. As more sensors and observations move o (open full item for complete abstract)

    Committee: Krishnaprasad Thirunarayan Ph.D. (Advisor); Amit Sheth Ph.D. (Committee Member); Bin Wang Ph.D. (Committee Member) Subjects: Computer Science; Geographic Information Science; Information Systems; Remote Sensing; Systems Design; Web Studies
  • 5. Mixter, Jeffrey Linked Data in VRA Core 4.0: Converting VRA XML Records into RDF/XML

    MLIS, Kent State University, 2013, College of Communication and Information / School of Information

    Linked Data has become an increasingly important and valuable way for sharing data across the Internet. It is the basis for the Semantic Web and allows organizations to not only easily share data, but also connect data with other related data. Visual Resource Association (VRA) Core 4 is an XML schema-based data model for cataloging cultural objects and visual resources. Using the existing VRA Core 4 restricted XML schema, a new data model was developed that took advantage of popular domain specific vocabularies. Using popular vocabularies such as Schema.org, helps ensure that data will be interoperable with other data and can potentially help improve visibility on the Internet. Using the data model as a reference, an ontology was developed using Protege ontology editor. It illustrated how popular domain specific vocabularies can be combined with the existing VRA data model to create a new semantically-rich model that still retains the specificity and detail of the original XML restricted schema. In addition to developing a new VRA data model, an XSLT stylesheet was created that demonstrated how existing XML based records could be converted into RDF data. The stylesheet was used to successfully convert a 4,150 record collection from the University of Notre Dame into RDF triples. The XSLT templates used in the stylesheet were able to not only convert the existing XML elements/attributes into RDF classes/properties but also convert the existing controlled vocabulary terms into functioning http URIs representing concepts. The study successfully demonstrated that existing data models can be enhanced to incorporate Linked Data and that existing datasets of implementation-specific XML records can be converted into RDF triples with properties defined by popular RDF vocabularies using an XSLT stylesheet.

    Committee: Marcia Zeng Ph.D. (Advisor); Yin Zhang Ph.D. (Committee Member); Athena Salaba Ph.D. (Committee Member) Subjects: Information Science; Library Science
  • 6. Chittella, Rama Someswar Leveraging Schema Information For Improved Knowledge Graph Navigation

    Master of Science (MS), Wright State University, 2019, Computer Science

    Over the years, the semantic web has emerged as a new generation of the world wide web featuring advanced technologies and research contributions. It has revolutionized the usage of information by allowing users to capture and publish machine-understandable data and expedite methods such as ontologies to perform the same. These ontologies help in the formal representation of a specified domain and foster comprehensive machine understanding. Although, the engineering of ontologies and usage of logic have been an integral part of the web semantics, new areas of research such as the semantic web search, linking and usage of open data on the web, and the subsequent use of these technologies in building semantic web applications have also become significant in recent times. One such research contribution that we are going to focus on is the browsing of linked RDF data. Semantic web advocates the methodology of linked data to publish structured data on the web. Most of the linked data is available as browsable RDF data which is built using triples that define statements in the form of subject-predicate-object. These triples can be tabulated by sorting the three parts into separate columns. To browse the linked data of semantic web, several web browsers such as CubicWeb, VisiNav and Pubby were designed. These browsers provide the users with a tabular browsing experience displaying the data in nested tables. Also, they help users navigate through various subjects and their respective objects with the help of links associated with them. Several other browsers such as Tabulator were developed which enable real-time editing of semantic web resources\cite{berners2008tabulator} However, with the tabulated interface, users may sometimes find it difficult to realize the relationships between the various documents. Also navigating using the links between subjects and its predicates inside the documents is more time consuming which makes the overall user experience tedious. To i (open full item for complete abstract)

    Committee: Pascal Hitzler Ph.D. (Advisor); Mateen M. Rizki Ph.D. (Committee Member); Yong Pei Ph.D. (Committee Member) Subjects: Computer Science
  • 7. Gunaratna, Kalpa Semantics-based Summarization of Entities in Knowledge Graphs

    Doctor of Philosophy (PhD), Wright State University, 2017, Computer Science and Engineering PhD

    The processing of structured and semi-structured content on the Web has been gaining attention with the rapid progress in the Linking Open Data project and the development of commercial knowledge graphs. Knowledge graphs capture domain-specific or encyclopedic knowledge in the form of a data layer and add rich and explicit semantics on top of the data layer to infer additional knowledge. The data layer of a knowledge graph represents entities and their descriptions. The semantic layer on top of the data layer is called the schema (ontology), where relationships of the entity descriptions, their classes, and the hierarchy of the relationships and classes are defined. Today, there exist large knowledge graphs in the research community (e.g., encyclopedic datasets like DBpedia and Yago) and corporate world (e.g., Google knowledge graph) that encapsulate a large amount of knowledge for human and machine consumption. Typically, they consist of millions of entities and billions of facts describing these entities. While it is good to have this much knowledge available on the Web for consumption, it leads to information overload, and hence proper summarization (and presentation) techniques need to be explored. In this dissertation, we focus on creating both \textit{comprehensive} and \textit{concise} entity summaries at: (i) the single entity level and (ii) the multiple entity level. To summarize a single entity, we propose a novel approach called FACeted Entity Summarization (FACES) that considers importance, which is computed by combining popularity and uniqueness, and diversity of facts getting selected for the summary. We first conceptually group facts using semantic expansion and hierarchical incremental clustering techniques and form facets (i.e., groupings) that go beyond syntactic similarity. Then we rank both the facts and facets using Information Retrieval (IR) ranking techniques to pick the highest ranked facts from these facets for the summary. The important (open full item for complete abstract)

    Committee: Amit Sheth Ph.D. (Committee Co-Chair); Krishnaprasad Thirunarayan Ph.D. (Committee Co-Chair); Keke Chen Ph.D. (Committee Member); Gong Cheng Ph.D. (Committee Member); Edward Curry Ph.D. (Committee Member); Hamid Motahari Nezhad Ph.D. (Committee Member) Subjects: Computer Science
  • 8. Clunis, Julaine Designing an Ontology for Managing the Diets of Hypertensive Individuals

    MLIS, Kent State University, 2016, College of Communication and Information / School of Information

    Making use of semantic technologies to combine various resources into one integrated environment, this study developed an ontology for hypertensive individuals to gain a better understanding of the nutrients in foods and recipes and what effect these have on the disease, prescribed drugs, and their general health. In particular, 10% of a sample of 500 recipes obtained from the web, and 10% of the food items from the USDA nutrient database was used as data in the ontology which had 75 classes, 22 object properties, and 33 data properties. The study established proto-personas to aid in development of competency questions which would be used with the Pellet reasoner to test whether the ontology could inform about nutrition goals for hypertensive individuals. The testing results provide evidence to support the idea that an ontology may be used to provide guidance to individuals with chronic disease, highlighting what foods may be safely consumed and which may cause problems. The conclusion is that an ontology can be successfully used to provide support to medical personnel and advance the cause of patient engagement as they seek to manage chronic illnesses such as hypertension.

    Committee: Marcia Zeng Ph.D. (Advisor); Rebecca Meehan Ph.D. (Advisor); Karen Gracy Ph.D. (Committee Member) Subjects: Health; Information Science; Information Systems; Library Science; Web Studies
  • 9. Petiya, Sean Building a Semantic Web of Comics: Publishing Linked Data in HTML/RDFa Using a Comic Book Ontology and Metadata Application Profiles

    MLIS, Kent State University, 2014, College of Communication and Information / School of Information

    Information about the various resources, concepts, and entities in the world of comics can be found in a wide range of systems, including those of libraries, archives, and museums, as well as the records of independent research projects. Semantic Web technologies and standards represent an opportunity to connect these resources using Linked Data. In an attempt to realize this opportunity, this thesis presents a case study for the development of a domain ontology for comic books and comic book collections. In the initial phase, reference resources and example materials were collected and consulted to develop a representative domain model and core schema. A workflow was then developed to convert common CSV data to XML and RDF/XML, replacing common values with LOD URIs using XSLT. The second phase of the study then focused on publishing Linked Data using HTML/RDFa. A review of existing information systems and an analysis of their content was conducted in order to address the usability of the vocabulary, and inform the design of a series of modularized metadata application profiles using the core schema as a base. Examples were tested for their ability to produce valid, meaningful RDF data from HTML content that was consistent with the ontology. The final result is an RDFS/OWL Web vocabulary for comics, titled the Comic Book Ontology (CBO). It is an open and extensible semantic model that identifies comics using two components: (a) the form and (b) the container. This approach allows the Ontology's conceptualization of comics to include comic books, comic strips, web comics, graphic novels, manga, or original artwork, with the potential for further describing other aspects of comics culture and scholarship, or connecting, community created data to Semantic Web applications, such as next-generation library catalogs.

    Committee: Marcia Lei Zeng Ph.D. (Advisor); Karen F. Gracy Ph.D. (Committee Member); David B. Robins Ph.D. (Committee Member) Subjects: Information Science; Information Systems; Library Science; Web Studies
  • 10. Gu, Chen Ontology Alignment Techniques for Linked Open Data Ontologies

    Master of Computer Science, Miami University, 2013, Computer Science and Software Engineering

    Ontology alignment (OA) addresses the Semantic Web challenge to enable information interoperability between related but heterogeneous ontologies. Traditional OA systems have focused on aligning well defined and structured ontologies from the same or closely related domains and producing equivalence mappings between concepts in the source and target ontologies. Linked Open Data (LOD) ontologies, however, present some different characteristics from standard ontologies. For example, equivalence relations are limited among LOD concepts; thus OA systems for LOD ontology alignment should be able to produce subclass and superclass mappings between the source and target. This thesis overviews the current research on aligning LOD ontologies. An important research aspect is the use of background knowledge in the alignment process. Two current OA systems are modified to perform alignment of LOD ontologies. For each modified OA system, experiments have been performed using a set of LOD reference alignments to evaluate their alignment results using standard OA performance measures.

    Committee: Valerie Cross PhD (Advisor); Eric Bachmann PhD (Committee Member); Michael Zmuda PhD (Committee Member) Subjects: Computer Science