Topological Analysis of Averaged Sentence Embeddings

2020, Master of Science (MS), Wright State University, Computer Science.
Sentence embeddings are frequently generated using complex, pretrained models trained on a very general corpus of data. This thesis explores a potential alternative method for efficiently generating high-quality sentence embeddings for highly specialized corpora. A framework for visualizing and analyzing sentence embeddings is developed to help assess their quality for a highly specialized corpus of documents related to the 2019 coronavirus epidemic. A Topological Data Analysis (TDA) technique is explored as an alternative method for grouping embeddings in document clustering and topic modeling tasks, and its effectiveness is compared to that of a simple clustering method. The sentence embeddings generated are found to be effective for similarity-based tasks and to group in useful ways when used with the TDA-based techniques explored as alternatives to traditional clustering-based approaches.
Michael Raymer, Ph.D. (Advisor)
Mateen Rizki, Ph.D. (Committee Member)
Krishnaprasad Thirunarayan, Ph.D. (Committee Member)
104 p.

Recommended Citations

  • Holmes, W. J. (2020). Topological Analysis of Averaged Sentence Embeddings [Master's thesis, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1609351352688467

    APA Style (7th edition)

  • Holmes, Wesley. Topological Analysis of Averaged Sentence Embeddings. 2020. Wright State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1609351352688467.

    MLA Style (8th edition)

  • Holmes, Wesley. "Topological Analysis of Averaged Sentence Embeddings." Master's thesis, Wright State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=wright1609351352688467

    Chicago Manual of Style (17th edition)