Search ETDs:
On evaluating similarity between heterogeneous data
POPOVICI, STEFANA A.

2008, MS, University of Cincinnati, Engineering : Computer Science.

Heterogeneous data are multidimensional data whose attributes belong to different domains. Processing heterogeneous data has become an important problem in data mining. However, due to the heterogeneous nature of the data the task of measuring the similarity between two heterogeneous data objects has proven to be rather difficult.

There are plenty of similarity measures that apply to homogeneous data. Each of them is applicable for one data type and they were constructed based on particular properties of that corresponding data type. In principle, they should not be applied to other kinds of data.


This thesis is concerned with the issues encountered in proximity evaluation between heterogeneous data. It focuses on a particular, probability-based, method and discusses its suitability.


Anca Ralescu, PhD (Advisor)
Dan Ralescu, PhD (Committee Member)
Qing-An Zeng, PhD (Committee Member)
Ali Minai, PhD (Committee Member)
93 p.

Recommended Citations

Hide/Show APA Citation

POPOVICI, S. (2008). On evaluating similarity between heterogeneous data. (Electronic Thesis or Dissertation). Retrieved from https://etd.ohiolink.edu/

Hide/Show MLA Citation

POPOVICI, STEFANA. "On evaluating similarity between heterogeneous data." Electronic Thesis or Dissertation. University of Cincinnati, 2008. OhioLINK Electronic Theses and Dissertations Center. 20 Apr 2015.

Hide/Show Chicago Citation

POPOVICI, STEFANA "On evaluating similarity between heterogeneous data." Electronic Thesis or Dissertation. University of Cincinnati, 2008. https://etd.ohiolink.edu/

Files

ucin1212172030.pdf (887.5 KB) View|Download