Investigating Gene Relationships in Microarray Expressions:  Approaches Using Clustering Algorithms

Hasan, Mohammad Shabbir

Keyword Search

School Logo

HasanMoS.the (final comments 3).pdf (2.56 MB)

Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms

Author Info

Hasan, Mohammad Shabbir

Permalink:

http://rave.ohiolink.edu/etdc/view?acc_num=akron1376536496

Year and Degree

2013, Master of Science, University of Akron, Computer Science.

Abstract

DNA Microarray technology has provided a very convenient way to concurrently investigate the expression levels of thousands of genes in a collection of related samples during different biological processes. Researchers from different disciplines such as computer science and biology have found it very much interesting and meaningful to group genes based on the similarity of their expression patterns. Different clustering algorithms such as hierarchical clustering, k-means clustering, self-organizing maps have been applied to group of genes with similar expression patterns. However each of these traditional clustering algorithms suffers from different limitations. Beside these clustering algorithms, there are some other algorithms to group similar items together. Ford Fulkerson algorithm which is based on maximum flow – minimum cut approach is one of them and it is widely used for community discovery in web graphs. In this research work, we aimed to group genes with similar expression pattern using two different approaches: one is k-means clustering combined with hierarchical clustering and another is maximum flow – minimum cut approach in association with Dijkstra’s algorithm to select source and sink node. We use a publicly available microarray data on Adenocarcinoma which is the most frequent type of non-small-cell cancers. This dataset is available in the Gene Expression Omnibus which is a public functional genomics data repository. This dataset contains samples of five different groups: normal tissue, tissues with EGFR mutation, tissues with KRAS mutation, tissues with EML4-ALK fusion and tissues with EGFR, KRAS, EML4-ALK negative cases. We investigate a number of representative genes from the group of normal tissue and from the group of KRAS mutation tissues which is also termed as KRAS positive groups in this study. We clustered the genes for both of these groups. Finally we used Gene Ontology database to find the change in the enrichment of molecular functions of the genes contained in each cluster discovered by the above mentioned approaches for both normal and KRAS positive dataset. We discovered that both of these approaches can group genes with similar expression pattern together and hence we proposed that these approaches can be used in future for clustering microarray data.

Committee

Zhong-Hui Duan, Dr. (Advisor)
Yingcai Xiao, Dr. (Committee Member)
Kathy Liszka, Dr. (Committee Member)

Pages

72 p.

Subject Headings

Computer Science

Keywords

Gene Relation, Microarray, Gene Expression, Clustering

Hasan, M. S. (2013). Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms [Master's thesis, University of Akron]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=akron1376536496
APA Style (7th edition)
Hasan, Mohammad Shabbir. Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms. 2013. University of Akron, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=akron1376536496.
MLA Style (8th edition)
Hasan, Mohammad Shabbir. "Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms." Master's thesis, University of Akron, 2013. http://rave.ohiolink.edu/etdc/view?acc_num=akron1376536496
Chicago Manual of Style (17th edition)

Document number:

akron1376536496

Download Count:

842

Copyright Info

Global Search Box

Files

File List

ETD Abstract Container

Abstract Header

Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms

Abstract Details

Recommended Citations

Citations

Abstract Footer

Global Footer

Ohio Department of Higher Education

State Government Links

Education Links

Global Search Box

Files

File List

ETD Abstract Container

Abstract Header

Investigating Gene Relationships in Microarray Expressions: Approaches Using Clustering Algorithms

Abstract Details

Recommended CitationsRefworksEndNoteRISMendeley

Citations

Abstract Footer

Global Footer

Ohio Department of Higher Education

State Government Links

Education Links

Recommended Citations