Date of Award

Spring 2013

Document Type

Thesis

Degree Name

Master of Science in Bioinformatics - (M.S.)

Department

Computer Science

First Advisor

Usman W. Roshan

Second Advisor

Zhi Wei

Third Advisor

Alexandros V. Gerbessiotis

Abstract

With the recent advances in the next generation sequencing technologies, short read sequences of human genome are made more accessible. Paired end sequencing of short reads is currently the most sensitive method for detecting somatic mutations that arise during tumor development. In this study, a novel approach to optimize the detection of structural variants using a new short read alignment program is presented.

Pairwise interaction effects of the Single Nucleotide Polymorphisms (SNPs) have proven to uncover the underlying complex disease traits. Computing the disease risk based on the interaction effects of SNPs on a case - control study is a difficult problem. As another part of the thesis, a fast GPU program that can calculate the chi-square statistics of SNP-SNP interactions and output the significant interacting SNPs is presented. The algorithm is applied to the datasets of seven common diseases obtained from Wellcome Trust Case Control Consortium (WTCCC). The algorithm computed the significant SNP pairs much faster than the existing algorithms and also identifies 3 significant pairs associated with genes IL23R and C11orf30 which are associated with pathogenesis in the Crohns disease dataset.

Share

COinS