首页  |  English  |  中国科学院
  • 学术报告
Search for Risk Haplotype Segments with GWAS Data by Use of Finite Mixture Models
主讲:Prof. Jian Zhang(University of Kent)
举办时间:2015.7.9;9:30am    地点:S309

摘要:  The region-based association analysis has been proposed to capture the  collective behavior of sets of variants by testing the association of each set instead of individual variants with the disease.  Such an analysis typically involves a list of unphased multiple-locus genotypes with potentially sparse frequencies in cases and controls.  To tackle the problem of the sparse distribution,  a two-stage approach was proposed in literature: In the first stage, haplotypes are computationally inferred from genotypes, followed by a haplotype co-classification. In the second stage, the association analysis is performed on the inferred haplotype groups. If a haplotype is unevenly distributed between the case and control samples, this  haplotype is labeled as a risk haplotype. Unfortunately, the in-silico reconstruction of haplotypes might produce a proportion of  false haplotypes which hamper the detection of rare but true  haplotypes. Here, to address the issue, we propose an alternative approach: In Stage 1, we cluster genotypes instead of inferred haplotypes and estimate the risk genotypes based on a finite mixture model. In Stage 2, we infer risk haplotypes from risk genotypes inferred from the previous stage.  To estimate the finite mixture model, we propose an EM algorithm with a novel data partition-based initialization. The performance of the proposed procedure is assessed by simulation studies and a real data analysis. Compared to the existing  multiple Z-test procedure, we find that the power of genome-wide association studies can be increased by using the proposed procedure. This talk is based on a joint work with Dr. Fadhaa Ali. 

附件下载:
中国科学院系统科学研究所 2013 版权所有 京ICP备05002810号-1
北京市海淀区中关村东路55号 邮政编码:100190, 中国科学院系统科学研究所
电话:86-10-82541881  网址:http://iss.amss.cas.cn/