###
计算机系统应用英文版:2018,27(1):143-148
本文二维码信息
码上扫一扫!
基于R语言的互信息网络模型在乳腺癌易感基因检测分析中的应用
(中国石油大学(华东)计算机与通信工程学院, 青岛 266580)
Application of Mutual Information Network in Detection and Analysis of Breast Cancer Susceptibility Genes Using R Language
(College of Computer and Communication Engineering, China University of Petroleum, Qingdao 266580, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 1669次   下载 1955
Received:March 28, 2017    Revised:April 20, 2017
中文摘要: 全基因组关联研究(Genome-wide association studies,GWAS)是指在基因水平上进行关联分析来寻找致病基因的方法. 传统的研究方法没有考虑到基因之间的相互作用,而且在复杂的因素情形下往往效率、准确率较低. 针对上述难题,本文提出一种基于互信息的结构性关键SNPs集合选取方法. 在互信息理论和仿真数据的基础之上,逆向构建SNPs互信息网络,给定互信息一个阈值范围,找到对应阈值下相关统计量进行比较分析,选取出合适的阈值. 根据选取的阈值,筛选出对网络结构有明显影响效果的“结构性关键SNPs”. 实验结果表明:本文采用的参数取值方法能够准确快速地筛选出对网络结构有明显影响效果的关键SNPs.
Abstract:Genome-wide association studies (GWAS) refer to the method that uses correlation analysis to identify disease associated genes. Traditional research method did not consider the interaction between genes and had low accuracy and efficiency in the case of complex factors. Aimed at these aforementioned problems, this paper presents a key SNPs selecting algorithm based on mutual information. It constructs reversely the SNPs interaction network using simulation data based on the theory of mutual information and compares the difference of the statistics of SNPs interaction networks between case and control groups with the increase of the mutual information threshold. According to the selected threshold, we select the structural key SNPs. The results of experiments show that the method of parameter selection presented in this paper is useful to select the structural key SNPs.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(61572522)
引用文本:
王淑栋,张善强,贺思程.基于R语言的互信息网络模型在乳腺癌易感基因检测分析中的应用.计算机系统应用,2018,27(1):143-148
WANG Shu-Dong,ZHANG Shan-Qiang,HE Si-Cheng.Application of Mutual Information Network in Detection and Analysis of Breast Cancer Susceptibility Genes Using R Language.COMPUTER SYSTEMS APPLICATIONS,2018,27(1):143-148