IDENTIFICATION OF DIFFERENTIALLY EXPRESSED GENES USING BI-CLUSTERING

B. SATHWIKA, D. VINUSHA, CH.NARESH BABU, G.C. MANVITHA, PAVITHRA. S

Abstract


Inorder to identify the biologically relevant gene module and also which are the genes responsible for causing a disease,we use Biclustering technique which is also a useful co-clustering technique.In this paper, we present a exceptional method to state specific gene modules and also functionally related gene modules which are the reason for disease causing ,by applying a genetic algorithm to genetic data which is in the form of microarray data. To detect these differentially expressed gene modules, the anticipated method finds biclusters in which genes are overexpressed or under expressed, and also which  are differentially-expressed in the samples of genetic data. Inorder to get the differentially expressed we perform three steps in which we use K means alogorithm for clustering and Cheng and Chruch algorithm for biclustering. As to overcome the drawbacks in clustering  we use Biclustering technique which reduce redundancy in the data.The ensuing gene modules uncover preferable exhibitions over near techniques in the GO (Gene Ontology) term enhancement test and an analyzed association between gene modules and infection.

 


Keywords


Bioinformatics, Raw data, analysing, Preprocessing, Clustering, Bi-clustering

Full Text:

PDF

References


Hartigan JA. Direct clustering of a data matrix. J AM stat Assoc. 67(337): pp.123-9,1972.

Y.Cheng and G. M. Church, Biclustering of expression data. Inproc.of the international conference on intelligent system for molecular biology. pp.93-103,2000.

Hua QU, Liu-Pu Wang and Chun-Guo Wu. An improved biclustering algorithm and its applications to gene expression spectrum analysis. Genomics, Proteomics and Bioinformatics, Elsevier. 3(3):pp.189-193,2016

Ben-Dor A, Chor B, Karp R, and Yakhini Z. Discovering local structure in gene expression data: The order-preserving sub A matrix problem, In Proc. International Conference on Computational Biology, pp.49-57, 2002.

Fadhl M. Al-Akwaa. Analysis of gene expression data using Bi-clustering algorithms.2012.

Wang Z, Gerstein M and Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 10(1):pp.57-63, 2009.

Bozdag D, Kumar A and Catalyurek UV, Comparative analysis of biclustering algorithms, In: Proceedings of 1st ACM, International Conference Bioinformatics and Computational Biology, pp.265274, 2010.

Cano C, Adarve L, Lopez L and Blanco A, Possibilistic approach for biclustering microarray data, Computers in Biology and Medicine, 37, pp.1426-1436, 2007.

Ahn, Youngmi Yoon, Jaegyoon Sanghyun Park, Noise-robust algorithm for identifying functionally associated biclusters from gene expression data, Information Sciences, 181 pp.435-449, 2011.

Shahreen Kasim, Safaai Deris, Razib M. and Othman, Multi-stage filtering for improving confidence level and determining dominant clusters in clustering algorithms of gene expression data, Computers in Biology and Medicine, 43(9), pp.1120-1133, 2013.

Tanay, A., Sharan, R., and Shamir, R, Biclustering algorithms: A survey, In Handbook of Computational Molecular Biology, S. Aluru, Ed, Chapman and Hall, 2006.

www.ncbi.nlm.nih.gov

G.F. Berriz, O.D. King, B. Bryant, C. Sander, F.P. Roth, Characterizing gene sets with Func Associate, Bioinformatics, 19, pp.2502-2504, 2003.




DOI: https://doi.org/10.26483/ijarcs.v11i0.6597

Refbacks

  • There are currently no refbacks.




Copyright (c) 2020 International Journal of Advanced Research in Computer Science