|Identification of Cancer Drivers using Classification of large DNA Methylation Dataset|
|Shivam Kumar, Sanchi Bansal, Shubham Pandey, Stuti Saxena, Prof. Rohini Khalkar|
|Cite This Article:|
Identification of Cancer Drivers using Classification of large DNA Methylation Dataset, International Journal of Innovative Research in Technology (www.ijirt.org), ISSN: 2349-6002, Volume 5, Issue 10, Page(s): 69-73, March 2019, Available: IJIRT147638_PAPER.pdf
|DNA methylation; machine learning; cancer; disease diagnostic predictive models; algorithms and techniques to speed up the analysis of big medical data; classification|
|DNA methylation is a well-studied genetic modification that is crucial for regulating the functioning of the genome. Alterations in DNA methylation play a vital role in tumor generation (tumorigenesis) and tumor suppression. Studying DNA methylation data may therefore help identify molecules or other elements in the body that indicate the presence of cancer. The amount of publicly available DNA methylation data is huge, and the number of methylated sites (features) in the genome is high, so technology for efficiently processing very large datasets is essential. Using big data technologies, we propose an algorithm that applies supervised learning, in the form of classification methods, to datasets with a large number of features. The algorithm iteratively deletes the features selected by a classification model and retrains, which makes it possible to extract multiple equivalent classification models. The experiments will be executed on DNA methylation datasets extracted from The Cancer Genome Atlas, focusing on three types of tumors: breast, kidney, and thyroid carcinomas. Several methylated sites and their associated genes will be extracted, and the samples will be classified accurately on the basis of these sites. We will then study the performance of our algorithm and compare it with other classifiers and with an existing approach used to analyze this data, namely a widespread DNA methylation analysis method based on network analysis. Finally, we will be able to efficiently compute multiple alternative classification models and extract from large DNA methylation datasets a set of candidate genes to be examined further to determine their role in cancer.|
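The iterative deletion idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the classifier, so a one-feature threshold rule (a decision stump) is swapped in for simplicity, and the data is synthetic rather than taken from The Cancer Genome Atlas. All names (make_synthetic_data, best_stump, alternative_models) and the min_accuracy cutoff are hypothetical and chosen only for illustration.

```python
import random


def make_synthetic_data(n_per_class=30, n_informative=3, n_noise=5, seed=0):
    """Build toy methylation rows (synthetic, not TCGA): label 1 = tumor, 0 = normal."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            # Informative sites: clearly higher methylation in tumor samples.
            lo, hi = (0.6, 0.9) if label else (0.1, 0.4)
            row = [rng.uniform(lo, hi) for _ in range(n_informative)]
            # Noise sites: same distribution in both classes.
            row += [rng.uniform(0.0, 1.0) for _ in range(n_noise)]
            X.append(row)
            y.append(label)
    return X, y


def best_stump(X, y, features):
    """Return (accuracy, feature, threshold, polarity) of the best one-feature rule.

    Polarity 1 means "predict tumor if value > threshold"; polarity 0 is the reverse.
    """
    best = (0.0, None, None, None)
    n = len(y)
    for f in features:
        values = sorted({row[f] for row in X})
        # Candidate thresholds lie midway between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            correct = sum((row[f] > t) == bool(label) for row, label in zip(X, y))
            for polarity, acc in ((1, correct / n), (0, 1 - correct / n)):
                if acc > best[0]:
                    best = (acc, f, t, polarity)
    return best


def alternative_models(X, y, min_accuracy=0.9, max_models=10):
    """Fit a model, delete the feature it selected, and refit on what remains.

    Stops when no remaining feature yields a model above min_accuracy
    (an assumed stopping rule, not one stated in the abstract).
    """
    remaining = set(range(len(X[0])))
    models = []
    while remaining and len(models) < max_models:
        acc, f, t, polarity = best_stump(X, y, sorted(remaining))
        if f is None or acc < min_accuracy:
            break
        models.append({"feature": f, "threshold": t,
                       "polarity": polarity, "accuracy": acc})
        remaining.discard(f)  # iterative deletion of the selected feature
    return models


X, y = make_synthetic_data()
models = alternative_models(X, y)
for m in models:
    print(m)
```

Each returned model relies on a different feature, so the union of selected features plays the role of the candidate site list that the abstract proposes to map to genes. A real analysis would evaluate accuracy on held-out samples rather than on the training data, and would use a stronger classifier than a stump.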
|Unique Paper ID: 147638|
Publication Volume & Issue: Volume 5, Issue 10
Page(s): 69 - 73