World Scientific
iDNA6mA-Rice-DL: A local web server for identifying DNA N6-methyladenine sites in rice genome by deep learning method

    https://doi.org/10.1142/S0219720021500190

    Accurate detection of N6-methyladenine (6mA) sites by biochemical experiments helps to reveal their biological functions, but these wet-lab experiments are laborious and expensive. A powerful computational model is therefore needed to identify 6mA sites on a genomic scale, especially in plant genomes. To this end, we propose iDNA6mA-Rice-DL, a deep learning model for identifying 6mA sites in the rice genome. Traditional machine learning methods require features to be prepared before analysis. In contrast, our model automatically encodes and extracts key DNA features through an embedding layer and several groups of dense layers. We evaluated the generalization ability of the model on an independent dataset and obtained an area under the receiver operating characteristic curve (auROC) of 0.98 with an accuracy of 95.96%. These results demonstrate that the model performs well in predicting 6mA sites in the rice genome. A user-friendly local web server has been established, and its Docker image can be freely downloaded at https://hub.docker.com/r/his1server/idna6ma-rice-dl.
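The pipeline described in the abstract (integer-encode the sequence, look up embeddings, pass them through dense layers, then score predictions by auROC and accuracy) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `TinyModel`, the 41-nucleotide window, the embedding size, the hidden layer widths, and the random untrained weights are all assumptions chosen only to show the data flow.

```python
import math
import random

# Assumed vocabulary: A, C, G, T plus one shared index (4) for any other symbol.
TOKEN = {base: i for i, base in enumerate("ACGT")}

def encode(seq):
    """Map a DNA string to integer tokens (A=0, C=1, G=2, T=3, other=4)."""
    return [TOKEN.get(base, 4) for base in seq.upper()]

def init_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these by training."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def relu(z):
    return max(0.0, z)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def dense(x, weights, bias, activation):
    """Fully connected layer: activation(x @ W + b)."""
    return [
        activation(bias[j] + sum(x[i] * weights[i][j] for i in range(len(x))))
        for j in range(len(bias))
    ]

class TinyModel:
    """Hypothetical embedding + dense-layer classifier (untrained)."""

    def __init__(self, seq_len, embed_dim=4, hidden=(16, 8), seed=0):
        rng = random.Random(seed)
        self.seq_len = seq_len
        self.embedding = init_matrix(5, embed_dim, rng)  # one row per token
        sizes = [seq_len * embed_dim, *hidden, 1]
        self.layers = [
            (init_matrix(a, b, rng), [0.0] * b) for a, b in zip(sizes, sizes[1:])
        ]

    def predict(self, seq):
        """Return a score in (0, 1) for the sequence being a 6mA site."""
        if len(seq) != self.seq_len:
            raise ValueError(f"expected {self.seq_len} nucleotides, got {len(seq)}")
        # Embedding lookup, then flatten into one feature vector.
        x = [v for token in encode(seq) for v in self.embedding[token]]
        for k, (w, b) in enumerate(self.layers):
            is_last = k == len(self.layers) - 1
            x = dense(x, w, b, sigmoid if is_last else relu)
        return x[0]

def auroc(labels, scores):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def accuracy(labels, scores, threshold=0.5):
    """Fraction of samples whose thresholded score matches the label."""
    hits = sum((s >= threshold) == (y == 1) for y, s in zip(labels, scores))
    return hits / len(labels)

if __name__ == "__main__":
    model = TinyModel(seq_len=41)  # 41 nt window is an assumption
    print(model.predict("ACGT" * 10 + "A"))
    print(auroc([1, 1, 0, 0], [0.9, 0.4, 0.6, 0.1]))
    print(accuracy([1, 1, 0, 0], [0.9, 0.4, 0.6, 0.1]))
```

The pairwise auROC above is quadratic in the number of samples, which is fine for illustration; for a dataset of realistic size, a rank-based computation or an established library routine would be the more practical choice.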
