JOURNAL ARTICLE

A novel method for predicting DNA N4-methylcytosine sites based on deep forest algorithm.

  • Published In: Journal of Bioinformatics & Computational Biology, 2023, v. 21, n. 1. P. 1 1 of 3

  • Database: Academic Search Ultimate 2 of 3

  • Authored By: Zhang, Yonglin; Hu, Mei; Mo, Qi; Gan, Wenli; Luo, Jiesi 3 of 3

Abstract

N4-methyladenosine (4mC) methylation is an essential epigenetic modification of deoxyribonucleic acid (DNA) that plays a key role in many biological processes such as gene expression, gene replication and transcriptional regulation. Genome-wide identification and analysis of the 4mC sites can better reveal the epigenetic mechanisms that regulate various biological processes. Although some high-throughput genomic experimental methods can effectively facilitate the identification in a genome-wide scale, they are still too expensive and laborious for routine use. Computational methods can compensate for these disadvantages, but they still leave much room for performance improvement. In this study, we develop a non-NN-style deep learning-based approach for accurately predicting 4mC sites from genomic DNA sequence. We generate various informative features represented sequence fragments around 4mC sites, and subsequently implement them into a deep forest (DF) model. After training the deep model using 10-fold cross-validation, the overall accuracies of 85.0%, 90.0%, and 87.8% were achieved for three representative model organisms, A. thaliana, C. elegans, and D. melanogaster, respectively. In addition, extensive experiment results show that our proposed approach outperforms other existing state-of-the-art predictors in the 4mC identification. Our approach stands for the first DF-based algorithm for the prediction of 4mC sites, providing a novel idea in this field. [ABSTRACT FROM AUTHOR]

Additional Information

  • Source:Journal of Bioinformatics & Computational Biology. 2023/02, Vol. 21, Issue 1, p1
  • Document Type:Article
  • Subject Area:Biology
  • Publication Date:2023
  • ISSN:0219-7200
  • DOI:10.1142/S0219720023500038
  • Accession Number:162916338
  • Copyright Statement:Copyright of Journal of Bioinformatics & Computational Biology is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.