CNN-Pred: Prediction of single-stranded and double-stranded DNA-binding protein using convolutional neural networks

Farnoush Manavi, Alok Sharma, Ronesh Sharma, Tatsuhiko Tsunoda, Swakkhar Shatabda, Iman Dehzangi

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

DNA-binding proteins play a vital role in biological activity including DNA replication, DNA packing, and DNA reparation. DNA-binding proteins can be classified into single-stranded DNA-binding proteins (SSBs) or double-stranded DNA-binding proteins (DSBs). Determining whether a protein is DSB or SSB helps determine the protein's function. Therefore, many studies have been conducted to accurately identify DSB and SSB in recent years. Despite all the efforts have been made so far, the DSB and SSB prediction performance remains limited. In this study, we propose a new method called CNN-Pred to accurately predict DSB and SSB. To build CNN-Pred, we first extract evolutionary-based features in the form of mono-gram and bi-gram profiles using position specific scoring matrix (PSSM). We then, use 1D-convolutional neural network (CNN) as the classifier to our extracted features. Our results demonstrate that CNN-Pred can enhance the DSB and SSB prediction accuracies by more than 4%, on the independent test compared to previous studies found in the literature. CNN-pred as a standalone tool and all its source codes are publicly available at: https://github.com/MLBC-lab/CNN-Pred.

Original languageEnglish (US)
Article number147045
JournalGene
Volume853
DOIs
StatePublished - Feb 15 2023

All Science Journal Classification (ASJC) codes

  • Genetics

Keywords

  • Convolutional neural networks
  • DNA-Binding proteins
  • DSBs
  • Feature
  • SSBs

Fingerprint

Dive into the research topics of 'CNN-Pred: Prediction of single-stranded and double-stranded DNA-binding protein using convolutional neural networks'. Together they form a unique fingerprint.

Cite this