EvoStruct-Sub: An accurate Gram-positive protein subcellular localization predictor using evolutionary and structural features

Md Raihan Uddin, Alok Sharma, Dewan Md Farid, Md Mahmudur Rahman, Abdollah Dehzangi, Swakkhar Shatabda

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Determining subcellular localization of proteins is considered as an important step towards understanding their functions. Previous studies have mainly focused solely on Gene Ontology (GO) as the main feature to tackle this problem. However, it was shown that features extracted based on GO is hard to be used for new proteins with unknown GO. At the same time, evolutionary information extracted from Position Specific Scoring Matrix (PSSM) have been shown as another effective features to tackle this problem. Despite tremendous advancement using these sources for feature extraction, this problem still remains unsolved. In this study we propose EvoStruct-Sub which employs predicted structural information in conjunction with evolutionary information extracted directly from the protein sequence to tackle this problem. To do this we use several different feature extraction method that have been shown promising in subcellular localization as well as similar studies to extract effective local and global discriminatory information. We then use Support Vector Machine (SVM) as our classification technique to build EvoStruct-Sub. As a result, we are able to enhance Gram-positive subcellular localization prediction accuracies by up to 5.6% better than previous studies including the studies that used GO for feature extraction.

Original languageEnglish (US)
Pages (from-to)138-146
Number of pages9
JournalJournal of Theoretical Biology
Volume443
DOIs
StatePublished - Apr 14 2018
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Modeling and Simulation
  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)
  • Agricultural and Biological Sciences(all)
  • Applied Mathematics

Keywords

  • Classification
  • Evolutionary-based features
  • Feature selection
  • Proteins subcellular localization
  • Structural-based features
  • Support vector machine

Fingerprint

Dive into the research topics of 'EvoStruct-Sub: An accurate Gram-positive protein subcellular localization predictor using evolutionary and structural features'. Together they form a unique fingerprint.

Cite this