A KNN-based learning method for biology species categorization

Yan Dang, Yulei Zhang, Dongmo Zhang, Liping Zhao

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

This paper presents a novel approach toward high precision biology species categorization which is mainly based on KNN algorithm. KNN has been successfully used in natural language processing (NLP). Our work extends the learning method for biological data. We view the DNA or RNA sequences of certain species as special natural language texts. The approach for constructing composition vectors of DNA and RNA sequences is described. A learning method based on KNN algorithm is proposed. An experimental system for biology species categorization is implemented. Forty three different bacteria organisms selected randomly from EMBL are used for evaluation purpose. And the preliminary experiments show promising results on precision.

Original languageEnglish (US)
Pages (from-to)956-964
Number of pages9
JournalLecture Notes in Computer Science
Volume3610
Issue numberPART I
DOIs
StatePublished - 2005
Externally publishedYes
EventFirst International Conference on Natural Computation, ICNC 2005 - Changsha, China
Duration: Aug 27 2005Aug 29 2005

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A KNN-based learning method for biology species categorization'. Together they form a unique fingerprint.

Cite this