Privacy-preserving SVM classification

Jaideep Vaidya, Hwanjo Yu, Xiaoqian Jiang

Research output: Contribution to journalArticlepeer-review

119 Scopus citations

Abstract

Traditional Data Mining and Knowledge Discovery algorithms assume free access to data, either at a centralized location or in federated form. Increasingly, privacy and security concerns restrict this access, thus derailing data mining projects. What is required is distributed knowledge discovery that is sensitive to this problem. The key is to obtain valid results, while providing guarantees on the nondisclosure of data. Support vector machine classification is one of the most widely used classification methodologies in data mining and machine learning. It is based on solid theoretical foundations and has wide practical application. This paper proposes a privacy-preserving solution for support vector machine (SVM) classification, PP-SVM for short. Our solution constructs the global SVM classification model from data distributed at multiple parties, without disclosing the data of each party to others. Solutions are sketched out for data that is vertically, horizontally, or even arbitrarily partitioned. We quantify the security and efficiency of the proposed method, and highlight future challenges.

Original languageEnglish (US)
Pages (from-to)161-178
Number of pages18
JournalKnowledge and Information Systems
Volume14
Issue number2
DOIs
StatePublished - Feb 2008

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Human-Computer Interaction
  • Hardware and Architecture
  • Artificial Intelligence

Keywords

  • Classification
  • Privacy
  • Security
  • Support vector machine

Fingerprint

Dive into the research topics of 'Privacy-preserving SVM classification'. Together they form a unique fingerprint.

Cite this