An improved incremental training approach for large scaled dataset based on support vector machine

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

7 Citations (Scopus)

Abstract

The Support Vector Machine(SVM) is well known in machine learning and artificial intelligence for its high performance in data classification, regression and forecasting. Usually for large scaled dataset, an incremental training algorithm is applied for tuning or balancing the training cost and the accuracy in SVM applications. This paper presents an improved incremental training approach for large scaled dataset on SVM. We focus on data's own distribution information to unfold our research, we proposed a self adaptive clustering method to extract the area and density information of data, a border detection technologies and uncertainty strategy is applied to maintain the border and some potential samples. Our proposed method can greatly reduce the training error for incremental training on SVM, especially for some uneven distribution dataset. We can greatly tuning or balancing the training cost and the accuracy of algorithms to achieve a better performance.

Original languageEnglish
Title of host publicationProceeding of 2016 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)
PublisherAssociation for Computing Machinery, Inc
Pages149-157
Number of pages9
ISBN (Electronic)9781450346177
DOIs
Publication statusPublished - 6 Dec 2016
Externally publishedYes
Event3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2016 - Shanghai, China
Duration: 6 Dec 20169 Dec 2016

Publication series

NameProceedings - 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2016

Conference

Conference3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2016
Country/TerritoryChina
CityShanghai
Period6/12/169/12/16

Keywords

  • Classification
  • Data distribution
  • Incremental learning
  • Self adaptive clustering
  • Uncertainty

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'An improved incremental training approach for large scaled dataset based on support vector machine'. Together they form a unique fingerprint.

Cite this