Building a Classification Model Using Affinity Propagation

Christopher Klecker, Ashraf Saad

Research output: Contribution to book or proceedingConference articlepeer-review

Abstract

Regular classification of data includes a training set and test set. For example for Naïve Bayes, Artificial Neural Networks, and Support Vector Machines, each classifier employs the whole training set to train itself. This study will explore the possibility of using a condensed form of the training set in order to get a comparable classification accuracy. The technique we explored in this study will use a clustering algorithm to explore how the data can be compressed. For example, is it possible to represent 50 records as a single record? Can this single record train a classifier as similarly to using all 50 records? This thesis aims to explore the idea of how we can achieve data compression through clustering, what are the concepts that extract the qualities of a compressed dataset, and how to check the information gain to ensure the integrity and quality of the compression algorithm. This study will explore compression through Affinity Propagation using categorical data, exploring entropy within cluster sets to calculate integrity and quality, and testing the compressed dataset with a classifier using Cosine Similarity against the uncompressed dataset.

Original languageEnglish
Title of host publicationHybrid Artificial Intelligent Systems - 14th International Conference, HAIS 2019, Proceedings
EditorsHilde Pérez García, Lidia Sánchez González, Manuel Castejón Limas, Héctor Quintián Pardo, Emilio Corchado Rodríguez
PublisherSpringer Verlag
Pages275-286
Number of pages12
ISBN (Print)9783030298586
DOIs
StatePublished - 2019
Externally publishedYes
Event14th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2019 - León, Spain
Duration: Sep 4 2019Sep 6 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11734 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2019
Country/TerritorySpain
CityLeón
Period09/4/1909/6/19

Scopus Subject Areas

  • Theoretical Computer Science
  • General Computer Science

Keywords

  • Affinity propagation
  • Categorical data
  • Classification
  • Clustering
  • Clustering analysis
  • Condensed dataset
  • Damping factor
  • Elbow method
  • Exemplars
  • Naïve-Bayes
  • Prediction model
  • Preference value
  • Similarity matrix

Fingerprint

Dive into the research topics of 'Building a Classification Model Using Affinity Propagation'. Together they form a unique fingerprint.

Cite this