Skip to main navigation Skip to search Skip to main content

Mixing Consistent Deep Clustering

  • Daniel Lutscher*
  • , Ali el Hassouni
  • , Maarten Stol
  • , Mark Hoogendoorn
  • *Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

208 Downloads (Pure)

Abstract

Finding well-defined clusters in data represents a fundamental challenge for many data-driven applications, and largely depends on good data representation. Drawing on literature regarding representation learning, studies suggest that one key characteristic of good latent representations is the ability to produce semantically mixed outputs when decoding linear interpolations of two latent representations. We propose the Mixing Consistent Deep Clustering (MCDC) method which encourages interpolations to appear realistic while adding the constraint that interpolations of two data points must look like one of the two inputs. By applying this training method to various clustering (non-)specific autoencoder models we found that using the proposed training method systematically changed the structure of learned representations of a model and it improved clustering performance for the tested ACAI, IDEC, and VAE models on the MNIST, SVHN, and CIFAR-10 datasets. These outcomes have practical implications for numerous real-world clustering tasks, as it shows that the proposed method can be added to existing autoencoders to further improve clustering performance.

Original languageEnglish
Title of host publicationMachine Learning, Optimization, and Data Science
Subtitle of host publication[Proceedings] 7th International Conference, LOD 2021, Grasmere, UK, October 4–8, 2021, Revised Selected Papers, Part I
EditorsGiuseppe Nicosia, Varun Ojha, Emanuele La Malfa, Gabriele La Malfa, Giorgio Jansen, Panos M. Pardalos, Giovanni Giuffrida, Renato Umeton
PublisherSpringer Science and Business Media Deutschland GmbH
Pages124-137
Number of pages14
Volume1
ISBN (Electronic)9783030954673
ISBN (Print)9783030954666
DOIs
Publication statusPublished - 2022
Event7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021 - Virtual, Online
Duration: 4 Oct 20218 Oct 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13163 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference on Machine Learning, Optimization, and Data Science, LOD 2021
CityVirtual, Online
Period4/10/218/10/21

Bibliographical note

Publisher Copyright:
© 2022, Springer Nature Switzerland AG.

Keywords

  • Adversarial training
  • Autoencoder
  • Clustering

Fingerprint

Dive into the research topics of 'Mixing Consistent Deep Clustering'. Together they form a unique fingerprint.

Cite this