SciELO - Scientific Electronic Library Online

 


Revista Tecnología en Marcha

On-line version ISSN 0379-3982; print version ISSN 0379-3982

Abstract

CALVO-VALVERDE, Luis Alexander and VALLEJOS-PENA, Alonso. Semisupervised clustering algorithm combining SUBCLU and constrained clustering for detecting groups in high dimensional datasets. Tecnología en Marcha [online]. 2018, vol.31, n.3, pp.74-85. ISSN 0379-3982.  http://dx.doi.org/10.18845/tm.v31i3.3904.

High-dimensional data poses a challenge to traditional clustering algorithms: similarity measures lose their meaning in such spaces, which degrades the quality of the resulting groups. Subspace clustering algorithms have therefore been proposed as an alternative, aiming to find all groups in all subspaces of the dataset (1).

Because groups are detected in lower-dimensional spaces, each group may belong to a different subspace of the original dataset (2). Attributes that the user considers of interest may therefore be excluded from some or all groups, reducing the value of the result for data analysts.

This project proposes a new algorithm that combines SUBCLU (3) with constrained clustering algorithms (4). It lets users mark variables as attributes of interest based on prior domain knowledge, steering group detection toward subspaces that include those attributes and thereby producing more meaningful groups.
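
The general idea can be illustrated with a minimal Python sketch. This is not the authors' algorithm. It assumes a simplified SUBCLU-style bottom-up search (a density-based clustering run on each candidate subspace, with candidates pruned when any lower-dimensional projection has no cluster), and it models the user constraint as a plain filter: a subspace is reported only if it contains every attribute of interest. The names `dbscan`, `project`, `subspace_clusters`, `interest`, and the toy data are all hypothetical.

```python
from itertools import combinations
from math import dist


def dbscan(points, eps, min_pts):
    """Minimal DBSCAN; returns a list of clusters as sets of point indices."""
    n = len(points)
    # A point counts as its own neighbour.
    neighbors = [[j for j in range(n) if dist(points[i], points[j]) <= eps]
                 for i in range(n)]
    core = [len(nb) >= min_pts for nb in neighbors]
    assigned, clusters = set(), []
    for i in range(n):
        if not core[i] or i in assigned:
            continue
        cluster, stack = set(), [i]
        assigned.add(i)
        while stack:
            p = stack.pop()
            cluster.add(p)
            if not core[p]:
                continue  # border points join a cluster but do not expand it
            for q in neighbors[p]:
                if q not in assigned:
                    assigned.add(q)
                    stack.append(q)
        clusters.append(cluster)
    return clusters


def project(data, subspace):
    """Keep only the dimensions listed in `subspace`."""
    return [tuple(row[d] for d in subspace) for row in data]


def subspace_clusters(data, eps, min_pts, interest=()):
    """Bottom-up subspace search, reporting only subspaces that contain
    every attribute in `interest` (assumed form of the user constraint)."""
    results = {}
    current = {}
    for d in range(len(data[0])):
        found_1d = dbscan(project(data, (d,)), eps, min_pts)
        if found_1d:
            current[(d,)] = found_1d
    while current:
        results.update(current)
        k = len(next(iter(current)))
        candidates = set()
        # Join k-dimensional subspaces that share their first k-1 dimensions.
        for a, b in combinations(sorted(current), 2):
            if a[:-1] == b[:-1]:
                cand = a + (b[-1],)
                # Prune unless every k-dimensional projection has clusters.
                if all(sub in current for sub in combinations(cand, k)):
                    candidates.add(cand)
        current = {}
        for cand in sorted(candidates):
            found_kd = dbscan(project(data, cand), eps, min_pts)
            if found_kd:
                current[cand] = found_kd
    wanted = set(interest)
    return {s: c for s, c in results.items() if wanted <= set(s)}


# Toy data: dimensions 0 and 1 carry two groups, dimension 2 is spread out.
data = [
    (1.0, 1.0, 0.0), (1.1, 1.0, 5.0), (1.0, 1.1, 10.0),
    (5.0, 5.0, 15.0), (5.1, 5.0, 20.0), (5.0, 5.1, 25.0),
]
found = subspace_clusters(data, eps=0.5, min_pts=3, interest=(1,))
print(sorted(found))
```

With attribute 1 marked as of interest, the sketch reports only the subspaces `(1,)` and `(0, 1)`; the subspace `(0,)` also contains clusters but is dropped because it omits the attribute of interest. A real implementation would more plausibly use the constraint to guide the search itself rather than to filter its output.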

Keywords: Data mining; subspaces; SUBCLU; clustering; clustering by constraint.
