Publication:
Visualizing Profiles of Large Datasets of Weighted and Mixed Data

dc.affiliation.dpto: UC3M. Departamento de Estadística [es]
dc.contributor.author: Grané Chávez, Aurea
dc.contributor.author: Sow-Barry, Alpha A.
dc.contributor.funder: Ministerio de Economía y Competitividad (España) [es]
dc.date.accessioned: 2021-06-30T09:36:37Z
dc.date.available: 2021-06-30T09:36:37Z
dc.date.issued: 2021-04-02
dc.description.abstract: This work provides a procedure for constructing and visualizing profiles, i.e., groups of individuals with similar characteristics, for weighted and mixed data by combining two classical multivariate techniques, multidimensional scaling (MDS) and the k-prototypes clustering algorithm. The well-known drawback of classical MDS in large datasets is circumvented by selecting a small random sample of the dataset, whose individuals are clustered by means of an adapted version of the k-prototypes algorithm and mapped via classical MDS. Gower’s interpolation formula is used to project the remaining individuals onto the previous configuration. Throughout the process, Gower’s distance is used to measure the proximity between individuals. The methodology is illustrated on a real dataset, obtained from the Survey of Health, Ageing and Retirement in Europe (SHARE), which was carried out in 19 countries and represents over 124 million aged individuals in Europe. The performance of the method was evaluated through a simulation study, whose results indicate that the new proposal overcomes the high computational cost of classical MDS with low error. [en]
dc.description.sponsorship: This research was funded by the Spanish Ministry of Economy and Competitiveness, grant number MTM2014-56535-R; and the V Regional Plan for Scientific Research and Technological Innovation 2016-2020 of the Community of Madrid, an agreement with Universidad Carlos III de Madrid in the action of "Excellence for University Professors." [en]
dc.format.extent: 20
dc.identifier.bibliographicCitation: Grané, A. & Sow-Barry, A. A. (2021). Visualizing Profiles of Large Datasets of Weighted and Mixed Data. Mathematics, 9(8), 891. [en]
dc.identifier.doi: https://doi.org/10.3390/math9080891
dc.identifier.issn: 2227-7390
dc.identifier.publicationfirstpage: 891
dc.identifier.publicationissue: 8
dc.identifier.publicationtitle: Mathematics [en]
dc.identifier.publicationvolume: 9
dc.identifier.uri: https://hdl.handle.net/10016/32960
dc.identifier.uxxi: AR/0000027865
dc.language.iso: eng
dc.publisher: MDPI
dc.relation.projectID: Gobierno de España. MTM2014-56535-R [es]
dc.rights: © 2021 by the authors. [en]
dc.rights: Atribución 3.0 España [*]
dc.rights.accessRights: open access [en]
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/es/ [*]
dc.subject.eciencia: Estadística [es]
dc.subject.other: Clustering [en]
dc.subject.other: Gower's interpolation formula [en]
dc.subject.other: Gower's metric [en]
dc.subject.other: Mixed data [en]
dc.subject.other: Multidimensional scaling [en]
dc.title: Visualizing Profiles of Large Datasets of Weighted and Mixed Data [en]
dc.type: research article [*]
dc.type.hasVersion: VoR [*]
dspace.entity.type: Publication
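
The procedure summarized in the abstract (classical MDS on a small sample, then Gower's interpolation formula to place the remaining individuals) can be illustrated with a minimal Python sketch. This is not the authors' implementation: it omits the individual weights and the adapted k-prototypes clustering step, and the function names (`gower_sq_dist`, `classical_mds`, `gower_interpolate`) and the NumPy-based layout are assumptions made for illustration only. The squared Gower distance is taken here as one minus Gower's similarity, which is one common convention.

```python
import numpy as np

def gower_sq_dist(num_a, cat_a, num_b, cat_b, ranges):
    """Squared Gower distances (1 - similarity) between rows of A and rows of B.

    num_* hold numeric variables, cat_* hold categorical ones; `ranges` is the
    range of each numeric variable (assumed to be computed on the full dataset).
    """
    # Numeric similarity per variable: 1 - |x - y| / range
    num_sim = 1.0 - np.abs(num_a[:, None, :] - num_b[None, :, :]) / ranges
    # Categorical similarity per variable: 1 if the categories match, else 0
    cat_sim = (cat_a[:, None, :] == cat_b[None, :, :]).astype(float)
    n_vars = num_a.shape[1] + cat_a.shape[1]
    sim = (num_sim.sum(axis=2) + cat_sim.sum(axis=2)) / n_vars
    return 1.0 - sim

def classical_mds(sq_dist, n_dims):
    """Classical MDS of an (n x n) squared-distance matrix.

    Returns the principal coordinates X (n x n_dims) and their eigenvalues.
    Assumes the n_dims largest eigenvalues are strictly positive.
    """
    n = sq_dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    b_matrix = -0.5 * centering @ sq_dist @ centering  # doubly centred inner products
    eigvals, eigvecs = np.linalg.eigh(b_matrix)
    order = np.argsort(eigvals)[::-1][:n_dims]          # largest eigenvalues first
    lam = eigvals[order]
    coords = eigvecs[:, order] * np.sqrt(lam)
    return coords, lam

def gower_interpolate(coords, lam, sq_dist_new):
    """Gower's interpolation: x_new = 1/2 * Lambda^{-1} X^T (b - d^2).

    sq_dist_new is (m x n): squared distances from m new points to the n
    sample points; b holds the squared norms of the sample coordinates.
    """
    b = np.sum(coords ** 2, axis=1)
    return 0.5 * (b[None, :] - sq_dist_new) @ coords / lam
```

In such a sketch, only the sample-by-sample distance matrix needs an eigendecomposition; each remaining individual requires just its distances to the sample, which is what keeps the cost manageable for large datasets.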
Files
Original bundle
Name:
Visualizing_mathematics_2021.pdf
Size:
1 MB
Format:
Adobe Portable Document Format