Download Joining Similarity Measures Using Quasi-Arithmetic Means Etienne Cuvelier , Marie-Aude Aufaure

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Cluster analysis wikipedia , lookup

Transcript
Joining Similarity Measures Using Quasi-Arithmetic
Means
Etienne Cuvelier1,∗ , Marie-Aude Aufaure2
1. ICHEC, Brussels Management School, Bruxelles, Belgium
2. Ecole Centrale Paris, Paris, France
? Contact author: [email protected]
Keywords: Similarity Measures, Dissimilarity Measures, Combining Measures, Quasi-Arithmetic
Means, Archimedean Generator.
A lot of data analysis methods are based on similarity or dissimilarity measures but, most of the
times, these measures are defined for one type of data (real multidimensional data, interval data,
functional data, nodes in graphs,...). This fact implies that all the techniques of knowledge extraction based on such measures can be performed only on the data type for which they are defined.
But the description and the modelling of real situations require the joint use of several kind of data.
Symbolic Data Analysis deals also with this situation describing concepts using real data, interval
data, histogram data, set data and/or probability distributions. We propose a new technique of
combination of different measures in one single result. The method is based on Quasi-Arithmetics
Means using Archimedean Generators. Quasi-Arithmetic Means with this kind of generators have
several advantages to compute a resulting measure starting from several measures (computed on
different types of data describing the same concept or individual): they allow to choose to emphasize the similarity or the dissimilarity between objects, they have flexible parameters, its possible
to mix similarities and dissimilarities to compute a resulting similarity (or dissimilarity).
The resulting measure (similarity or dissimilarity) can be used in any existing algorithm based on
such measures: clustering, supervised classification...
We will give some examples of use of this method on attributed networks and on symbolic data.
References
Stanislawa Ostasiewicz and Walenty Ostasiewicz (2000). Means and their applications. Annals of
Operations Research , 97, :337 – 355.
Arlei Silva, Wagner Meira Jr., and Mohammed J. Zaki (2012). Mining attribute-structure correlated
patterns in large attributed graphs. PVLDB, 5(5), 466 – 477.
Yang Zhou, Hong Cheng, and Jeffrey Xu Yu (2009). Graph clustering based on structural/attribute
similarities. VLDB09 , Lyon, France.
J. Kim and L. Billard (2013). Dissimilarity measures for histogram-valued observations. Communications in Statistics-Theory and Method , 42, 283 – 303.