|
Séminaire DIGITEO, séance du 20/10 |
|
|
|
|
Séminaire DIGITEO, séance du 20/10 20 octobre 2011
Degeneracy Based Community Evaluation
Michalis Vazirgiannis, chaire Digiteo
Supélec, amphi F3.05, 14h30 |
|
Abstract -------------- Graphs constitute the dominant data structure in the WWW, and appear essentially in all forms of information. Examples are the web graph, social networks, protein interaction networks, terms dependency graphs etc. Important knowledge is hidden in the macroscopic topology and features of these graphs. The dominant knowledge artifacts extracted from graphs are either individual node based scores (such as authority/hubness/centrality) or unsupervised grouping of nodes in to clusters or global statistics computations (such as degree distributions etc). What is missing is metrics, structures and measures that represent the deeper knowledge hidden in the macroscopic structure of potentially directed/weighted graphs. We propose new metrics and evaluation schemes for the macroscopic structure of graphs capitalizing on the degeneracy concept, i.e. k-cores, towards identifying the most cohesive components of graphs – finding thus the most collaborative constituents. These metrics compute the most robust subgraphs representing dense and mutual connectivity in the case of directed and weighted graphs as well. Connectivity can be then interpreted in several ways: i.e. as collaboration in citation or social networking graphs, collective affinity in protein interaction graphs etc. We also use the best k-cores of graphs as seeds for optimizing the speed of spectral graph clustering. We conducted several experiments on real (DBLP, Wikipedia) and synthetic data sets. The results are interesting especially in the case in the DBLP citation graph.
We further extend k-core to deal with directed graphs, introducing the D-core concept, as means of evaluating a digraph’s collaborative nature. Based on the D-core we devise a wealth of novel metrics used to evaluate the graphs collaboration features. We applied the above approaches on large real world graphs - Wikipedia and DBLP - and report interesting results.
A relevant prototype demo is available at: http://www.lix.polytechnique.fr/~giatsidis/cores/Pour en savoir plus: http://www.digiteo.fr/Digiteo_NewsSeminaire_Michalis |
|
|
|
|
News |
|
|
Disparition de Yannis Manoussakis06 juin 2021Nous venons d'apprendre la disparition de Yannis Manoussakis, Professeur à l'Université Paris-Saclay, décédé samedi 5 juin.
Il était le responsable de l'équipe GALaC et avait été de nombreuses années directeur du LRI, nous perdons un ami et un coll Semaine du cerveau : Cerveau connecté16 mars 2021Laurence Devillers, chercheuse en Intelligence Artificielle et Ethique de l'IA et Michel Beaudouin-Lafon, chercheur en Interaction Humain-Machine exposent leurs points de vue dans la série de Podcats du CNRS à l'occasion de la semaine du Cerveau. Wizard project01 avril 2021Innovation Area: Public Safety, IoT, Mobility
|
|
|
|
|