Home // DATA ANALYTICS 2015, The Fourth International Conference on Data Analytics // View article


Combining Machine Learning with Shortest Path Methods

Authors:
Armand Prieditis
Chris Lee

Keywords: Social networks; modeling; disovery; visualization; clustering; influence analysis; machine learning

Abstract:
This paper describes a method to model, discover, and visualize communities in social networks. It makes use of a novel method based on the “Six Degrees of Kevin Bacon” principle: find the shortest path between entities in a social graph and then discover communities based on clustering with those shortest-path distances. We have applied this idea to find Hollywood’s power clusters based on IMDB (Internet Movie Database), which links actors to movies. Using this method, we found roughly three clusters of Hollywood elite actors, the largest of which contained many of Hollywood’s best-known actors. For living actors, we found Colin Firth (who played Pride and Prejudice’s Mr. Darcy), Javier Bardem (who played a psychopathic killer in No Country for Old Men), and Joaquin Phoenix (who played Johnny Cash and a Roman Emperor in Gladiator) to be some of the most well-connected actors in Hollywood. This suggests that analyzing a social network using our method can lead to some surprising results.

Pages: 112 to 120

Copyright: Copyright (c) IARIA, 2015

Publication date: July 19, 2015

Published in: conference

ISSN: 2308-4464

ISBN: 978-1-61208-423-7

Location: Nice, France

Dates: from July 19, 2015 to July 24, 2015