Data Mining: The Textbook



Yüklə 17,13 Mb.
səhifə364/423
tarix07.01.2024
ölçüsü17,13 Mb.
#211690
1   ...   360   361   362   363   364   365   366   367   ...   423
1-Data Mining tarjima

19.2. SOCIAL NETWORKS: PRELIMINARIES AND PROPERTIES

621

correspond to comments or other documents posted by social network users. It is assumed that the social network contains n nodes and m edges. In the following, some key properties of social networks will be discussed.


19.2.1 Homophily




Homophily is a fundamental property of social networks that is used in many applications, such as node classification. The basic idea in homophily is that nodes that are connected to one another are more likely to have similar properties. For example, a person’s friend-ship links in Facebook may be drawn from previous acquaintances in school and work. Aside from common backgrounds, the friendship links may often imply common interests between the two parties. Thus, individuals who are linked may often share common beliefs, backgrounds, education, hobbies, or interests. This is best stated in terms of the old proverb:


Birds of a feather flock together

This property is leveraged in many network-centric applications.


19.2.2 Triadic Closure and Clustering Coefficient


Intuitively, triadic closure may be thought of as an inherent tendency of real-world networks to cluster. The principle of triadic closure is as follows:




If two individuals in a social network have a friend in common, then it is more likely that they are either connected or will eventually become connected in the future.

The principle of triadic closure implies an inherent correlation in the edge structure of the network. This is a natural consequence of the fact that two individuals connected to the same person are more likely to have similar backgrounds and also greater opportunities to interact with one another. The concept of triadic closure is related to homophily. Just as the similarity in backgrounds of connected individuals makes their properties similar, it also makes it more likely for them to be connected to the same set of actors. While homphily is typically exhibited in terms of content properties of node attributes, triadic closure can be viewed as the structural version of homophily. The concept of triadic closure is directly related to the clustering coefficient of the network.


The clustering coefficient can be viewed as a measure of the inherent tendency of a network to cluster. This is similar to the Hopkins statistic for multidimensional data (cf. Sect. 6.2.1.4 of Chap. 6). Let Si ⊆ N be the set of nodes connected to node i ∈ N in the undirected network G = (N, A) . Let the cardinality of Si be ni. There are ni possible edges between nodes in Si. The local clustering coefficient η(i) of node i is the2fraction of these pairs that have an edge between them.





η(i) =

|{(j, k) ∈ A : j ∈ Si, k ∈ Si}|

(19.1)







ni
















2










The Watts–Strogatz network average clustering coefficient is the average value of η(i) over all nodes in the network. It is not difficult to see that the triadic closure property increases the clustering coefficient of real-world networks.



622 CHAPTER 19. SOCIAL NETWORK ANALYSIS

19.2.3 Dynamics of Network Formation


Many real properties of networks are affected by how they are formed. Networks such as the World Wide Web and social networks are continuously growing over time with new nodes and edges being added constantly. Interestingly, networks from multiple domains share a number of common characteristics in the dynamic processes by which they grow. The manner in which new edges and nodes are added to the network has a direct impact on the eventual structure of the network and choice of effective mining techniques. Therefore, the following will discuss some common properties of real-world networks:






  1. Yüklə 17,13 Mb.

    Dostları ilə paylaş:
1   ...   360   361   362   363   364   365   366   367   ...   423




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©azkurs.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin