CS224w Homework 2

This post records the problems I ran into and the results I got while working through the assignments of CS224w Machine Learning With Graphs 2019.

My GitHub repository: LINK

This write-up covers:

  • Node Classification: related to Collective Classification and message passing
  • Node Embeddings with TransE: related to Graph Representation Learning
  • GNN Expressiveness: the relationship between GNN depth and expressive power

Part 1 Node Classification

Node Classification mainly uses Collective Classification methods; the overall pipeline and its components are shown in the figure below:

[Figure: the Collective Classification pipeline and its components]

The main approximate inference methods are (listed from simplest to most advanced):

  • Relational Classification
  • Iterative Classification
  • Belief Propagation

All three methods rest on the Markov assumption: the label $Y_i$ of a node depends only on the labels of its neighbors, which can be written as:

$$P(Y_i \mid i) = P(Y_i \mid N_i)$$

where $N_i$ denotes the set of neighbors of node $i$.

Relational Classification

A very simple method: it uses only the labels and the network topology, without any per-node features. The prediction for each unlabeled node is simply the average of its neighbors' label probabilities.

Unlabeled nodes are updated with:

$$P(Y_i = c) = \frac{1}{\sum_{(i,j) \in E} W(i,j)} \sum_{(i,j) \in E} W(i,j)\, P(Y_j = c)$$

where $W(i,j)$ is the edge weight; on an unweighted graph this reduces to the plain average over neighbors, as in the code below.

Its drawbacks are also obvious:

  • Convergence is not guaranteed.
  • It cannot use node feature information; it only uses the graph structure.

The corresponding homework problem is straightforward: implement this algorithm on the given small graph.

[Figure: the example graph for Q1.1]

A simple implementation:

import snap


def create_graph():
    # Build the 10-node undirected example graph from the assignment.
    g = snap.TUNGraph.New()
    for i in range(1, 11):
        g.AddNode(i)
    g.AddEdge(1, 2)
    g.AddEdge(1, 3)
    g.AddEdge(2, 3)
    g.AddEdge(2, 4)
    g.AddEdge(3, 6)
    g.AddEdge(4, 7)
    g.AddEdge(4, 8)
    g.AddEdge(5, 8)
    g.AddEdge(5, 6)
    g.AddEdge(5, 9)
    g.AddEdge(6, 9)
    g.AddEdge(6, 10)
    g.AddEdge(7, 8)
    g.AddEdge(8, 9)
    g.AddEdge(9, 10)
    return g


G = create_graph()
node_dict = {}
positive = [1, 0]
negative = [0, 1]
node_dict[3] = positive
node_dict[5] = positive
node_dict[8] = negative
node_dict[10] = negative
label_id = [3, 5, 8, 10]
node_num = G.GetNodes()

# Initialize every unlabeled node with a uniform belief.
for i in range(1, node_num + 1):
    if i not in label_id:
        node_dict[i] = [0.5, 0.5]

flag = 1
loop_cnt = 0
while flag != 0:
    # Stop iterating once no node's probabilities change.
    flag = 0
    for i in range(1, node_num + 1):
        if i not in label_id:
            neighbors = []
            cur_node = G.GetNI(i)
            degree = cur_node.GetDeg()

            for nbr in range(degree):
                neighbors.append(cur_node.GetNbrNId(nbr))
            origin = node_dict[i]
            sum_p = [0, 0]
            for mid in neighbors:
                sum_p[0] += node_dict[mid][0] / degree
                sum_p[1] += node_dict[mid][1] / degree
            node_dict[i] = sum_p
            if abs(origin[0] - sum_p[0]) > 0.001:
                # A change larger than 0.001 counts as still changing.
                flag += 1
            print('id:{}\t pro:{}'.format(i, sum_p))
    loop_cnt += 1
    print('Loop {} finish!!!'.format(loop_cnt))

Running this code yields the answer to Q1.1.

Belief Propagation

Belief Propagation is a dynamic programming approach to answering conditional probability queries in a graphical model. In essence, it simulates the flow of information over the network: each node determines its own state from the messages its neighbors send, and in every iteration each node communicates with its neighbors by passing messages. See the slides for the detailed formulas.
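For reference, the message update takes the following form (matching the version on the slides): node $i$ sends node $j$ its estimate of $j$'s label,

$$m_{i \to j}(Y_j) = \alpha \sum_{Y_i \in \mathcal{L}} \psi(Y_i, Y_j)\, \phi_i(Y_i) \prod_{k \in N_i \setminus \{j\}} m_{k \to i}(Y_i)$$

where $\mathcal{L}$ is the label set, $\psi(Y_i, Y_j)$ is the label-label compatibility, $\phi_i(Y_i)$ is node $i$'s prior, and $\alpha$ normalizes the message. Once the messages converge, node $i$'s belief is $b_i(Y_i) = \alpha\, \phi_i(Y_i) \prod_{k \in N_i} m_{k \to i}(Y_i)$.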

Answer:

[Figure: answer to the Belief Propagation question]

Part 2 Node Embeddings with TransE

This part discusses TransE, a classic embedding method for multi-relational graphs, i.e. graphs with multiple types of edges. A typical example of such a graph is a knowledge graph, and TransE is one of the classic methods for embedding knowledge graphs.

[Figures: TransE homework answers]
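As a quick refresher on the method itself: TransE embeds every entity and relation in $\mathbb{R}^k$ and asks that a true triple $(h, r, t)$ satisfy $\mathbf{h} + \mathbf{r} \approx \mathbf{t}$, so the score is the distance $\|\mathbf{h} + \mathbf{r} - \mathbf{t}\|$, trained with a margin ranking loss against corrupted triples. Below is a minimal sketch of the score and loss; the function names and toy embeddings are my own, not the homework starter code.

import numpy as np


def transe_score(h, r, t):
    # TransE scores a triple by how far the translation h + r lands from t;
    # a lower distance means a more plausible triple.
    return np.linalg.norm(h + r - t)


def margin_loss(pos_score, neg_score, gamma=1.0):
    # Margin ranking loss: a true triple should score at least gamma
    # lower than a corrupted one.
    return max(0.0, gamma + pos_score - neg_score)


k = 4                                   # embedding dimension (arbitrary here)
rng = np.random.default_rng(0)
h, r = rng.normal(size=k), rng.normal(size=k)
t = h + r + 0.01 * rng.normal(size=k)   # an almost-perfect true triple
t_corrupt = rng.normal(size=k)          # a corrupted tail entity
print(margin_loss(transe_score(h, r, t), transe_score(h, r, t_corrupt)))

The published algorithm also renormalizes entity embeddings to unit norm at each step, which prevents the loss from being driven down by simply inflating the embedding norms; this is worth keeping in mind when working through the questions.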

Part 3 GNN Expressiveness

NOTE: when training, if a node has no features, set its feature vector to the all-ones vector or to the node's degree.

This part studies the relationship between the number of layers in a GNN and its expressiveness. Expressiveness refers to the set of functions (usually the loss function for classification or regression tasks) a neural network is able to compute, which depends on the structural properties of the network architecture.

3.1 Effect of Depth on Expressiveness

[Figure: answer to 3.1]

3.2 Relation to Random Walk

[Figure: answer to 3.2]

3.3 Over-Smoothing Effect

[Figure: answer to 3.3]

3.4 Learning BFS with GNN

This is just a simple average aggregation, so the full solution is not written out here; a minimal sketch of the idea follows.
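A minimal sketch, assuming binary per-node states (1 = visited) and a plain dict adjacency list rather than the homework's GNN framework: averaging neighbor states and thresholding at zero marks a node visited exactly when at least one neighbor is visited, i.e. one step of BFS frontier expansion.

# One step of BFS expansion via neighborhood average aggregation.
# `adj` maps each node to its neighbor list; `visited` maps nodes to 0/1.
def bfs_step(adj, visited):
    new_state = dict(visited)
    for node, nbrs in adj.items():
        avg = sum(visited[n] for n in nbrs) / len(nbrs)  # average aggregation
        if avg > 0:  # any visited neighbor makes the average positive
            new_state[node] = 1
    return new_state

adj = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3]}  # a path graph, for illustration
state = {1: 1, 2: 0, 3: 0, 4: 0}              # BFS starts from node 1
for _ in range(3):
    state = bfs_step(adj, state)
    print(state)

Each call to bfs_step plays the role of one GNN layer, which is why the number of layers bounds how far the BFS information can propagate.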