home..

Query2Box

huyi / July 2022

Query2box_reading

Query2box_reading

abstract

1.introduction

KG captures different types of relationships between entities.

First-order logical queries can be represented as Directed Acyclic Graphs (DAGs)(有向无环图)

-->drawbacks: 1)复杂度expotential in the query size 2)不能回答有missing relation的query
-->若要解决(2)需要补全KG但那会加剧问题(1)

another alternative: 把query embed 到一个低维向量空间，使得可以回答query的entities离query比较近。

-->解决了missing relations的问题
-->复杂度降低：simply identifying entities nearest to the embedding of the query in the vector space

PROBLEM: prior work embeds a query into a single point in the vector space.

-->answering a logical query requires modeling a set of active entities-->how to effectively model a set with a single point is unclear
-->unnatural to define logical operators (e.g., set intersection) of two points in the vector space
-->can only handle conjunctive queries-->a subset of first-order logic that only involves conjunction (∧) and existential quantifier (∃), but not disjunction (∨). It remains an open question how to handle disjunction effectively in the vector space

QUERY2BOX: an embedding-based framework for reasoning over KGs capable of handling arbitrary Existential Positive First-order (EPFO) logical queries (i.e., queries that include any set of ∧, ∨, and ∃) in a scalable manner.

modeling a set of entities:use a closed region rather than a single point in the vector space –>use a box (axis-aligned hyper-rectangle) to represent a query

Boxes naturally model sets of entities they enclose
Logical operators (e.g., set intersection) can naturally be defined over boxes similarly as in Venn diagrams
Executing logical operators over boxes results in new boxes, which means that the operations are closed; thus, logical reasoning can be efficiently performed in QUERY2BOX by iteratively(迭代) updating boxes according to the query computation graph

1.QUERY2BOX can naturally handle conjunctive queries

2.证明把EPFO queries embed到single points or boxes是困难的–>require embedding dimension proportional to the number of KG entities.

3.Solution: transform a given EPFO logical query into a Disjunctive Normal Form (DNF)，i.e., disjunction of conjunctive queries

Given any EPFO query, QUERY2BOX represents it as a set of individual boxes, where each box is obtained for each conjunctive query in the DNF.

We then return nearest neighbor entities to any of the boxes as the answers to the query. --> to answer any EPFO query we first answer individual conjunctive queries and then take the union of the answer entities.

approaches for multi-hop reasoning over KGs Bordes et al., 2013 Das et al., 2017等等

Crucial difference : we provide a way to tractably handle a larger subset of the first-order logic (EPFO queries vs. conjunctive queries) and that we embed queries as boxes, which provides better accuracy and generalization.

structures embedding: associate images, words, sentences, or knowledge base concepts with geometric objects such as regions

3.QUERY2BOX: LOGICAL REASONING OVER KGS IN VECTOR SPACE

要learn什么？

- embeddings of entities in the KG
- parameterized geometric logical operators over boxes 输入输出？

Input：an arbitrary EPFO query q
-->identify its computation graph(fig.B)
-->embed the query by executing a set of geometric operators over boxes
-->Output:Entities that are enclosed in the final box embedding are returned as answers to the query

模型最后能干啥？

- 准确回答训练集里的问题
- 能回答非训练集的问题和没见过的logical structure
- implicitly impute missing relations and answer queries that would be impossible to answer with traditional graph traversal methods

3.1 KNOWLEDGE GRAPHS AND CONJUNCTIVE QUERIES

把KG表示成 $\mathcal{G}=(\mathcal{V},\mathcal{R})$
$v\in\mathcal{V}$ 表示entity；$r\in\mathcal{R}$ 表示binary function(二元函数)
r : $\mathcal{V}\times\mathcal{V}\rightarrow{True,False}$ ,r表示entities是否成对(?成对的意思是direct edge?) 就是v_i v_j之间是不是有r这个edge

such binary output indicates the existence of the directed edge between a pair of entities

conjunctive queries 是用 $\land$ 和 $\exist$ 的first-order logical queries一阶逻辑？的子类。

其中$v_{a}$是一个不变的锚点, $V_{1},…,V_{k}$是一些有值的bound variables, $V_{?}$是target variable.

The goal of answering the logical query q is to find a set of entities $[q]\subseteq\mathcal{V}$ such that $\mathcal{v}\in[q]$ iff q[v] = True.
[q]: the denotation set (i.e., answer set) of query q

类似于fig.1的dependency graph,是conjunctive query的图表示。其中nodes是variable/non-variable entities,edges是q中的关系.

valid query $\rightarrow$ directed acyclic graph(有向无环图)，锚点是source nodes, query target $V_{?}$ 是唯一的sink node.

从dependency graph可以得出computation graph,其中有两种directed edges:
Projection 投影: Given a set of entities $S \in V$, and relation $r \in R$ , this operator obtains $\cup_{v \in S}A_{r}(v) $, where $A_{r}(v) \equiv {v′\in \mathcal{V} : r(v, v′) = True}$
Intersection 交集: Given a set of entity sets ${S_1, S_2, . . . , S_n}$, this operator obtains $\cap_{i=1}^{n}S_i$

computation graph 具体化了寻找sink node的操作

3.2 REASONING OVER SETS OF ENTITIES USING BOX EMBEDDINGS

上节已经定义了conjunctive queries,只要按照上面的computation graph 直接执行就可以了.接下来在vector space定义logic reasoning.

步骤: query–>decompose into sequence of logical operations–>execute the operations in the vector space–>embedding of query

two methodological advances:
box embeddings –> efficient & reason over sets of entities in the vector space
tractably handle disjunction operator ($\lor$)

Box embeddings.

$Box_\mathsf{p}\equiv{\mathsf{v}\in\mathbb{R}^{d}: Cen(\mathsf{p})-Off(\mathsf{p})\preceq v\preceq Cen(\mathsf{p})+Off(\mathsf{p})}$

相当于一个以cen(p)为中心，以off(p)为范围的一个d维矩形

其中p=(Cen(p),Off(p)), p $\in\mathbb{R}^{2d}$,

Initial boxes for source nodes.

initial box: (v,0)

Geometric projection operator.

每个$r$对应着 relation embedding r=(Cen(r),Off(r)),其中Off(r) $\succeq$ 0

input box embedding p –projection–> p+r
映射到了一个更大的超矩形中去

Geometric intersection operator.

这边有learning parameter

？intersection操作一定需要learn出来不能直接表示出来吗脚注：One possible choice here would be to directly use raw box intersection, however, we find that our richer learnable parameterization is more expressive and robust ? but why

对a set of box embeddings {p1,…,pn} ,intersection 记作 $\mathsf{p}{inter}$=(Cen($\mathsf{p}{inter}$),Off($\mathsf{p}_{inter}$))

Entity-to-box distance.

给定一个box q (dimention=2d)和一个entity vector v (dimension=d)，距离定义如下：

$dist_{box}(v;q)=dist_{outside}(v;q)+\alpha\cdot dist_{inside}(v;q)$

关键在于把$\alpha$取在0~1,这样可以降低inside的权重，使得内部的entity vectors都可以被认为离q足够近

Training objective.

除开geometric projection 和intersection operators, 还要learn entity embeddings.

用这个loss function

？为什么这边positive entity(i.e. answer to the query)只有一个而negative entities有多个

3.3 TRACTABLE HANDLING OF DISJUNCTION USING DISJUNCTIVE NORMAL FORM

定义一种新的edge(之前只有projection 和 intersection)：Union

Union: Given a set of entitiy sets ${S_1,…,S_n}$,this operator obtains $\bigcup_{i=1}^{n}S_i$

主要问题在于union对于box来说并不是一个封闭的操作，union若出现在computation graph的任意位置，则得到的结果并不能接下去进行projection / intersection / union的操作

复杂度问题–>需要把 EPFO query 转化为 DNF

Transformation to DNF.

computation graph 中，相当于把union移动到最后一步来做

$dist_{agg}(\mathsf{v;q})=Min({dist_{box}(\mathsf(v);\mathsf{q}^{(1)}),…,dist_{box}(\mathsf(v);\mathsf{q}^{(N)})})$

4 EXPERIMENTS

4.1 KNOWLEDGE GRAPHS AND QUERY GENERATION

? 为什么是通过split edges into train / test / validation 要测试 **incomplete** KG上模型的推断能力

4.2 EVALUATION PROTOCOL

4.3 Baseline and Model Variants

• Q2B (our method): The box embeddings are used to model queries, and the attention mechanism is used for the intersection operator.
• Q2B-AVG: The attention mechanism for intersection is replaced with averaging.
• Q2B-DEEPSETS: The attention mechanism for intersection is replaced with the deep sets.
• Q2B-AVG-1P: The variant of Q2B-AVG that is trained with only 1p queries (see Fig. 4); thus, logical operators are not explicitly trained.
• Q2B-SHAREDOFFSET: The box offset is shared across all queries (every query is represented by a box with the same trainable size).