home..

Faithful Reasoning Using Large Language Models (2022 Deepmind)

0. Abstract

QA task, flaws of language models

1. Introduction

flaws of LM in QA task

faithful reasoning

model description

2. Defining a Valid Reasoning Trace

fig1

3. Components of a Faithful Reasoning Model

3.1.1 Selection

selection LM: training an LM to refer to statements in the context by their sentence labels fig3

3.1.2 Inference

assumption: the Inference model produces logically correct inferences

3.2 Halting: when to stop reasoning?

如果超过一定的iteration仍然没有得到最终结果,就判定为’unknown’。最终的实验结果把那些判定为unknown的都去掉了。

inference的方向比较随机,没有定向,直觉上会产生很多的unknown question

3.3 Search: Finding the Best Trace

4. Experimental Setup and Evaluation of Components

© 2023 huyi   •  Powered by Soopr   •  Theme  Moonwalk