Skip to main content
An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction

An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction

Guiliang Liu, Xu Li, Miningming Sun, P. Li

00
2020-01-01
Computer ScienceJournalArticleConference

Abstract

Open Information Extraction (OIE) is a task of generating the structured representations of information from natural language sentences. Recently years, many works have trained an End-to-End OIE extractor based on Sequence-to-Sequence (Seq2Seq) model and applied Reinforce Algorithm to update the model. However, the model performance often suffers from a large training variance and limited exploration. This paper introduces a reinforcement learning framework that enables an Advantage Actor-Critic (AAC) algorithm to update the Seq2Seq model with samples from a novel Confidence Exploration (CE). The AAC algorithm reduces the training variance with a fine-grained evaluation of each individual word. The confidence exploration provides effective training samples by exploring the word at key positions. Empirical evaluations demonstrate the leading performance of our Advantage Actor-Critic algorithm and Confidence Exploration over other comparison methods.