검색 상세

Multi-sentence Compression using Recursive Generation

초록/요약

In this paper, we present a new method that summarizes multiple sentences into one sentence. Such a task is useful in domains such as short text summarization. Despite the usefulness of this issue, many studies have not been proposed on the matter. One of the reasons is the training data scarcity: it is impractical to prepare a large data set that contains a sufficiently diverse set of combinations to ensure generalization. We aim to alleviate this difficulty by reformulating the original problem into a problem that produces sentences sequentially. That is, we construct a recursive generation model that takes two sentences in order: the input sentence of the current time step and the generalized sentence so far. Since there is no need to consider the combination of the total input sentences, less complex training data is required than the original problem. Based on the recursive generation model, we propose to summarize the sequentially-input sentences in an online, stochastic manner, where the summary is updated with probability proportional to semantic similarity. Thanks to this stochastic generation scheme, our approach is more flexible for handling situations where inputs are not similar, thus produces summaries with better semantic compression.

more

목차

I. Introduction 1
II. Related Works 3
A. Natural Language Understanding 3
B. Multi-Sentence Compression 3
III. Algorithm 6
A. Recursive Generation 6
B. Training Model 11
C. Generation Strategies 13
1. Myopic Generation 13
2. Stochastic Generation 13
IV. Experiments 15
A. Data 15
B. Settings 18
C. Results 19
1. Comparison with Baseline 19
2. Generation 22
3. Myopic vs. Stochastic 23
4. Application 25
5. Visualization 27
V. Conclusion and Future Works 28
VI. References 30

more