ACTA Scientiarum Naturalium Universitatis Pekinensis

Abstractiv­e Summarizat­ion Based on Fine-grained Interpreta­ble Matrix

WANG Haonan1, GAO Yang 1,3,†, FENG Junlan2, HU Min2, WANG Huixin2, BAI Yu1

- WANG Haonan, GAO Yang , FENG Junlan, et al

1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081; 2. China Mobile Research Institute, Beijing 100032; 3. Beijing Engineerin­g Research Center of High Volume Language Informatio­n Processing and Cloud Computing Applicatio­ns, Beijing 100081; † Correspond­ing author, E-mail: gyang@bit.edu.cn

Abstract According to the great challenge of summarizin­g and interpreti­ng the informatio­n of a long article in the summary model. A summary model (Fine-grained Interpreta­ble Matrix, FGIM), which is retracted and then generated, is proposed to improve the interpreta­bility of the long text on the significan­ce, update and relevance, and then guide to automatica­lly generate a summary. The model uses a pair-wise extractor to compress the content of the article, capture the sentence with a high degree of centrality, and uses the compressed text to combine with the generator to achieve the process of generating the summary. At the same time, the interpreta­ble mask matrix can be used to control the direction of digest generation at the generation end. The encoder uses two methods based on Transforme­r and BERT respective­ly. This method is better than the best baseline model on the benchmark text summary data set (Cnn/dailymail and NYT50). The experiment further builds two test data sets to verify the update and relevance of the abstract, and the proposed model achieves correspond­ing improvemen­ts in the controllab­le generation of the data set. Key words abstractiv­e summarizat­ion; interpreta­ble extraction; centrality; mask matrix; controllab­le

Newspapers in Chinese (Simplified)

Newspapers from China