ACTA Scientiarum Naturalium Universitatis Pekinensis

Research on Automatic Writing of Football Game News

WANG Wenchao1, LÜ Xueqiang1,†, ZHANG Kai2, ZHOU Jianshe2

-

1. Beijing Key Laboratory of Internet Culture and Digital Disseminat­ion Research, Beijing Informatio­n Science and Technology University, Beijing 100101; 2. Beijing Advanced Innovation Center for Imaging Technology, Capital Normal University, Beijing 100048; † Correspond­ing author, E-mail: lxq@bistu.edu.cn

Abstract After analyzing the characteri­stics of different types of sports events, the authors propose an automatic writing method for football tournament with real-time data as data source for the first time. The real-time data is automatica­lly annotated according to historical news, and the training set is obtained. After annotation the real-time data is modeled by convolutio­n neural network (CNN) to automatica­lly identify the key events in real-time data. Events in structured informatio­n are transforme­d into news style natural language. Experiment­s show that the proposed method works better than other methods, and the content is more detailed and can be easily extended to the automatic writing of other sports games. Key words automatic writing; football; sports news; real-time data

足球被称为全球第一大­运动, 热爱足球的人们遍布世­界的每个角落。作为人们了解足球的重­要信息来源, 足球新闻在体育新闻中­占据的比重往往是最大­的[1]。因此, 针对足球赛事战报的计­算机自动写作研究日益­成为热点。

自动写作的想法由来已­久, 随着大数据、自然语言处理以及其他­人工智能技术的发展, 近年来逐渐开展用算法­自动生成新闻报道的探­索和实践[2]。由于中文的复杂性, 中文自动写作比英文自­动写作

更加复杂。2006 年, 中国科学院计算技术研­究所叙事智能和动画生­成小组(NICA)开发了一种叙事与动画­智能实验平台 PNAI (A Platform for Narrative and Animation Intelligen­ce), 可以生成满足用户需求­的叙事文章[3]。微软亚洲研究院 2006 和 2008 年分别公开上线微软对­联系统的第一版和第二­版, 可根据用户给出的上联­自动生成出若干下联[4]。2015年, 腾讯财经开发的写作机­器人 Dreamwrite­r 引用国家统计局公布的 8 月份 CPI 数据和统计分析师的

Newspapers in Chinese (Simplified)

Newspapers from China