An Overview on the Computing Method of the Lexical Chain Text Representation Model

Qu Yunpeng, Wang Wenling

Knowledge Management Forum, 2016, Vol. 1, Issue (2): 136-144. DOI: 10.13266/j.issn.2095-5472.2016.018

Abstract

[Purpose/significance] Text representation is an important step in intelligence processing; a good text representation model reflects document content accurately and fully and improves the results of later processing. The lexical chain text representation method models the lexical cohesion relations in a discourse through lexical chains. It captures the rich semantic information in the discourse and is widely used in automatic summarization, text segmentation, and related fields. [Method/process] We collected and organized research papers on lexical chains and summarized the methods used to construct lexical chains and to disambiguate word senses. Methods for computing lexical cohesion relations fall into three groups: those based on semantic association, those based on statistical information, and those based on graphs. Semantic disambiguation is an important step in lexical chain construction and directly affects both the resulting chains and the efficiency of construction. [Result/conclusion] The lexical chain text representation is structurally simple and widely applicable. The model still has problems: for example, dictionary-based construction has many limitations, and context is not fully taken into account. In the future, lexical chain models may move toward combining semantic-relation methods with statistical algorithms and toward using distributed semantics to strengthen context analysis.

Keywords

lexical chains / lexical cohesion / text representation / natural language processing

Cite this article

Qu Yunpeng, Wang Wenling. An Overview on the Computing Method of the Lexical Chain Text Representation Model[J]. Knowledge Management Forum, 2016, 1(2): 136-144. https://doi.org/10.13266/j.issn.2095-5472.2016.018
