Journal of Software, Vol 4, No 1 (2009), 3-10, Feb 2009
doi:10.4304/jsw.4.1.3-10

A Context Analytical Method Basing on Text Structure

Yi Huang, Jianbin Tan, Lei Zhang

Abstract


In this paper the research techniques of complex network are introduced into the complement of missing data in text and a new method of text mining is put forward basing on the text structure of large-scale texts. First the GRE word net is constructed by using lots of relative articles specially for experiment, then the static characters of this network are analyzed and the context relationships of words are obtained in it according to the community discovery algorithm of complex network, next an complement algorithm is designed to judge whether it is the right complement words by following relationships among these words. In the experiment, we take the examination questions of GRE as test set and use this method to do the sentence completions in verbal sections, the result demonstrates the availability of this text analyzing method which focuses on topology information of network. It can not only apply to the imputation of missing data, but also the complement of full sentence after skeleton’s forming in machine dialogs.



Keywords


text mining, word net, community discovery, complement, missing data

References



Full Text: PDF


Journal of Software (JSW, ISSN 1796-217X)

Copyright @ 2006-2011 by ACADEMY PUBLISHER – All rights reserved.