A Comparative Study on Two Techniques of Reducing the Dimension of Text Feature Space

来源 :系统工程与电子技术 | 被引量 : 0次 | 上传用户:famzhang
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
With the development of large-scale text processing, the dimension of text feature space has become larger and larger, which has added a lot of difficulties to natural language processing. How to reduce the dimension has become a practical problem in the field. Here we present two clustering methods, i.e. concept association and concept abstract, to achieve the goal. The first refers to the keyword clustering based on the co-occurrence of keywords in the same text, and the second refers to that in the same category. Then we compare the difference between them. Our experiment results show that they are efficient to reduce the dimension of text feature space.
其他文献
Presents analytic criteria for the local activity theory in two-port cellular neural network (CNN) cells with four local state variables, and gives the applicat
In this paper, we present a cluster-based algorithm for time series outlier mining.We use discrete Fourier transformation (DFT) to transform time series from ti
A quasi three dimensions molecular dynamic method was used to simulate the effect of hydrogen on dislocation emission and crack propagation in nickel. In situ o
The solitary wave solutions for the Klein-Gordon-Schrodinger Equations were obtained by using the homogeaeous balance principle. The form of the solutions is mo
Based on pair potential, the Bragg Williams (B-W) model is modified to take into account the effect of the lattice parameter on theoretical order-disorder trans
A new process of low-temperature methanol synthesis from CO/CO2/H2 based on dual-catalysis has been developed. Some alcohols, especially 2-alcohol, were found t
给出了无向双环网络 ( UDLN)的直径的一个新上界 .并由此构造出了两类新的紧优双环网无限族 ,改进了已有的结果 A new upper bound on the diameter of undirected bicyclic