With the rapid development of information technology, networked information has spread across the globe, producing vast quantities of electronic resources in the form of text, images, multimedia, and more. To find what they need in this mass of text, people urgently need an efficient retrieval tool. How to store and query unstructured data such as text efficiently is therefore a question well worth studying, and full-text retrieval has become a research focus for scholars in China and abroad. Although foreign full-text retrieval software came into use relatively early, it is ill-suited to Chinese users in many respects. Chinese full-text retrieval works on the same principles as full-text retrieval for Western languages, but the characteristics of the Chinese language make a Chinese system more complex to implement than a Western-language one. This thesis focuses on full-text retrieval technology, with particular attention to how to apply new technologies, improve the architecture of the retrieval system, raise its performance and efficiency, speed up retrieval, and keep pace with the growth of networked information.