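0 Introduction

With the continued development of Internet technology, the Internet has become a primary medium for the exchange of cultural information. This information includes a considerable amount of malicious, reactionary, and otherwise unhealthy content. This paper proposes a web page pre-scanning system based on a multi-threaded architecture. The system performs a coarse scan of each web page captured by monitoring, using cross-layer direct scanning, to make a preliminary judgment on whether the page contains illegal content. It then decides whether to carry out a detailed content analysis of the page to determine its safety.

1 Key Technologies

1.1 Packet Capture Technology

A common approach to network data capture is based on the libpcap [1] library. Using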