
Design of an Internet E-mail Filter (Internet电子邮件过滤器的设计.doc)

Published: 2017-12-27; approx. 23,100 characters; 46 pages
Design of an Internet E-mail Filter (graduation thesis)

Abstract

With the continuous development of network technology and its applications, daily life has become inseparable from networked information technology, which is changing the way we live and work at unprecedented speed and scale. The spam that arrived along with it has become a major problem for the Internet, so the study and design of an efficient spam filtering system is of real research significance.

First, this thesis gives a brief overview of the background and research significance of spam and surveys the state of research in China and abroad, including recent filtering techniques. It then reviews background knowledge about spam, describing how spam has developed and the harm it causes, and goes on to study how e-mail works and the related mail protocols.

The filter designed here is content-based. After analyzing the advantages and disadvantages of several mail filtering techniques, the thesis settles on a spam filter built on the naive Bayes algorithm. Bayesian methods are common and widely applicable in text categorization, and spam filtering is essentially a text classification problem, which is why a Bayesian algorithm was chosen for the filter design.

Second, by comparing alternatives, the thesis selects reasonable and effective e-mail preprocessing techniques (including mail content extraction, e-mail decoding, Chinese word segmentation, keyword extraction, and construction of a feature library) in order to design a better spam filter.

Finally, the spam filtering system is implemented in Java and tested at the practical application level. The experimental results show reliability and practicality, and satisfactory results for spam classification and Chinese-language filtering have been …

Keywords: e-mail; mail filtering; e-mail filtering system; naive Bayes algorithm
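To make the classification idea in the abstract concrete, the following is a minimal sketch of a multinomial naive Bayes spam classifier in Java. It is not the thesis's implementation, which the preview does not show. The class name `NaiveBayesSketch`, its methods, the add-one (Laplace) smoothing, and the sample tokens are all assumptions made for illustration. The input is assumed to be already decoded and tokenized, for example by the Chinese word segmentation step the abstract mentions.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration: multinomial naive Bayes with add-one smoothing.
class NaiveBayesSketch {
    private final Map<String, Integer> spamCounts = new HashMap<>();
    private final Map<String, Integer> hamCounts = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int spamTokens = 0;
    private int hamTokens = 0;
    private int spamDocs = 0;
    private int hamDocs = 0;

    // Record the token counts of one labeled, pre-tokenized message.
    public void train(String[] tokens, boolean isSpam) {
        Map<String, Integer> counts = isSpam ? spamCounts : hamCounts;
        for (String token : tokens) {
            counts.merge(token, 1, Integer::sum);
            vocabulary.add(token);
        }
        if (isSpam) {
            spamDocs++;
            spamTokens += tokens.length;
        } else {
            hamDocs++;
            hamTokens += tokens.length;
        }
    }

    // log P(class) + sum of log P(token | class), using add-one smoothing
    // so that unseen tokens do not drive the probability to zero.
    private double logScore(String[] tokens, Map<String, Integer> counts,
                            int classTokens, int classDocs) {
        double score = Math.log((double) classDocs / (spamDocs + hamDocs));
        int vocabSize = vocabulary.size();
        for (String token : tokens) {
            int count = counts.getOrDefault(token, 0);
            score += Math.log((count + 1.0) / (classTokens + vocabSize));
        }
        return score;
    }

    // Assumes at least one spam and one non-spam message have been trained.
    public boolean isSpam(String[] tokens) {
        double spam = logScore(tokens, spamCounts, spamTokens, spamDocs);
        double ham = logScore(tokens, hamCounts, hamTokens, hamDocs);
        return spam > ham;
    }

    public static void main(String[] args) {
        NaiveBayesSketch filter = new NaiveBayesSketch();
        filter.train(new String[] {"free", "prize", "click"}, true);
        filter.train(new String[] {"project", "meeting", "notes"}, false);
        System.out.println(filter.isSpam(new String[] {"free", "click"}));
    }
}
```

Working in log space avoids numeric underflow when many small probabilities are multiplied. A real filter would also need the decoding, content extraction, and feature selection steps listed in the abstract, none of which are modeled here.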