文档详情

基于FTP日志的数据挖掘系统-数据预处理系统的设计与实现(毕业设计论文).docx

发布:2017-05-25约1.99万字共39页下载文档
文本预览下载声明
本科毕业设计题目:基于FTP日志的数据挖掘系统 -数据预处理系统的设计与实现摘要计算机和互联网爆炸式发展的发展让我们看到信息时代到来的大潮,我们在网络上的行为也自然成为一种可挖掘的“财富”。国内外的高校或是科技企业也越来越多的投入到网络数据日志的数据挖掘挖掘中去,为的是帮助商业机构或是社会组织提供基于精准数据决策建议。校园教学FTP日志作为一种长期存在的数据也具有非常高的数据价值。针对FTP的运行日志的数据挖掘系统是通过对校园教学FTP日志的定时提取、数据预处理、数据挖掘、结果可视化的方法实现信息的挖掘。运用eclipse软件开发相关平台;采用多线程处理的方法为我们提高处理的基础数据时的处理效率;利用分布式计算的方式组建可根据需要添加计算能力的计算机客户机、搭建Lucene搜索引擎式的全文搜素数据库为数据挖掘提供高速索引的角色、经过我们优化的关联算法进行数据挖掘后,利用JFreeChart2应用程序用图形化的方法显示数据的相关联性。该系统实现了FTP日志数据的定期自动获得、并会对提供运算能力的客户机进行运算前的能力扫描、实现了主要服务器中的文件切割、建立了Lucene全文数据搜索引擎、实现结果信息图标可视化。关键词: FTP日志;数据挖掘;数据预处理;索引库AbstractThe development of the explosive growth of computers and the Internet allows us to see the tide of the arrival of the information age, our behavior on the network has naturally become a mining wealth. Universities or technology enterprises at home and abroad, more and more into the network data log data mining, in order to help commercial organizations or social organizations to provide policy recommendations based on accurate data. Campus teaching FTP log also has a very high data value as a long-term data.For a running log of the FTP data mining system is the timing of the campus teaching FTP log extraction, data preprocessing, data mining, mining results visualization information.Eclipse software development platform; multi-threaded processing method for us to improve the processing of data processing efficiency; using distributed computing to set up the computer client can add computing power, to build the full text of the Lucene search engine-Search prime database data mining role to provide high-speed index, we optimize association algorithm data mining using JFreeChart applications with graphical display data associated.The system realizes the of regular FTP log data automatically get and will provide the computing power of the client the ability to scan before the operation, the main file server cutting Lucene full-text search engine result information icon visualization.Keywords:FTP log; data mining; data preproce
显示全部
相似文档