Research on Related Mining Methods Based on Data Streams - Computer Technology Thesis.docx

Published: 2019-03-29, approx. 56,900 characters, 54 pages
Subject: Research on Related Mining Methods Based on Data Streams
Specialty: Computer Technology
Name: Zhou Fenfen (Signature)
Instructor: Yang Junrui (Signature)

ABSTRACT

Advances in technology have made it easier to generate and collect data. How to extract useful information from large amounts of data to guide production processes and human behavior has become an important research topic in data analysis. As an effective means of data analysis, data mining can discover interesting knowledge in large amounts of data, while the emergence of data streams places higher demands on the related processing technology. Unlike traditional data, a data stream is continuously flowing, unbounded, and high-speed. A mining algorithm for data streams must therefore be fast and incremental, and it must produce results within a given error range using limited memory.

Building on a broad literature review, this thesis studies and analyzes several basic problems of data stream analysis and processing. The main contents are as follows.

Firstly, DSM-Miner, an effective algorithm for mining maximal frequent patterns in a data stream environment, is proposed. It uses a transaction sliding window to fix the number of transactions handled in each processing step, and it applies decay to distinguish old transactions from new ones. It also proposes SWM-Tree, a sliding window maximal frequent pattern tree based on an improved version of the classical FP-Tree structure, and it maintains and stores patterns incrementally by updating the SWM-Tree dynamically. When mining maximal frequent patterns, the algorithm takes the corresponding node of the SWM-Tree as the root of an enumeration tree and uses that enumeration tree as the search space. In addition, the algorithm adopts appropriate pruning operations, a bit item group calculation method and depth-firs
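The transaction sliding window and the decay of old transactions described in the abstract can be illustrated with a small sketch. This is not DSM-Miner and it does not build an SWM-Tree: the class name `DecayedWindow`, the exponential decay rule, the `decay` parameter and the brute-force enumeration of itemsets are all assumptions made for illustration, since the preview does not give the thesis's actual definitions.

```python
from collections import deque
from itertools import combinations


class DecayedWindow:
    """Fixed-size transaction sliding window with exponential decay (illustrative only)."""

    def __init__(self, size, decay):
        # deque(maxlen=...) evicts the oldest transaction automatically.
        self.window = deque(maxlen=size)
        # Hypothetical decay factor in (0, 1]; the thesis may define decay differently.
        self.decay = decay

    def add(self, transaction):
        self.window.append(frozenset(transaction))

    def support(self, itemset):
        """Decayed support: the newest transaction weighs 1, the previous one `decay`, and so on."""
        itemset = frozenset(itemset)
        total = 0.0
        for age, transaction in enumerate(reversed(self.window)):
            if itemset <= transaction:
                total += self.decay ** age
        return total

    def maximal_frequent(self, min_support):
        """Brute-force search that stands in for the tree-based search in the abstract."""
        items = sorted(set().union(*self.window)) if self.window else []
        frequent = [
            frozenset(candidate)
            for k in range(1, len(items) + 1)
            for candidate in combinations(items, k)
            if self.support(candidate) >= min_support
        ]
        # A frequent itemset is maximal if no frequent proper superset exists.
        return [s for s in frequent if not any(s < other for other in frequent)]
```

Enumerating every itemset is exponential in the number of distinct items, so this is only workable for toy inputs. The pruning and enumeration-tree search mentioned in the abstract exist to avoid that cost.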