文档详情

大数据和数据挖掘方法.doc

发布:2017-03-14约1.76万字共45页下载文档
文本预览下载声明
山 东 科 技 大 学 本科毕业设计(论文) 题 目 学 院 名 称 专业班级 学生姓名 学 号 指 导 教 师 二0一年月 Abstract With the development of computer technology, the rapid development of Internet and new media, peoples life has entered the information era. Our everyday life is to have a large amount of data, so we get the growing data speed and scale, a large amount of data have been stored in the form of mass data storage medium.The storage, application and mining massive data has become an important proposition that people study. Data mining is stored in the database from the data warehouse, or other information in the library a lot of incomplete, noise fuzzy random data in which the extraction of implicit previously unknown, but potentially useful information and knowledge process. Manifestation: the rules, concepts, rules and patterns. Data mining is a crossed subject, database technology, artificial intelligence, statistics and other fields together to from a new point of view, from a more deep excavation in data within a novel, effective, with potentially useful and ultimately understandable patterns. In data mining, data is divided into training data, test data, and the application of data. The key to data mining is fact finding in the training data, the test data as test and modify the theory basis, the application of knowledge to the data. This paper firstly illustrates the concept and the rise and development of large data, and then introduce various mainstream data mining method. Keywords: large data data mining method of data analysis 目录 大数据及数据挖掘方法 1 摘要 1 Abstract 2 目录 3 1 大数据 1 1.1“大数据”的提出 1 1.2大数据概念、特征及价值 2 1.2.1大数据的概念 2 1.2.2大数据的特征 3 1.2.3大数据的价值 4 1.3大数据形成的必然性 4 1.4大数据发展现状 6 (一)政府积极介入推动 6 (二)资本市场也对大数据钟爱有加 7 (三)人才需求巨大 7 (四)国内情况 7 2大数据的处理 8 3数据挖掘方法 10 3.1神经网络 10 3.1.1人工神经网路基本介绍 10 3.1.2设计神经网路结构 12 3.1.3概率式学习 13 3.1.4神经网路方法优缺点 13 3.2遗传算法 14 3.2.1遗传算法特点 14 3.2.2遗传基本算法 16 3.2.3遗传
显示全部
相似文档