数据挖掘与决策支持系统课程论文.doc.doc
文本预览下载声明
?
?
?
数据挖掘与决策支持系统课程论文
--中原工学院信息商务学院
?
?
?
?
?
论文题目:关联规则挖掘算法
作者姓名:沈炜
作者学号:200880434217
专业名称:信息管理与信息系统
完成时间:2010年12月13日
摘要:云不同的关联规则表达数据集的不同规律性,并且它们通常预测不同的事情。根据韩家炜等观点,关联规则定义为:
假设I是项的集合。给定一个交易数据库,其中每个事务(Transaction)t是I的非空子集,即,每一个交易都与一个唯一的标识符TID(Transaction ID)对应。关联规则在D中的支持度(support)是D中事务同时包含X、Y的百分比,即概率;置信度(confidence)是包含X的事务中同时又包含Y的百分比,即条件概率。关联规则是有趣的,如果满足最小支持度阈值和最小置信度阈值。这些阈值是根据挖掘需要人为设定
关键字:关联规则 频级Apriori算法
Abstract:The expression of different data sets associated with the different rules of regularity, and they usually predict different things.?According to Han Wei and other point of view, association rule is defined as:?Suppose I is a collection of items.?Given a transaction database, where each transaction (Transaction) t is a nonempty set I, that is, each transaction with a unique identifier TID (Transaction ID) counterparts.?Association rules in D, the degree of support (support) is the D in the transaction also includes X, Y percentage of the probability; confidence (confidence) that contains the X, Y transaction also includes the percentage, the conditional probability.?Association rule is interesting, if the minimum support threshold and minimum confidence threshold.?These thresholds are based on need for artificial excavation.
Key Word:Association rules, Frequency level, Apriori algorithm
目录
绪 论-------------------------------------3
关联规则的挖掘过程-------------------------------------3
2.1:第一阶段------------------------------------------------3
2.2:第二阶段------------------------------------------------3
2.3:轻松共享数据--------------------------------------------3
关联规则的分类-------------------------------------------3
3.1:第一阶段:-----------------------------------------------------------------------3
3.2:第一阶段:-----------------------------------------------------------------------3
3.3:第一阶段:-----------------------------------------------------------------------3
关联规则挖掘的相关算法----------------------------------------------4
4. 1: Apriori性质:-----------------
显示全部