文档详情

《Druid 实时分析架构设计思路—Imply》.pdf

发布:2015-12-25约3万字共57页下载文档
文本预览下载声明
DRUID INTERACTIVE EXPLORATORY ANALYTICS AT SCALE FANGJIN YANG · DRUID COMMITTER OVERVIEW DEMO SEE SOME NEAT THINGS MOTIVATION WHY DRUID? ARCHITECTURE PICTURES WITH ARROWS COMMUNITY CONTRIBUTE TO DRUID THE PROBLEM ‣ Arbitrary and interactive exploration of time series data • Ad-tech, system/app metrics, network/website traffic analysis ‣ Multi-tenancy: lots of concurrent users ‣ Scalability: 10+ TB/day, ad-hoc queries on trillions of events ‣ Recency matters! Real-time analysis 2013 DEMO IN CASE THE INTERNET DIDN’T WORK PRETEND YOU SAW SOMETHING COOL REQUIREMENTS ‣ Scalable highly available ‣ Real-time data ingestion ‣ Arbitrary data exploration with ad-hoc queries ‣ Sub-second queries ‣ Many concurrent reads 2015 FINDING A SOLUTION ‣ Load all your data into Hadoop. Query it. Done! ‣ Good job guys, let’s go home 2015 FINDING A SOLUTION Hadoop s m a e t r h t g S i s t
显示全部
相似文档