《Druid 实时分析架构设计思路—Imply》.pdf
文本预览下载声明
DRUID
INTERACTIVE EXPLORATORY ANALYTICS AT SCALE
FANGJIN YANG · DRUID COMMITTER
OVERVIEW
DEMO SEE SOME NEAT THINGS
MOTIVATION WHY DRUID?
ARCHITECTURE PICTURES WITH ARROWS
COMMUNITY CONTRIBUTE TO DRUID
THE PROBLEM
‣ Arbitrary and interactive exploration of time series data
• Ad-tech, system/app metrics, network/website traffic analysis
‣ Multi-tenancy: lots of concurrent users
‣ Scalability: 10+ TB/day, ad-hoc queries on trillions of events
‣ Recency matters! Real-time analysis
2013
DEMO
IN CASE THE INTERNET DIDN’T WORK
PRETEND YOU SAW SOMETHING COOL
REQUIREMENTS
‣ Scalable highly available
‣ Real-time data ingestion
‣ Arbitrary data exploration with ad-hoc queries
‣ Sub-second queries
‣ Many concurrent reads
2015
FINDING A SOLUTION
‣ Load all your data into Hadoop. Query it. Done!
‣ Good job guys, let’s go home
2015
FINDING A SOLUTION
Hadoop
s
m
a
e t
r h
t g
S i
s
t
显示全部