The Development of Big Data Processing (大数据处理发展.pptx)
The Evolution of Large-Scale Data Processing
CONTENTS
MapReduce
Storm
Spark
Kafka
Flink
MapReduce
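A minimal sketch of the MapReduce programming model, using the usual word-count example. This is plain single-process Python, not Hadoop code; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(document: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: combine all values of one key into a single result."""
    return key, sum(values)


def word_count(documents: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over all documents."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "big clusters process big data"]))
```

In a real cluster the map and reduce calls run on different machines and the shuffle moves data over the network; here all three steps run in one process to keep the data flow visible.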
Hadoop
HDFS
Read Data from HDFS
Write Data to HDFS
YARN
Storm
Lambda Architecture
Spark
Spark SQL
Spark Streaming
Disadvantages
Uses processing time, not event time
Complex, low-level API
Hard to reason about end-to-end applications
Batch and streaming code is not uniform
Structured Streaming
Spark MLlib
Spark GraphX
Kafka
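A minimal sketch of the idea behind a Kafka topic: a set of append-only partitions in which every record gets an offset, and consumers read from an offset they choose. This is an in-memory toy, not the Kafka client API; the `Topic` class and its methods are hypothetical, and `zlib.crc32` stands in for Kafka's own key hashing.

```python
import zlib
from typing import List, Tuple

Record = Tuple[str, str]  # (key, value)


class Topic:
    """Toy topic: a fixed number of append-only partitions."""

    def __init__(self, num_partitions: int) -> None:
        self.partitions: List[List[Record]] = [[] for _ in range(num_partitions)]

    def append(self, key: str, value: str) -> Tuple[int, int]:
        """Append a record and return (partition, offset).

        Records with the same key always land in the same partition,
        so their relative order is preserved.
        """
        partition = zlib.crc32(key.encode("utf-8")) % len(self.partitions)
        log = self.partitions[partition]
        log.append((key, value))
        return partition, len(log) - 1

    def read(self, partition: int, offset: int, max_records: int = 10) -> List[Record]:
        """Read up to max_records starting at offset; reading never deletes."""
        return self.partitions[partition][offset:offset + max_records]


if __name__ == "__main__":
    topic = Topic(num_partitions=3)
    p, first = topic.append("user-1", "login")
    topic.append("user-1", "click")
    print(topic.read(p, first))  # a consumer can re-read from any offset
```

Because reads do not remove records, a consumer can rewind to an earlier offset and replay the log, which is the property the Kappa Architecture builds on.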
Kappa Architecture
Flink
Time and Window
Watermark
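A minimal sketch of event-time tumbling windows driven by a watermark. This is plain Python, not the Flink API. It assumes a bounded-out-of-orderness watermark (largest event time seen minus a fixed delay), fires a window once the watermark reaches the window's end, and drops events that arrive after their window has fired. The function name and parameters are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Event = Tuple[int, str]  # (event_time, payload)


def tumbling_window_counts(
    events: Iterable[Event],
    window_size: int,
    max_out_of_orderness: int,
) -> Tuple[List[Tuple[int, int]], List[Event]]:
    """Count events per event-time window, in arrival order.

    Returns (fired, dropped): fired is a list of (window_start, count)
    in firing order, dropped holds events that arrived too late.
    """
    open_windows: Dict[int, int] = {}
    fired: List[Tuple[int, int]] = []
    dropped: List[Event] = []
    watermark = float("-inf")
    max_ts = None

    for ts, payload in events:
        start = ts - ts % window_size
        if start + window_size <= watermark:
            # The window for this event has already fired: the event is late.
            dropped.append((ts, payload))
        else:
            open_windows[start] = open_windows.get(start, 0) + 1

        # Advance the watermark; it never moves backwards.
        max_ts = ts if max_ts is None else max(max_ts, ts)
        watermark = max_ts - max_out_of_orderness

        # Fire every open window whose end the watermark has reached.
        for s in sorted(w for w in open_windows if w + window_size <= watermark):
            fired.append((s, open_windows.pop(s)))

    # End of input: flush the windows that are still open.
    for s in sorted(open_windows):
        fired.append((s, open_windows.pop(s)))
    return fired, dropped


if __name__ == "__main__":
    stream = [(1, "a"), (4, "b"), (12, "c"), (3, "d"), (16, "e"), (2, "f"), (25, "g")]
    print(tumbling_window_counts(stream, window_size=10, max_out_of_orderness=5))
```

In the sample stream, the event at time 3 arrives out of order but is still counted because the watermark has not yet passed its window, while the event at time 2 arrives after that window has fired and is dropped.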
Big Data Processing Architecture
Storm
Flink
Spark 2019
Flink 2019
Hadoop 2019
Spark + AI Summit 2019
Flink Forward 2019
Flink China Meetup
What happened to Hadoop?
Distributed Systems
Consistency and Fault Tolerance
Distributed Consensus
Basic Paxos
Multi-Paxos
Raft
Zab
Proof-of-Work
Proof-of-Stake
…
Other Problems
Lamport Clock
Distributed Storage
Remote Procedure Call
Distributed Computing
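A minimal sketch of a Lamport logical clock, one of the topics listed above. Each process keeps a counter, increments it on every local event and send, and on receive jumps to one more than the larger of its own time and the message's timestamp. The `LamportClock` class and its method names are hypothetical.

```python
class LamportClock:
    """Logical clock for one process."""

    def __init__(self) -> None:
        self.time = 0

    def tick(self) -> int:
        """Local event: advance the clock by one."""
        self.time += 1
        return self.time

    def send(self) -> int:
        """Sending is an event; the returned value is the message timestamp."""
        return self.tick()

    def receive(self, message_time: int) -> int:
        """Merge the sender's timestamp, then count the receive as an event."""
        self.time = max(self.time, message_time) + 1
        return self.time


if __name__ == "__main__":
    a, b = LamportClock(), LamportClock()
    a.tick()                   # a: local event
    stamp = a.send()           # a sends a message to b
    print(b.receive(stamp))    # b's clock moves past the message timestamp
```

The rule guarantees that if one event causally precedes another, it has the smaller timestamp; the converse does not hold, so equal or ordered timestamps alone do not prove causality.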
Thanks