《A Three-Dimensional Data Model in HBase for Large Time-Series Dataset Analysis》.pdf
文本预览下载声明
2012 IEEE 6th International Workshop on the Maintenance and Evolution of Service-Oriented and Cloud-Based Systems (MESOCA)
A Three-Dimensional Data Model in HBase for Large Time-Series Dataset Analysis
Dan Han, Eleni Stroulia
Department of Computing Science
University of Alberta
Edmonton, Canada
{dhan3, stroulia}@cs.ualberta.ca
Abstract—In the transition of applications from the tra- are required for such applications, as the data generated in
ditional enterprise infrastructures to cloud infrastructures, these applications are growing monotonously over time [2].
scalable database management system plays an important role Therefore, large-scale ad-hoc analytical processing of the
in efficiently managing and analysing unprecedented massive
amount of data. Compared to RDBMSs, NoSQL databases, time-series data collected from those cloud-based applica-
are more attractive in addressing this challenge. However, it tions is becoming increasingly valuable to improving the
is not easy to manage data in NoSQL database effectively quality and efficiency of existing services, and discovering
for non-expert users because of the rare data-organization the knowledge.
support. A poor data organization may accidentally abuse the Moreover, the success of this movement necessitates a
features of NoSQL database and achieve unsatisfactory perfor-
mance. Therefore, a systematic method for NoSQL database design of scalable database management system which can
data-s
显示全部