Download PDFOpen PDF in browserQuantifying Scalability Using Extended TPC-DS Performance MetricEasyChair Preprint 363510 pages•Date: June 17, 2020AbstractThe TPC Benchmark™DS (TPC-DS) is a decision support benchmark that models several generally applicable aspects of a decision support system, including data loading, queries and data maintenance. The benchmark provides a representative evaluation of the System Under Test’s (SUT) performance as a general-purpose decision support system. TPC-DS defines three primary metrics. The most important is the Performance Metric, QphDS@SF, reflecting the TPC-DS query throughput at various scale factors. Performance metrics at different scale factors are not comparable, due to the substantially different computational challenges found at different data volumes. Data analytics platforms have two main components, Compute and Storage. In the last decade, many cloud data analytics platforms have begun to separate compute and storage. With this separation, data analytics platforms now can scale compute in and out independently on the same dataset with the same storage. The performance metric QphDS@SF at different compute levels demonstrates how well the system performance scales. This article shows some scalability analysis in the TPC-DS workload on a cloud data analytics platform and proposes a benchmark as an extension to TPC-DS. Keyphrases: Price Performance Metric, Quantify Scalability, Scalability, Scaled Performance Metric, TPC-DS, performance metric
|