![]() Constrained k-means clustering with background knowledge. Wagstaff K, Cardie C, Rogers S, Schrödl S. the 9th International Conference on Autonomic Computing, September 2012, pp.53-62. Automated profiling and resource management of pig programs for meeting service level objectives. The Journal of Machine Learning Research, 2016, 17(106): 1-37. Multitask learning for straggler avoiding predictive job scheduling. Yadwadkar N J, Hariharan B, Gonzalez J E, Katz R. the 20th IEEE International Symposium on High Performance Computer Architecture, Feb. BigDataBench: A big data benchmark suite from Internet services. Wang L, Zhan J, Luo C, Zhu Y, Yang Q, He Y, Gao W, Jia Z, Shi Y, Zhang S. the 12th USENIX Symposium on Operating Systems Design and Implementation, November 2016, pp.65-80. Altruistic scheduling in multi-resource clusters. Grandl R, Chowdhury M, Akella A, Ananthanarayanan G. the 9th IEEE World Congress on Services, June 28-July 3, 2013, pp.456-463. CloudAdvisor: A recommendation-as-a-service platform for cloud configuration and pricing. ![]() Jung G, Mukherjee T, Kunde S, Kim H, Sharma N, Goetz F. the 2nd ACM Symposium on Cloud Computing, October 2011, Article No. No one (cluster) size fits all: Automatic cluster sizing for data-intensive analytics. the 2013 International Workshop on Multi-Cloud Applications and Federated Clouds, April 2013, pp.21-26. Towards multi-cloud configurations using feature models and ontologies. Quinton C, Haderer N, Rouvoy R, Duchien L. Journal of Big Data, 2018, 5(1): Article No. A survey on addressing high-class imbalance in big data. ![]() Leevy J L, Khoshgoftaar T M, Bauder R A, Seliya N. In Encyclopedia of Measurement and Statistics, Salkind N J (ed.), SAGE, 2007, pp.508-510. The Kendall rank correlation coefficient. the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2015, pp.1235-1244.Ībdi H. Collaborative deep learning for recommender systems. the 38th IEEE International Conference on Distributed Computing Systems, July 2018, pp.660-670. Arrow: Low-level augmented Bayesian optimization for finding the best cloud VM. the 13th USENIX Symposium on Networked Systems Design and Implementation, March 2016, pp.363-378. Ernest: Efficient performance prediction for large-scale advanced analytics. Venkataraman S, Yang Z, Franklin M, Recht B, Stoica I. ACM Transactions on Computer Systems, 2013, 31(4): Article No. QoS-aware scheduling in heterogeneous datacenters with paragon. the 14th USENIX Symposium on Networked Systems Design and Implementation, March 2017, pp.469-482.ĭelimitrou C, Kozyrakis C. Cherrypick: Adaptively unearthing the best cloud configurations for big data analytics. the 11th ACM Symposium on Cloud Computing, October 2020, pp.208-222.Īlipourfard O, Liu H H, Chen J, Venkataraman S, Yu M, Zhang M. Finding the right cloud configuration for analytics clusters. Our evaluation on 12 typical workloads in HiBench shows that compared with state-of-the-art approaches, Apollo can improve up to 30% search accuracy, while reducing as much as 50% overhead for picking the optimal cloud configurations.īilal M, Canini M, Rodrigues R. At last, we leverage a hierarchical regression model to measure which cluster is more suitable and use a local search strategy to pick the optimal cloud configurations in a few extra tests. Based on the rank, we then limit the search space of cloud configurations through a classification mechanism. When a new workload comes, we run it with several small datasets to rank its key characteristics and get its similar workloads. We first classify 12 typical workloads in BigDataBench by characterizing pairwise correlations in our offline benchmarks. We propose Apollo, a data-driven approach that can rapidly pick the optimal cloud configurations by reusing data from similar workloads. In this paper, we address this problem with a high accuracy and a low overhead. Big data analytics applications are increasingly deployed on cloud computing infrastructures, and it is still a big challenge to pick the optimal cloud configurations in a cost-effective way.
0 Comments
Leave a Reply. |