Abstract:
Hyperparameter optimization is often done manually or with a grid search. However, recent research has shown that automatic optimization techniques can accelerate this process and find hyperparameter configurations that lead to better models. Transferring knowledge from previous experiments to a new experiment is currently of particular interest because it has been shown to further improve hyperparameter optimization. We propose to transfer knowledge by means of an initialization strategy for hyperparameter optimization. In contrast to current state-of-the-art initialization strategies, our strategy is neither limited to hyperparameter configurations that were evaluated in previous experiments nor dependent on meta-features. The initial hyperparameter configurations are derived by optimizing a meta-loss that is formally defined in this paper. This loss depends on the hyperparameter response functions of the data sets investigated in past experiments. Since these functions are unknown and only a few observations are available, the meta-loss is not differentiable. We propose to approximate the response function with a differentiable plug-in estimator, which allows us to learn the initial hyperparameter configuration sequence with gradient-based optimization techniques. Extensive experiments are conducted on two meta-data sets. Our initialization strategy is compared to state-of-the-art initialization strategies and to further methods that are able to transfer knowledge between data sets. We give empirical evidence that our work provides an improvement over the state of the art.
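The idea of learning initial configurations by gradient descent on a differentiable surrogate of the meta-loss can be illustrated with a minimal sketch. The abstract does not state the meta-loss or the plug-in estimator, so everything below is an assumption made for illustration: the meta-loss is taken to be the average, over past data sets, of the best predicted loss among the initial configurations (smoothed with a soft-min), kernel ridge regression with an RBF kernel stands in for the plug-in estimator, and the configurations are learned jointly as a set. All names (`Surrogate`, `meta_loss_and_grad`, `learn_initialization`, `demo`) and the synthetic data are hypothetical.

```python
import numpy as np

def rbf(A, B, ell):
    # RBF kernel matrix between the rows of A (q, d) and B (n, d).
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ell**2)

class Surrogate:
    """Assumed plug-in estimator: kernel ridge regression on the
    (configuration, loss) observations of one past data set."""
    def __init__(self, X, y, ell=0.3, ridge=1e-3):
        self.X, self.ell, self.mean = X, ell, y.mean()
        K = rbf(X, X, ell)
        self.alpha = np.linalg.solve(K + ridge * np.eye(len(X)), y - self.mean)

    def predict(self, Q):
        return self.mean + rbf(Q, self.X, self.ell) @ self.alpha

    def grad(self, Q):
        # Analytic gradient of the prediction w.r.t. each query row of Q.
        K = rbf(Q, self.X, self.ell)
        diff = Q[:, None, :] - self.X[None, :, :]
        return -np.einsum("qn,qnd,n->qd", K, diff, self.alpha) / self.ell**2

def meta_loss_and_grad(Q, surrogates, beta=20.0):
    # Assumed meta-loss: mean over data sets of a soft-min (log-sum-exp)
    # of the predicted losses of the configurations in Q.
    loss, grad = 0.0, np.zeros_like(Q)
    for s in surrogates:
        z = -beta * s.predict(Q)
        m = z.max()
        e = np.exp(z - m)
        loss += -(m + np.log(e.sum())) / beta   # smooth approximation of min
        w = e / e.sum()                         # d softmin / d prediction
        grad += w[:, None] * s.grad(Q)
    return loss / len(surrogates), grad / len(surrogates)

def learn_initialization(surrogates, n_init=3, dim=2, steps=200, lr=0.05, seed=0):
    # Projected gradient descent on [0, 1]^dim; returns the best iterate seen.
    rng = np.random.default_rng(seed)
    Q = rng.uniform(0.0, 1.0, size=(n_init, dim))
    initial_loss, _ = meta_loss_and_grad(Q, surrogates)
    best_Q, best_loss = Q.copy(), initial_loss
    for _ in range(steps):
        _, g = meta_loss_and_grad(Q, surrogates)
        Q = np.clip(Q - lr * g, 0.0, 1.0)
        loss, _ = meta_loss_and_grad(Q, surrogates)
        if loss < best_loss:
            best_Q, best_loss = Q.copy(), loss
    return best_Q, initial_loss, best_loss

def demo(seed=0):
    # Synthetic "past experiments": quadratic responses with different optima.
    rng = np.random.default_rng(seed)
    centers = np.array([[0.2, 0.3], [0.25, 0.8], [0.7, 0.6], [0.75, 0.2]])
    surrogates = []
    for c in centers:
        X = rng.uniform(0.0, 1.0, size=(40, 2))
        y = ((X - c) ** 2).sum(axis=1)
        surrogates.append(Surrogate(X, y))
    return learn_initialization(surrogates, seed=seed)
```

Calling `demo()` returns the learned configurations together with the meta-loss before and after optimization. Because the configurations are free parameters of the optimization, they are not restricted to points that were evaluated in the past experiments, and no meta-features are involved, which mirrors the two properties the abstract claims for the proposed strategy.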
Date of Conference: 19-21 October 2015
Date Added to IEEE Xplore: 07 December 2015
Print ISBN: 978-1-4673-8272-4