Deriving Consensus Rankings from Benchmarking Experiments

Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to de...

Full description

Bibliographic Details
Main Authors: Hornik, Kurt, Meyer, David
Format: Others
Language:en
Published: Department of Statistics and Mathematics, WU Vienna University of Economics and Business 2006
Subjects:
Online Access:http://epub.wu.ac.at/1300/1/document.pdf
id ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_967
record_format oai_dc
spelling ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_9672017-03-01T05:46:35Z Deriving Consensus Rankings from Benchmarking Experiments Hornik, Kurt Meyer, David benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. Department of Statistics and Mathematics, WU Vienna University of Economics and Business 2006 Paper NonPeerReviewed en application/pdf http://epub.wu.ac.at/1300/1/document.pdf Series: Research Report Series / Department of Statistics and Mathematics http://epub.wu.ac.at/1300/
collection NDLTD
language en
format Others
sources NDLTD
topic benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming
spellingShingle benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming
Hornik, Kurt
Meyer, David
Deriving Consensus Rankings from Benchmarking Experiments
description Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. === Series: Research Report Series / Department of Statistics and Mathematics
author Hornik, Kurt
Meyer, David
author_facet Hornik, Kurt
Meyer, David
author_sort Hornik, Kurt
title Deriving Consensus Rankings from Benchmarking Experiments
title_short Deriving Consensus Rankings from Benchmarking Experiments
title_full Deriving Consensus Rankings from Benchmarking Experiments
title_fullStr Deriving Consensus Rankings from Benchmarking Experiments
title_full_unstemmed Deriving Consensus Rankings from Benchmarking Experiments
title_sort deriving consensus rankings from benchmarking experiments
publisher Department of Statistics and Mathematics, WU Vienna University of Economics and Business
publishDate 2006
url http://epub.wu.ac.at/1300/1/document.pdf
work_keys_str_mv AT hornikkurt derivingconsensusrankingsfrombenchmarkingexperiments
AT meyerdavid derivingconsensusrankingsfrombenchmarkingexperiments
_version_ 1718418765815742464