Deriving Consensus Rankings from Benchmarking Experiments
Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to de...
Main Authors: | , |
---|---|
Format: | Others |
Language: | en |
Published: |
Department of Statistics and Mathematics, WU Vienna University of Economics and Business
2006
|
Subjects: | |
Online Access: | http://epub.wu.ac.at/1300/1/document.pdf |
id |
ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_967 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_9672017-03-01T05:46:35Z Deriving Consensus Rankings from Benchmarking Experiments Hornik, Kurt Meyer, David benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. Department of Statistics and Mathematics, WU Vienna University of Economics and Business 2006 Paper NonPeerReviewed en application/pdf http://epub.wu.ac.at/1300/1/document.pdf Series: Research Report Series / Department of Statistics and Mathematics http://epub.wu.ac.at/1300/ |
collection |
NDLTD |
language |
en |
format |
Others
|
sources |
NDLTD |
topic |
benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming |
spellingShingle |
benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming Hornik, Kurt Meyer, David Deriving Consensus Rankings from Benchmarking Experiments |
description |
Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. === Series: Research Report Series / Department of Statistics and Mathematics |
author |
Hornik, Kurt Meyer, David |
author_facet |
Hornik, Kurt Meyer, David |
author_sort |
Hornik, Kurt |
title |
Deriving Consensus Rankings from Benchmarking Experiments |
title_short |
Deriving Consensus Rankings from Benchmarking Experiments |
title_full |
Deriving Consensus Rankings from Benchmarking Experiments |
title_fullStr |
Deriving Consensus Rankings from Benchmarking Experiments |
title_full_unstemmed |
Deriving Consensus Rankings from Benchmarking Experiments |
title_sort |
deriving consensus rankings from benchmarking experiments |
publisher |
Department of Statistics and Mathematics, WU Vienna University of Economics and Business |
publishDate |
2006 |
url |
http://epub.wu.ac.at/1300/1/document.pdf |
work_keys_str_mv |
AT hornikkurt derivingconsensusrankingsfrombenchmarkingexperiments AT meyerdavid derivingconsensusrankingsfrombenchmarkingexperiments |
_version_ |
1718418765815742464 |