Deriving Consensus Rankings from Benchmarking Experiments

Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to de...

Full description

Bibliographic Details
Main Authors:	Hornik, Kurt, Meyer, David
Format:	Others
Language:	en
Published:	Department of Statistics and Mathematics, WU Vienna University of Economics and Business 2006
Subjects:	benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming
Online Access:	http://epub.wu.ac.at/1300/1/document.pdf

id	ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_967
record_format	oai_dc
spelling	ndltd-VIENNA-oai-epub.wu-wien.ac.at-epub-wu-01_9672017-03-01T05:46:35Z Deriving Consensus Rankings from Benchmarking Experiments Hornik, Kurt Meyer, David benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. Department of Statistics and Mathematics, WU Vienna University of Economics and Business 2006 Paper NonPeerReviewed en application/pdf http://epub.wu.ac.at/1300/1/document.pdf Series: Research Report Series / Department of Statistics and Mathematics http://epub.wu.ac.at/1300/
collection	NDLTD
language	en
format	Others
sources	NDLTD
topic	benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming
spellingShingle	benchmark experiments / consensus rankings / Borda / Condorcet / symmetric difference / linear order / poset / linear programming Hornik, Kurt Meyer, David Deriving Consensus Rankings from Benchmarking Experiments
description	Whereas benchmarking experiments are very frequently used to investigate the performance of statistical or machine learning algorithms for supervised and unsupervised learning tasks, overall analyses of such experiments are typically only carried out on a heuristic basis, if at all. We suggest to determine winners, and more generally, to derive a consensus ranking of the algorithms, as the linear order on the algorithms which minimizes average symmetric distance (Kemeny-Snell distance) to the performance relations on the individual benchmark data sets. This leads to binary programming problems which can typically be solved reasonably efficiently. We apply the approach to a medium-scale benchmarking experiment to assess the performance of Support Vector Machines in regression and classification problems, and compare the obtained consensus ranking with rankings obtained by simple scoring and Bradley-Terry modeling. === Series: Research Report Series / Department of Statistics and Mathematics
author	Hornik, Kurt Meyer, David
author_facet	Hornik, Kurt Meyer, David
author_sort	Hornik, Kurt
title	Deriving Consensus Rankings from Benchmarking Experiments
title_short	Deriving Consensus Rankings from Benchmarking Experiments
title_full	Deriving Consensus Rankings from Benchmarking Experiments
title_fullStr	Deriving Consensus Rankings from Benchmarking Experiments
title_full_unstemmed	Deriving Consensus Rankings from Benchmarking Experiments
title_sort	deriving consensus rankings from benchmarking experiments
publisher	Department of Statistics and Mathematics, WU Vienna University of Economics and Business
publishDate	2006
url	http://epub.wu.ac.at/1300/1/document.pdf
work_keys_str_mv	AT hornikkurt derivingconsensusrankingsfrombenchmarkingexperiments AT meyerdavid derivingconsensusrankingsfrombenchmarkingexperiments
_version_	1718418765815742464

Deriving Consensus Rankings from Benchmarking Experiments

Similar Items