Distributed Optimization and Data Market Design

<p>We consider algorithms for distributed optimization and their applications. In this thesis, we propose a new approach for distributed optimization based on an emerging area of theoretical computer science – local computation algorithms. The approach is fundamentally different from existing...

Full description

Bibliographic Details
Main Author: London, Palma Alise den Nijs
Format: Others
Published: 2017
Online Access:https://thesis.library.caltech.edu/10158/1/PLondon_MS_Thesis.pdf
London, Palma Alise den Nijs (2017) Distributed Optimization and Data Market Design. Master's thesis, California Institute of Technology. doi:10.7907/Z9SX6B76. https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697 <https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697>
id ndltd-CALTECH-oai-thesis.library.caltech.edu-10158
record_format oai_dc
spelling ndltd-CALTECH-oai-thesis.library.caltech.edu-101582020-08-29T05:01:30Z Distributed Optimization and Data Market Design London, Palma Alise den Nijs <p>We consider algorithms for distributed optimization and their applications. In this thesis, we propose a new approach for distributed optimization based on an emerging area of theoretical computer science – local computation algorithms. The approach is fundamentally different from existing methodologies and provides a number of benefits, such as robustness to link failure and adaptivity to dynamic settings. Specifically, we develop an algorithm, LOCO, that given a convex optimization problem P with n variables and a “sparse” linear constraint matrix with m constraints, provably finds a solution as good as that of the best online algorithm for P using only O(log(n + m)) messages with high probability. The approach is not iterative and communication is restricted to a localized neighborhood. In addition to analytic results, we show numerically that the performance improvements over classical approaches for distributed optimization are significant, e.g., it uses orders of magnitude less communication than ADMM.</p> <p>We also consider the operations of a geographically distributed cloud data market. We consider design decisions that include which data to purchase (data purchasing) and where to place or replicate the data for delivery (data placement). We show that a joint approach to data purchasing and data placement within a cloud data market improves operating costs. This problem can be viewed as a facility location problem, and is thus NP-hard. However, we give a provably optimal algorithm for the case of a data market consisting of a single data center, and then generalize the result from the single data center setting in order to develop a near-optimal, polynomial-time algorithm for a geo-distributed data market. The resulting design, Datum, decomposes the joint purchasing and placement problem into two subproblems, one for data purchasing and one for data placement, using a transformation of the underlying bandwidth costs. We show, via a case study, that Datum is near-optimal (within 1.6%) in practical settings.</p> 2017 Thesis NonPeerReviewed application/pdf https://thesis.library.caltech.edu/10158/1/PLondon_MS_Thesis.pdf https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697 London, Palma Alise den Nijs (2017) Distributed Optimization and Data Market Design. Master's thesis, California Institute of Technology. doi:10.7907/Z9SX6B76. https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697 <https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697> https://thesis.library.caltech.edu/10158/
collection NDLTD
format Others
sources NDLTD
description <p>We consider algorithms for distributed optimization and their applications. In this thesis, we propose a new approach for distributed optimization based on an emerging area of theoretical computer science – local computation algorithms. The approach is fundamentally different from existing methodologies and provides a number of benefits, such as robustness to link failure and adaptivity to dynamic settings. Specifically, we develop an algorithm, LOCO, that given a convex optimization problem P with n variables and a “sparse” linear constraint matrix with m constraints, provably finds a solution as good as that of the best online algorithm for P using only O(log(n + m)) messages with high probability. The approach is not iterative and communication is restricted to a localized neighborhood. In addition to analytic results, we show numerically that the performance improvements over classical approaches for distributed optimization are significant, e.g., it uses orders of magnitude less communication than ADMM.</p> <p>We also consider the operations of a geographically distributed cloud data market. We consider design decisions that include which data to purchase (data purchasing) and where to place or replicate the data for delivery (data placement). We show that a joint approach to data purchasing and data placement within a cloud data market improves operating costs. This problem can be viewed as a facility location problem, and is thus NP-hard. However, we give a provably optimal algorithm for the case of a data market consisting of a single data center, and then generalize the result from the single data center setting in order to develop a near-optimal, polynomial-time algorithm for a geo-distributed data market. The resulting design, Datum, decomposes the joint purchasing and placement problem into two subproblems, one for data purchasing and one for data placement, using a transformation of the underlying bandwidth costs. We show, via a case study, that Datum is near-optimal (within 1.6%) in practical settings.</p>
author London, Palma Alise den Nijs
spellingShingle London, Palma Alise den Nijs
Distributed Optimization and Data Market Design
author_facet London, Palma Alise den Nijs
author_sort London, Palma Alise den Nijs
title Distributed Optimization and Data Market Design
title_short Distributed Optimization and Data Market Design
title_full Distributed Optimization and Data Market Design
title_fullStr Distributed Optimization and Data Market Design
title_full_unstemmed Distributed Optimization and Data Market Design
title_sort distributed optimization and data market design
publishDate 2017
url https://thesis.library.caltech.edu/10158/1/PLondon_MS_Thesis.pdf
London, Palma Alise den Nijs (2017) Distributed Optimization and Data Market Design. Master's thesis, California Institute of Technology. doi:10.7907/Z9SX6B76. https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697 <https://resolver.caltech.edu/CaltechTHESIS:05092017-112643697>
work_keys_str_mv AT londonpalmaalisedennijs distributedoptimizationanddatamarketdesign
_version_ 1719338964925546496