Log in to save to my catalogue

Factored Contextual Policy Search with Bayesian Optimization

Factored Contextual Policy Search with Bayesian Optimization

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2216560088

Factored Contextual Policy Search with Bayesian Optimization

About this item

Full title

Factored Contextual Policy Search with Bayesian Optimization

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2019-04

Language

English

Formats

Publication information

Publisher

Ithaca: Cornell University Library, arXiv.org

More information

Scope and Contents

Contents

Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different task contexts. Contextual policy search offers data-efficient learning and generalization by explicitly conditioning the policy on a parametric context space. In this paper, we further structure the contextual policy representation. We propose to factor contexts into two components: target contexts that describe the task objectives, e.g. target position for throwing a ball; and environment contexts that characterize the environment, e.g. initial position or mass of the ball. Our key observation is that experience can be directly generalized over target contexts. We show that this can be easily exploited in contextual policy search algorithms. In particular, we apply factorization to a Bayesian optimization approach to contextual policy search both in sampling-based and active learning settings. Our simulation results show faster learning and better generalization in various robotic domains. See our supplementary video: https://youtu.be/MNTbBAOufDY....

Alternative Titles

Full title

Factored Contextual Policy Search with Bayesian Optimization

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_2216560088

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2216560088

Other Identifiers

E-ISSN

2331-8422

How to access this item