Model-free and model-based time-optimal control of a badminton robot

None, None; None, None; None, None; None, None; None, None

Model-free and model-based time-optimal control of a badminton robot

Conference Paper (2013)

Author(s)

M Liu (External organisation)

B Depraetere (External organisation)

G Pinte (External organisation)

I. Grondman (TU Delft - OLD Intelligent Control & Robotics)

R. Babuska (TU Delft - OLD Intelligent Control & Robotics)

Research Group

OLD Intelligent Control & Robotics

DOI related publication

https://doi.org/10.1109/ASCC.2013.6606242

Conf.proc. > 3 pag

To reference this document use:

https://resolver.tudelft.nl/uuid:d67bcf1b-48a6-4240-9501-a59dfa4fca5f

More Info

expand_more

Publication Year

2013

Language

English

Research Group

OLD Intelligent Control & Robotics

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. @en

Pages (from-to)

1-6

ISBN (print)

978-1-4673-5767-8

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this research, time optimal control is considered for the hit motion of a badminton robot during a serve operation. For this task the racket always starts at rest in a given position and has to move to a target state, defined by a target position and a non-zero target velocity. The goal is to complete this motion in as little time as possible, yet without violating bounds on the actuator. To find controllers satisfying these requirements, a reinforcement learning approach is implemented, using a Natural Actor-Critic (NAC) reinforcement learning algorithm. This approach is experimentally shown to yield the desired robot motions after about 200 trials. Next to this model-free learning approach, the control signals obtained with a model-based optimization are also applied to the robot. The results achieved with both approaches are compared, and a thorough analysis is presented, highlighting the properties of each approach, as well as their advantages and drawbacks.

Files

Model-free_and_model-based_tim... (pdf)

(pdf | 1.52 Mb)

- Embargo expired in 06-11-2024

License info not available