H. Zhang, S. Bai, X. Lan, D. Hsu, and N. Zheng. Hindsight trust region policy optimization. In Proc. Int. Jnt. Conf. on Artificial Intelligence, 2021.
BibTeX  PDF

X. Ma, P. Karkus, D. Hsu, W.S. Lee, and N. Ye. Discriminative particle filter reinforcement learning for complex partial observations. In Proc. Int. Conf. on Learning Representations, 2020.
BibTeX  PDF

R. Pinsler, P. Karkus, A. Kupcsik, D. Hsu, and W.S. Lee. Factored contextual policy search with bayesian optimization. In Proc. IEEE Int. Conf. on Robotics & Automation, 2019.
BibTeX  PDF

A. Kupcsik, D. Hsu, and W.S. Lee. Learning dynamic robot-to-human object handover from human feedback. In Proc. Int. Symp. on Robotics Research, 2015.
BibTeX PDF

X.X. Wang, Y. Wang, D. Hsu, and Y. Wang. Exploration in interactive personalized music recommendation: A reinforcement learning approachACM Trans. on Multimedia Computing, Communications & Applications, 11(1), 2014.
BibTeX PDF

Y. Wang, K.S. Won, D. Hsu, and W.S. Lee. Monte Carlo Bayesian reinforcement learning. In Proc. Int. Conf. on Machine Learning, 2012.
BibTeX PDF (with supplementary material)