Jongmin Lee
Verified email at berkeley.edu
Title
Cited by
Year
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
J Lee, W Jeon, BJ Lee, J Pineau, KE Kim
ICML, 2021
Cited by: 74, Year: 2021
Monte-Carlo Tree Search for Constrained POMDPs
J Lee, GH Kim, P Poupart, KE Kim
NeurIPS, 2018
Cited by: 65, Year: 2018
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations
GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim
International Conference on Learning Representations (ICLR), 2022
Cited by: 60, Year: 2022
Multi-view automatic lip-reading using neural network
D Lee, J Lee, KE Kim
Computer Vision–ACCV 2016 Workshops: ACCV 2016 International Workshops …, 2017
Cited by: 60, Year: 2017
Representation balancing offline model-based reinforcement learning
BJ Lee, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2021
Cited by: 47, Year: 2021
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
Y Jang, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2022
Cited by: 39, Year: 2022
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
J Lee, C Paduraru, DJ Mankowitz, N Heess, D Precup, KE Kim, A Guez
International Conference on Learning Representations (ICLR), 2022
Cited by: 25, Year: 2022
Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients
J Lee, W Jeon, GH Kim, KE Kim
AAAI, 2020
Cited by: 20, Year: 2020
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
Y Jang, J Lee, KE Kim
AAAI, 2020
Cited by: 19, Year: 2020
Hierarchically-partitioned Gaussian Process Approximation
BJ Lee, J Lee, KE Kim
Artificial Intelligence and Statistics (AISTATS), 822-831, 2017
Cited by: 18, Year: 2017
Reinforcement Learning for Control with Multiple Frequencies
J Lee, BJ Lee, KE Kim
Advances in Neural Information Processing Systems (NeurIPS) 33, 2020
Cited by: 16, Year: 2020
Batch Reinforcement Learning with Hyperparameter Gradients
BJ Lee, J Lee, P Vrancx, D Kim, KE Kim
ICML, 2020
Cited by: 16, Year: 2020
PyOpenDial: a python-based domain-independent toolkit for developing spoken dialogue systems with probabilistic rules
Y Jang, J Lee, J Park, KH Lee, P Lison, KE Kim
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
Cited by: 11, Year: 2019
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming
J Lee, Y Jang, P Poupart, KE Kim
IJCAI, 2088-2095, 2017
Cited by: 10, Year: 2017
Monte-Carlo Planning and Learning with Language Action Value Estimates
Y Jang, S Seo, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2021
Cited by: 8, Year: 2021
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation
GH Kim, J Lee, Y Jang, H Yang, KE Kim
Advances in Neural Information Processing Systems (NeurIPS), 2022
Cited by: 7, Year: 2022
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
H Lee, J Lee, Y Choi, W Jeon, BJ Lee, YK Noh, KE Kim
Advances in Neural Information Processing Systems (NeurIPS), 2022
Cited by: 3, Year: 2022
Trust Region Sequential Variational Inference
GH Kim, Y Jang, J Lee, W Jeon, H Yang, KE Kim
Asian Conference on Machine Learning (ACML), 1033-1048, 2019
Cited by: 2, Year: 2019
Bayesian Reinforcement Learning with Behavioral Feedback
T Hong, J Lee, KE Kim, PA Ortega, DD Lee
IJCAI, 1571-1577, 2016
Cited by: 2, Year: 2016
Tempo Adaptation in Non-stationary Reinforcement Learning
H Lee, Y Ding, J Lee, M Jin, J Lavaei, S Sojoudi
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
Cited by: 1, Year: 2023
Articles 1–20