Szymon Sidor
Szymon Sidor
Verified email at mit.edu - Homepage
TitleCited byYear
Evolution strategies as a scalable alternative to reinforcement learning
T Salimans, J Ho, X Chen, S Sidor, I Sutskever
arXiv preprint arXiv:1703.03864, 2017
3682017
Openai baselines
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
GitHub, GitHub repository, 2017
2652017
Parameter space noise for exploration
M Plappert, R Houthooft, P Dhariwal, S Sidor, RY Chen, X Chen, T Asfour, ...
arXiv preprint arXiv:1706.01905, 2017
1612017
Learning dexterous in-hand manipulation
M Andrychowicz, B Baker, M Chociej, R Jozefowicz, B McGrew, ...
arXiv preprint arXiv:1808.00177, 2018
1252018
Schema networks: Zero-shot transfer with a generative causal model of intuitive physics
K Kansky, T Silver, DA Mély, M Eldawy, M Lázaro-Gredilla, X Lou, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1042017
Emergent complexity via multi-agent competition
T Bansal, J Pachocki, S Sidor, I Sutskever, I Mordatch
arXiv preprint arXiv:1710.03748, 2017
852017
Stable baselines
A Hill, A Raffin, M Ernestus, A Gleave, R Traore, P Dhariwal, C Hesse, ...
GitHub repository, 2018
342018
UCB exploration via Q-ensembles
RY Chen, S Sidor, P Abbeel, J Schulman
arXiv preprint arXiv:1706.01502, 2017
162017
Openai baselines
C Hesse, M Plappert, A Radford, J Schulman, S Sidor, Y Wu
142017
Openai baselines (2017)
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
URL https://github. com/opfenai/baselines, 0
11
UCB and infogain exploration via q-ensembles
RY Chen, J Schulman, P Abbeel, S Sidor
arXiv preprint arXiv:1706.01502 9, 2017
92017
Integrating multi-purpose natural language understanding, robot’s memory, and symbolic planning for task execution in humanoid robots
M Wächter, E Ovchinnikova, V Wittenbeck, P Kaiser, S Szedmak, ...
Robotics and Autonomous Systems 99, 148-165, 2018
62018
Measuring offensive speech in online political discourse
R Nithyanand, B Schaffner, P Gill
7th {USENIX} Workshop on Free and Open Communications on the Internet ({FOCI …, 2017
52017
Time resource networks
S Sidor, P Yu, C Fang, B Williams
arXiv preprint arXiv:1602.03203, 2016
12016
Occam's gates
J Raiman, S Sidor
arXiv preprint arXiv:1506.08251, 2015
12015
Reinforcement learning with natural language signals
S Sidor
Massachusetts Institute of Technology, 2016
2016
Learning Dexterous In-Hand Manipulation
R Józefowicz, B McGrew, J Pachocki, A Petron, M Plappert, G Powell, ...
The system can't perform the operation now. Try again later.
Articles 1–17