Principle-driven self-alignment of language models from scratch with minimal human supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 136 | 2023 |
Grounding physical object and event concepts through dynamic visual reasoning Z Chen, J Mao, J Wu, KYK Wong, JB Tenenbaum, C Gan International Conference on Learning Representations, 2021 | 90* | 2021 |
Weakly-supervised spatio-temporally grounding natural sentence in video Z Chen, L Ma, W Luo, KYK Wong Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019 | 90 | 2019 |
Star: A benchmark for situated reasoning in real-world videos B Wu, S Yu, Z Chen, JB Tenenbaum, C Gan Thirty-fifth conference on neural information processing systems datasets …, 2021 | 89 | 2021 |
3d-llm: Injecting the 3d world into large language models Y Hong, H Zhen, P Chen, S Zheng, Y Du, Z Chen, C Gan Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 82* | 2023 |
Look closer to ground better: Weakly-supervised temporal grounding of sentence in video Z Chen, L Ma, W Luo, P Tang, KYK Wong arXiv preprint arXiv:2001.09308, 2020 | 66 | 2020 |
Dynamic visual reasoning by learning differentiable physics models from video and language M Ding, Z Chen, T Du, P Luo, J Tenenbaum, C Gan Advances In Neural Information Processing Systems 34, 887-899, 2021 | 56 | 2021 |
Ps-nerf: Neural inverse rendering for multi-view photometric stereo W Yang, G Chen, C Chen, Z Chen, KYK Wong European Conference on Computer Vision, 266-284, 2022 | 54 | 2022 |
Cops-ref: A new dataset and task on compositional referring expression comprehension Z Chen, P Wang, L Ma, KYK Wong, Q Wu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 54 | 2020 |
Planning with large language models for code generation S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan International Conference on Learning Representations, 2023 | 53 | 2023 |
The blessings of unlabeled background in untrimmed videos Y Liu, J Chen, Z Chen, B Deng, J Huang, H Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 40 | 2021 |
Comphy: Compositional physical reasoning of objects and events from videos Z Chen, K Yi, Y Li, M Ding, A Torralba, JB Tenenbaum, C Gan International Conference on Learning Representations, 2022 | 37 | 2022 |
Mod-squad: Designing mixtures of experts as modular multi-task learners Z Chen, Y Shen, M Ding, Z Chen, H Zhao, EG Learned-Miller, C Gan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 32 | 2023 |
S-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint W Yang, G Chen, C Chen, Z Chen, KYK Wong Advances in Neural Information Processing Systems 35, 1568-1582, 2022 | 27 | 2022 |
3d concept learning and reasoning from multi-view images Y Hong, C Lin, Y Du, Z Chen, JB Tenenbaum, C Gan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 26 | 2023 |
Salmon: Self-alignment with principle-following reward models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, D Cox, Y Yang, C Gan arXiv preprint arXiv:2310.05910, 2023 | 24 | 2023 |
Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning Z Chen, Q Zhou, Y Shen, Y Hong, Z Sun, D Gutfreund, C Gan AAAI Conference on Artificial Intelligence, 2024 | 21* | 2024 |
Embodied concept learner: Self-supervised learning of concepts and mapping through instruction following M Ding, Y Xu, Z Chen, DD Cox, P Luo, JB Tenenbaum, C Gan Conference on Robot Learning, 1743-1754, 2023 | 15 | 2023 |
Deep face video inpainting via UV mapping W Yang, Z Chen, C Chen, G Chen, KYK Wong IEEE Transactions on Image Processing 32, 1145-1157, 2023 | 10 | 2023 |
Moduleformer: Learning modular large language models from uncurated data Y Shen, Z Zhang, T Cao, S Tan, Z Chen, C Gan arXiv preprint arXiv:2306.04640, 2023 | 8 | 2023 |