Follow
Roopal Garg
Roopal Garg
Staff Software Engineer @ Google Research
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
9752024
Davidsonian scene graph: Improving reliability in fine-grained evaluation for text-image generation
J Cho, Y Hu, R Garg, P Anderson, R Krishna, J Baldridge, M Bansal, ...
arXiv preprint arXiv:2310.18235, 2023
662023
Docci: Descriptions of connected and contrasting images
Y Onoe, S Rane, Z Berger, Y Bitton, J Cho, R Garg, A Ku, Z Parekh, ...
European Conference on Computer Vision, 291-309, 2024
312024
ImageInWords: Unlocking Hyper-Detailed Image Descriptions
R Garg, A Burns, BK Ayan, Y Bitton, C Montgomery, Y Onoe, A Bunner, ...
arXiv preprint arXiv:2405.02793, 2024
172024
Imagen 3
J Baldridge, J Bauer, M Bhutani, N Brichtova, A Bunner, K Chan, Y Chen, ...
arXiv preprint arXiv:2408.07009, 2024
162024
Mismatch quest: Visual and textual feedback for image-text misalignment
B Gordon, Y Bitton, Y Shafir, R Garg, X Chen, D Lischinski, D Cohen-Or, ...
European Conference on Computer Vision, 310-328, 2024
62024
Greedy growing enables high-resolution pixel-based diffusion models
CN Vasconcelos, A Rashwan, A Waters, T Walker, K Xu, J Yan, R Qian, ...
Transactions on Machine Learning Research, 2024
22024
Automated classification of network-accessible content based on events
R Garg
US Patent 10,504,145, 2019
12019
The system can't perform the operation now. Try again later.
Articles 1–8