2024

ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
Ju-Seung Byun*, Jiyun Chun*, Jihyung Kil, Andrew Perrault
EMNLP 2024
paper / code

II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim
ACL Findings 2024
paper

GPT-4V(ision) is a Generalist Web Agent, if Grounded
Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su
ICML 2024
paper / code / website

Dual-View Visual Contextualization for Web Navigation
Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao
CVPR 2024
paper / poster

2023

PreSTU: Pre-Training for Scene-Text Understanding
Jihyung Kil, Soravit Changpinyo, Xi Chen, Hexiang Hu, Sebastian Goodman, Wei-Lun Chao, Radu Soricut
ICCV 2023
paper / poster

2022

One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M Sadler, Wei-Lun Chao, Yu Su
CVPR 2022
paper / code / poster

2021

Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil, Cheng Zhang, Dong Xuan, Wei-Lun Chao
EMNLP 2021
paper / code / poster

Revisiting Document Representations for Large-Scale Zero-Shot Learning
Jihyung Kil, Wei-Lun Chao
NAACL 2021
paper / code / poster