Publications
2026
- DocPrune: Efficient Document Question Answering via Background, Question, and Comprehension-aware Token Pruning
  Joonmyung Choi, Sanghyeok Lee, Jongha Kim, Sehyung Kim, Dohwan Ko, Jihyung Kil, Hyunwoo J. Kim
  CVPR 2026
- AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
  Zheda Mai, Arpita Chowdhury, Zihe Wang, Sooyoung Jeon, Lemeng Wang, Jiacheng Hou, Jihyung Kil, Wei-Lun Chao
  CVPR 2026
- Spinning Straw into Gold: Relabeling LLM Agent Trajectories in Hindsight for Successful Demonstrations
  Zichao Li, Gang Wu, Zichao Wang, Ruiyi Zhang, Wanrong Zhu, Ryan A. Rossi, Vlad I Morariu, Jihyung Kil
  ICLR 2026
2025
- VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding
  Jian Chen, Ming Li, Jihyung Kil, Chenguang Wang, Tong Yu, Ryan Rossi, Tianyi Zhou, Changyou Chen, Ruiyi Zhang
  arXiv 2025
- Representation Shift: Unifying Token Compression with FlashAttention
  Joonmyung Choi*, Sanghyeok Lee*, Byungoh Ko, Eunseo Kim, Jihyung Kil, Hyunwoo J. Kim
  ICCV 2025
- GUI Agents: A Survey
  Dang Nguyen, Jian Chen, Yu Wang, Gang Wu, Namyong Park, Zhengmian Hu, Hanjia Lyu, Junda Wu, others
  ACL 2025
2024
- II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
  Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim
  ACL 2024