Jihyung Kil
Ph.D. Candidate
Computer Science
The Ohio State University
  kil.5@osu.edu
LinkedIn / Twitter / Google Scholar / GitHub
I am a Ph.D. candidate in Computer Science and Engineering at The Ohio State University, advised by Wei-Lun (Harry) Chao.
I am interested in machine learning and its applications to vision-language problems. Recently, I have focused on the following topics:
- Multimodal Foundation Models
- Multimodal Agents (Web Agents, Embodied AI)
- Visual Question Answering, Scene-Text Understanding
- Few/Zero-Shot Learning
🔥 I am on the job market in 2024. Please contact me if you are interested in my research. 🔥
Experience
Amazon Alexa AI           Research Intern May 2023 - Dec 2023
Google Research           Research Intern May 2022 - Nov 2022
The Ohio State University    Graduate Research Assistant Aug 2018 - Present
News
Mar, 2024 | I was selected for the Doctoral Consortium at CVPR 2024. |
---|---|
Feb, 2024 | Our Dual-VCR on Web Navigation accepted to CVPR 2024. |
Jan, 2024 | Our II-MMR on Visual Question Answering now available on arXiv. |
Jan, 2024 | Our SeeAct on Web Navigation now available on arXiv. |
Jul, 2023 | Our PreSTU on Scene-Text Understanding accepted to ICCV 2023. |
Mar, 2022 | Our M-Track on Vision and Language Navigation accepted to CVPR 2022. |
Dec, 2021 | Our team was selected to participate in the Amazon Alexa Prize SimBot Challenge. |
Aug, 2021 | Our SimpleAug on Visual Question Answering accepted to EMNLP 2021. |
Apr, 2021 | Our paper on Zero-Shot Learning accepted to NAACL 2021. |
Research
-
[preprint] II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering. arXiv 2024
-
CVPR