Welcome!
Hi, I’m HyoJung.
I am a 4th year Ph.D. student in Computer Science at the University of Maryland, College Park (UMD). I am fortunate to be advised by Marine Carpuat and Jordan Boyd-Graber participating in the CLIP Lab. In previous years, I interned with Huda Khayrallah at Microsoft and Changhan Wang at Meta FAIR. I am expecting to graduate in early 2026.
I’m interested in Multilingual and Multimodal NLP and its evaluation method for tackling language barriers and even background gaps. Specifically, I have been working on machine translation and speech translation.
Before my Ph.D., I was a research engineer at Samsung Research (SR), the advanced R&D hub of Samsung Electronics. I mainly worked on simultaneous and offline speech/text translation in the Efficient Neural Machine Translation Team at SR. I completed my M.S. at KAIST.
For more details about me, here are my Curriculum Vitae and a short version (2 paged CV).
Selected Publications
VocADT - Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
HyoJung Han, Akiko Eriguchi, Haoran Xu, Hieu Hoang, Marine Carpuat and Huda Khayrallah.
ICLR 2025.
[arXiv][code][model]
SpeechQE: Estimating the Quality of Direct Speech Translation
HyoJung Han, Kevin Duh and Marine Carpuat.
EMNLP 2024. (main, long)
[arXiv][poster][video][code][model&dataset]
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception.
HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi and Changhan Wang.
ACL 2024. (main, long)
[arXiv][poster][video]
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation.
HyoJung Han, Jordan Boyd-Graber and Marine Carpuat.
EMNLP 2023. (main, long)
[arXiv][poster][video][dataset]
SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering.
HyoJung Han, Marine Carpuat and Jordan Boyd-Graber.
EMNLP 2022. (main, long)
[pdf][poster][video][dataset]
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement.
HyoJung Han*, Seokchan Ahn*, Yoonjung Choi, Insoo Chung, Sangha Kim and Kyunghyun Cho.
WMT 2021. ( * as same contribution).
[arXiv][video]
Talks
Models for Trustworthy Speech Translation - Using Tower
IST-Unbable Seminar, Nov 25th 2024 [slide]Models for Robust & Trustworthy Speech Translation - Expanding Modalities
KIT (Karlsruhe Institute of Technology), Nov 4th 2024 [slide]
News
- Jan/2025 - VocADT accepted at
ICLR 2025
- Nov/2024 - Invited to give a remote talk at IST-Unbable Seminar.
- Nov/2024 - Presented SpeechQE at EMNLP2024, Miami!
- Nov/2024 - Invited to give a remote talk at KIT.
- Oct/2024 - VocADT arXiv paper is out! Thanks to my manager and all the authors I worked with during my Microsoft internship.
- Sep/2024 - SpeechQE accepted at
EMNLP 2024 main conference
AGAINx2!
- Aug/2024 - Presented XLAVS-R at ACL2024, Bangkok
! Also, presented Automatic Explicitation at C3NLP@ACL
- May/2024 - Start research internship at Microsoft in Redmond!
- May/2024 - XLAVS-R accepted at
ACL 2024 main conference
- Mar/2024 - XLAVS-R arXiv paper is out! Thanks to my manager and all the authors I worked with during my Meta AI internship.
- Dec/2023 - Presented Automatic Explicitation at EMNLP2023, Singapore
- Oct/2023 - Automatic Explicitation accepted at
EMNLP 2023 main conference
AGAIN!
- May/2023 - Start Research Scientist Intern at Meta FAIR in New York City!
- Jan/2023 - I won UMD Graduated School’s Outstanding Graduate Assistant Award
All thanks to my advisors and CS department!
- Dec/2022 - Presented SimQA at EMNLP2022, Abu Dhabi
- Oct/2022 - SimQA accepted at
EMNLP 2022 main conference
!
- Sep/2021 - Paper accepted at WMT 2021.
- Aug/2021 - I am excited to start as a PhD student @UMD!
- Jan/2021 - Paper accepted at ICASSP 2021.
- Dec/2020 - Another arXiv paper is out
- Oct/2020 - I won Samsung Best Paper Awards 2020 as a first author! Thank you for the praise and prize
!
- July/2020 - Invited as panelists in IWSLT 2020 Simultaneous Speech Translation task Panel discussion. See y’all at 9th July 12:30 PST!
- July/2020 - Our submission achieved the best score
in the low-latency mode at Simultaneous Speech Translation task IWSLT 2020!
- May/2020 - Two papers accepted at IWSLT 2020.
- April/2020 - Gave my speech in PML4DC@ICLR2020.
- Jan/2020 - Paper accepted at ICASSP 2020.
HyoJung Han ← HouJeung Han
To avoid any confusion, I want to clarify that my alphabetical name has been officially changed to “HyoJung Han” from “HouJeung Han” since July 2020. (same Korean name “한효정”) After the Amendment to the Enforcement Decree of the Passport Act enables the revision, I decided to change it despite the mess afterward. The main reason is that there is an obvious discrepancy between the original pronunciation in Korean(close to HyoJung) and in the previous alphabetical name(HouJeung) as well as the resulting problems from small to critical one because of this.
I know most of my listed publications are in “HouJeung” but it’s me, “HyoJung”!