Paper Details
Abstract
Recent advances in multilingual machine translation have demonstrated remarkable potential for improving translation quality, particularly for low-resource languages. In this study, we conduct a comparative analysis of two prominent multilingual models, mBART-50 and No Language Left Behind (NLLB), both fine-tuned on the IWSLT2015 English-Vietnamese dataset. Specifically, we compare facebook/mbart-large-50-many-to-many-mmt with facebook/nllb-200-distilled-600M, using identical preprocessing, fine-tuning strategies, and evaluation metrics to ensure a fair comparison. Our experiments reveal that the NLLB model achieves a BLEU score of 35.81, outperforming mBART-50's score of 33.97 on the same test set, despite having a smaller parameter footprint. We analyze the strengths and limitations of each model, examining in particular their ability to handle domain-specific terminology and syntactic structures common in the TED talks domain. This study contributes to the understanding of how different multilingual architectures perform on low-resource language pairs and provides insights into selecting appropriate models for English-Vietnamese translation tasks. The source code, data, and fine-tuned models are publicly available: GitHub repository ([https://github.com/vuhuyng04/NMT_mBART-50_NLLB.git](https://github.com/vuhuyng04/NMT_mBART-50_NLLB.git)), fine-tuned mBART-50 model ([https://huggingface.co/nguyenvuhuy/en-vi-mbart50_TMG301](https://huggingface.co/nguyenvuhuy/en-vi-mbart50_TMG301)), and fine-tuned NLLB-200 model ([https://huggingface.co/nguyenvuhuy/en-vi-nllb-200_TMG301](https://huggingface.co/nguyenvuhuy/en-vi-nllb-200_TMG301)).
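
To illustrate how the released checkpoints can be used, the minimal sketch below loads each fine-tuned model from the Hugging Face Hub and translates one English sentence into Vietnamese. This is an assumption-laden example, not code from the paper's repository: it presumes the checkpoints keep the tokenizers and language codes of their base models (en_XX/vi_VN for mBART-50, eng_Latn/vie_Latn for NLLB-200), and the `translate` helper and example sentence are purely illustrative.

```python
# Hedged usage sketch (not from the paper's repository): load the released
# fine-tuned checkpoints and translate a single English sentence to Vietnamese.
# Assumes each checkpoint retains its base model's tokenizer and language codes.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

SENTENCE = "Machine translation has improved rapidly for low-resource languages."

def translate(model_name: str, src_lang: str, tgt_lang: str) -> str:
    tokenizer = AutoTokenizer.from_pretrained(model_name, src_lang=src_lang)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(SENTENCE, return_tensors="pt")
    # Force the decoder to start generation with the target-language token.
    generated = model.generate(
        **inputs,
        forced_bos_token_id=tokenizer.convert_tokens_to_ids(tgt_lang),
        max_length=128,
    )
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]

# Fine-tuned mBART-50 checkpoint (mBART-50 language codes).
print(translate("nguyenvuhuy/en-vi-mbart50_TMG301", "en_XX", "vi_VN"))
# Fine-tuned NLLB-200 checkpoint (FLORES-200 language codes).
print(translate("nguyenvuhuy/en-vi-nllb-200_TMG301", "eng_Latn", "vie_Latn"))
```

The same pattern extends to batch decoding of the IWSLT2015 test set, after which a corpus-level BLEU score can be computed with a standard toolkit such as sacreBLEU to reproduce the comparison reported above.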