Xuanjun Chen (陳炫均)

Ph.D. Candidate, NTU EECS · Taipei, Taiwan

I am a Ph.D. student at National Taiwan University (NTU), advised by Hung-yi Lee and Jyh-Shing Roger Jang. Prior to this, I earned my M.S. and B.S. in Computer Science from NTU (2023) and Taiwan Tech (2020).

My recent research focus areas are speech tokenization, speech generation, and spoken language modeling. Previously, my research spanned three threads: leading work on audio deepfakes (SingGraph, CodecFake+, SASTNet), contributing to large audio-language models and benchmarks (DeSTA 2.5-Audio, Dynamic-SUPERB Phase-2, Codec-SUPERB), and mentoring research on LLMs for RAG efficiency and their inherent limits.

Feel free to reach out to me at d12942018 [at] ntu.edu.tw, or find me on LinkedIn, Scholar, GitHub, 𝕏.

Research Highlights * equal contribution

[1]

CodecFake+: Codec-Based Resynthesized Data as a Proxy for Detecting CodecFake Speech
Xuanjun Chen*, Jiawei Du*, Haibin Wu, Lin Zhang, I-Ming Lin, ..., Jyh-Shing Roger Jang, Hung-yi Lee

IEEE TASLP 2026 bib · arXiv · IEEE · Project · HF · Code

@ARTICLE{chen2026codecfake,
author={Chen, Xuanjun and Du, Jiawei and Wu, Haibin and Zhang, Lin and Lin, I-Ming and Chiu, I-Hsiang and Ren, Wenze and Tseng, Yuan and Tsao, Yu and Jang, Jyh-Shing Roger and Lee, Hung-yi},
journal={IEEE Transactions on Audio, Speech and Language Processing},
title={CodecFake+: Codec-Based Resynthesized Data as a Proxy for Detecting CodecFake Speech},
year={2026},
volume={34},
pages={2929--2944},
doi={10.1109/TASLPRO.2026.3692291}
}

[2]

Joint Fullband-Subband Modeling for High-Resolution SingFake Detection
Xuanjun Chen*, Chia-Yu Hu*, Sung-Feng Huang, Haibin Wu, Hung-yi Lee, Jyh-Shing Roger Jang

INTERSPEECH 2026 (Long Paper) bib · arXiv

@article{chen2026joint,
  title={Joint Fullband-Subband Modeling for High-Resolution SingFake Detection},
  author={Chen, Xuanjun and Hu, Chia-Yu and Huang, Sung-Feng and Wu, Haibin and Lee, Hung-yi and Jang, Jyh-Shing Roger},
  journal={arXiv preprint arXiv:2604.04841},
  year={2026}
}

[3]

How Does Instrumental Music Help SingFake Detection?
Xuanjun Chen, Chia-Yu Hu, I-Ming Lin, Yi-Cheng Lin, I-Hsiang Chiu, ..., Hung-yi Lee, Jyh-Shing Roger Jang

ICASSP 2026 bib · arXiv · IEEE

@misc{chen2025how,
    title={How Does Instrumental Music Help SingFake Detection?},
    author={Xuanjun Chen and Chia-Yu Hu and I-Ming Lin and Yi-Cheng Lin and I-Hsiang Chiu and You Zhang and Sung-Feng Huang and Yi-Hsuan Yang and Haibin Wu and Hung-yi Lee and Jyh-Shing Roger Jang},
    year={2025},
    eprint={2509.14675},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

[4]

CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning
Cheng-Yen Li*, Xuanjun Chen*, Claire Lin, Wei-Yu Chen, Wenhua Nie, Hung-yi Lee, Jyh-Shing Roger Jang

ACM Trans. Intell. Syst. Technol. (ACM TIST) bib · arXiv

@misc{cyli2026codarag,
    title={CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning},
    author={Cheng-Yen Li and Xuanjun Chen and Claire Lin and Wei-Yu Chen and Wenhua Nie and Hung-yi Lee and Jyh-Shing Roger Jang},
    year={2026},
    eprint={2604.10426},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

[5]

Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG
Wei-Chieh Chou*, Xuanjun Chen*, Jian-Ren Lin, Claire Lin, Hung-yi Lee, Jyh-Shing Roger Jang

COLM 2026 bib · arXiv

@misc{chou2026efficient,
    title={Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG},
    author={Wei-Chieh Chou and Xuanjun Chen and Jian-Ren Lin and Claire Lin and Hung-yi Lee and Jyh-Shing Roger Jang},
    year={2026},
    note={Submitted to COLM 2026}
}

[6]

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, ..., Xuanjun Chen, ..., Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi Lee

IEEE TASLP 2026 bib · arXiv · IEEE · Code

@ARTICLE{lu2026desta25,
author={Lu, Ke-Han and Chen, Zhehuai and Fu, Szu-Wei and Yang, Chao-Han Huck and Huang, Sung-Feng and Yang, Chih-Kai and Yu, Chee-En and Chen, Chun-Wei and Chen, Wei-Chih and Huang, Chien-yu and Lin, Yi-Cheng and Lin, Yu-Xiang and Fu, Chi-An and Kuan, Chun-Yi and Ren, Wenze and Chen, Xuanjun and Huang, Wei-Ping and Hu, En-Pei and Lin, Tzu-Quan and Wu, Yuan-Kuei and Huang, Kuan-Po and Huang, Hsiao-Ying and Chou, Huang-Cheng and Chang, Kai-Wei and Chiang, Cheng-Han and Ginsburg, Boris and Wang, Yu-Chiang Frank and Lee, Hung-yi},
journal={IEEE Transactions on Audio, Speech and Language Processing},
title={DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model With Self-Generated Cross-Modal Alignment},
year={2026},
volume={34},
pages={2062--2076},
doi={10.1109/TASLPRO.2026.3675792}
}

[7]

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Chien-yu Huang, Wei-Chih Chen, Shu-wen Yang, ..., Xuanjun Chen, ..., Shinji Watanabe, Hung-yi Lee

ICLR 2025 bib · arXiv · OpenReview · Code

@inproceedings{huang2025dynamic,
    title={Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the <br> Capabilities of Spoken Language Models with 180 Tasks},
    author={Chien-yu Huang and Wei-Chih Chen and Shu-wen Yang and Andy T. Liu and Chen-An Li and Yu-Xiang Lin and Wei-Cheng Tseng and Anuj Diwan and Yi-Jen Shih and Jiatong Shi and William Chen and Chih-Kai Yang and Wenze Ren and Xuanjun Chen and Chi-Yuan Hsiao and Puyuan Peng and Shih-Heng Wang and Chun-Yi Kuan and Ke-Han Lu and Kai-Wei Chang and Fabian Ritter-Gutierrez and Kuan-Po Huang and Siddhant Arora and You-Kuan Lin and Ming To Chuang and Eunjung Yeo and Kalvin Chang and Chung-Ming Chien and Kwanghee Choi and Jun-You Wang and Cheng-Hsiu Hsieh and Yi-Cheng Lin and Chee-En Yu and I-Hsiang Chiu and Heitor R. Guimarães and Jionghao Han and Tzu-Quan Lin and Tzu-Yuan Lin and Homu Chang and Ting-Wu Chang and Chun Wei Chen and Shou-Jen Chen and Yu-Hua Chen and Hsi-Chun Cheng and Kunal Dhawan and Jia-Lin Fang and Shi-Xin Fang and Kuan-Yu Fang Chiang and Chi An Fu and Hsien-Fu Hsiao and Ching Yu Hsu and Shao-Syuan Huang and Lee Chen Wei and Hsi-Che Lin and Hsuan-Hao Lin and Hsuan-Ting Lin and Jian-Ren Lin and Ting-Chun Liu and Li-Chun Lu and Tsung-Min Pai and Ankita Pasad and Shih-Yun Shan Kuan and Suwon Shon and Yuxun Tang and Yun-Shao Tsai and Jui-Chiang Wei and Tzu-Chieh Wei and Chengxi Wu and Dien-Ruei Wu and Chao-Han Huck Yang and Chieh-Chi Yang and Jia Qi Yip and Shao-Xiang Yuan and Vahid Noroozi and Zhehuai Chen and Haibin Wu and Karen Livescu and David Harwath and Shinji Watanabe and Hung-yi Lee},
    booktitle={International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=s7lzZpAW7T}
}

[8]

A Preliminary Study of RAG for Taiwanese Historical Archives
Claire Lin*, Bo-Han Feng*, Xuanjun Chen*, Te-Lun Yang, Hung-yi Lee, Jyh-Shing Roger Jang

ROCLING 2025 Best Paper Award bib · arXiv · Anthology

@inproceedings{lin2025preliminary,
    title = "A Preliminary Study of {RAG} for {T}aiwanese Historical Archives",
    author = "Lin, Claire  and Feng, Bo-Han  and Chen, Xuanjun  and Yang, Te-Lun  and Lee, Hung-yi  and Jang, Jyh-Shing Roger",
    booktitle = "Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)",
    year = "2025",
    url = "https://aclanthology.org/2025.rocling-main.6"
}

[9]

Towards Generalized Source Tracing for Codec-Based Deepfake Speech
Xuanjun Chen*, I-Ming Lin*, Lin Zhang, Haibin Wu, Hung-yi Lee, Jyh-Shing Roger Jang

IEEE ASRU 2025 Best Student Paper nominee bib · arXiv · Code

@inproceedings{chen2025towards,
    title={Towards Generalized Source Tracing for Codec-Based Deepfake Speech},
    author={Xuanjun Chen and I-Ming Lin and Lin Zhang and Haibin Wu and Hung-yi Lee and Jyh-Shing Roger Jang},
    booktitle={2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)},
    year={2025}
}

[10]

Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy
Xuanjun Chen*, I-Ming Lin*, Lin Zhang, Jiawei Du, Haibin Wu, Hung-yi Lee, Jyh-Shing Roger Jang

INTERSPEECH 2025 bib · arXiv · ISCA · Code

@inproceedings{chen2025codecbased,
    title={Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy},
    author={Xuanjun Chen and I-Ming Lin and Lin Zhang and Jiawei Du and Haibin Wu and Hung-yi Lee and Jyh-Shing Roger Jang},
    booktitle={Interspeech 2025},
    year={2025},
    url={https://www.isca-archive.org/interspeech_2025/chen25j_interspeech.pdf}
}

[11]

Singing Voice Graph Modeling for SingFake Detection
Xuanjun Chen, Haibin Wu, Jyh-Shing Roger Jang, Hung-yi Lee

INTERSPEECH 2024 (Oral) bib · arXiv · ISCA · Code · Lightning Talk

@inproceedings{chen2024singing,
    title={Singing Voice Graph Modeling for SingFake Detection},
    author={Xuanjun Chen and Haibin Wu and Jyh-Shing Roger Jang and Hung-yi Lee},
    booktitle={Interspeech 2024},
    year={2024},
    url={https://www.isca-archive.org/interspeech_2024/chen24o_interspeech.pdf}
}

[12]

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, ..., Xuanjun Chen, ..., Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee

Findings of ACL 2024 bib · arXiv · Anthology · Leaderboard · Code · HF

@inproceedings{wu-etal-2024-codec,
    title = "Codec-{SUPERB}: An In-Depth Analysis of Sound Codec Models",
    author = "Wu, Haibin  and Chung, Ho-Lam  and Lin, Yi-Cheng  and Wu, Yuan-Kuei  and Chen, Xuanjun  and Pai, Yu-Chi  and Wang, Hsiu-Hsuan  and Chang, Kai-Wei  and Liu, Alexander  and Lee, Hung-yi",
    editor = "Ku, Lun-Wei  and Martins, Andre  and Srikumar, Vivek",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2024",
    year = "2024",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-acl.616/",
    doi = "10.18653/v1/2024.findings-acl.616",
    pages = "10330--10348",
}

[13]

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural codec models
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Jiawei Du, Kai-Wei Chang, ..., Shinji Watanabe, Hung-yi Lee

IEEE SLT 2024 bib · arXiv · IEEE Xplore

@inproceedings{wu2024codecsuperbslt,
    title={Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural codec models},
    author={Haibin Wu and Xuanjun Chen and Yi-Cheng Lin and Jiawei Du and Kai-Wei Chang and Ke-Han Lu and Alexander Liu and Ho-Lam Chung and Yuan-Kuei Wu and Dongchao Yang and Songxiang Liu and Yi-Chiao Wu and Xu Tan and James Glass and Shinji Watanabe and Hung-yi Lee},
    booktitle={2024 IEEE Spoken Language Technology Workshop (SLT)},
    year={2024}
}

[14]

Towards audio language modeling-an overview
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-wei Chang, Ho-Lam Chung, Alexander Liu, Hung-yi Lee

Technical Report, Feb. 2024 bib · arXiv · Awesome

@misc{wu2024towards,
    title={Towards audio language modeling - an overview},
    author={Haibin Wu and Xuanjun Chen and Yi-Cheng Lin and Kai-wei Chang and Ho-Lam Chung and Alexander Liu and Hung-yi Lee},
    year={2024},
    eprint={2402.13236},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

Experience

Apr 2026 – Present04/2026 – Present: Research Intern, Shanda Group, Tokyo, Japan
Sep 2023 – Present09/2023 – Present: Ph.D. in Communication EngineeringCommunication Engineering, National Taiwan University · GPA 4.3/4.3
Jan 202301/2023: M.S. in Computer ScienceComputer Science, National Taiwan University · GPA 4.19/4.3
Jun 202006/2020: B.S. in Computer ScienceComputer Science, Taiwan Tech · GPA 4.11/4.3

Selected Honors & Service

Research and Academic Honors

2026: NTU Mr. Wen Tzu-Hsiang Memorial Scholarship One of nine recipients at NTU
2025: Best Student Paper nominee The IEEE Automatic Speech Recognition and Understanding Workshop
2025: Best Paper Award The 37th Conference on Computational Linguistics and Speech Processing
2024 – 2025: Student Travel Grants Google APAC 2024, ACLCLP 2025 & NSTC Subsidy 2025
2024 – 2025: CTCI Foundation Bursary & Research Scholarships Awarded by the CTCI Foundation for overseas students
2021: Ranked 3rd of 42 worldwide On the LA track of ASVspoof 2021 challenge
2020 – 2026: Kwong Tung Community Outstanding Student Scholarship Awarded across six academic years
2018 – 2020: Certificate of Achievement Top 5% in CSIE, Taiwan Tech, across three semesters
2017: 3rd Prize, SZIIT Academic Award Top 20% of students
2016: National Encouragement Scholarship Ministry of Education

International Event Organizer & Reviewer

2025: Co-Organizer, Responsible Speech & Audio Generative AI Special Session at IEEE ASRU 2025
2024: Technical Committee, Codec-SUPERB Challenge Special Session at IEEE SLT 2024
2023 – 2026: Admin Assistant, NVIDIA-NTU AI Joint Innovation Center Supported Industry-University Projects (Dir. Hung-yi Lee)
2023 – Present: Reviewer AAAI, ICML, NeurIPS, ACL, EMNLP, COLM, ICASSP, ICME, INTERSPEECH, ASRU, SLT, COLING