Ph.D. Candidate, NTU EECS · Taipei, Taiwan
I am a Ph.D. student at National Taiwan University (NTU), advised by Hung-yi Lee and Jyh-Shing Roger Jang. Currently, my research focuses on speech tokenization, speech generation, and spoken language modeling. Before my Ph.D., I earned my M.S. and B.S. in Computer Science from NTU (2023) and Taiwan Tech (2020).
Previously, my research spanned three threads: I led work on audio deepfakes (SingGraph, CodecFake+, SASTNet), contributed to large audio-language models and benchmarks (DeSTA 2.5-Audio, Dynamic-SUPERB Phase-2, Codec-SUPERB), and mentored research on LLMs for RAG efficiency and their inherent limits.
Feel free to reach out to me at d12942018 [at] ntu.edu.tw, or find me on LinkedIn, Scholar, GitHub, 𝕏.