Ph.D. Candidate, NTU EECS · Taipei, Taiwan
I am a Ph.D. student at National Taiwan University (NTU), advised by Hung-yi Lee and Jyh-Shing Roger Jang. Prior to this, I earned my M.S. and B.S. in Computer Science from NTU (2023) and Taiwan Tech (2020). Currently, my research focuses on speech tokenization, speech generation, and spoken language modeling.
Previously, my research spanned three threads: I led work on audio deepfakes (SingGraph, CodecFake+, SASTNet), contributed to large audio-language models and benchmarks (DeSTA 2.5-Audio, Dynamic-SUPERB Phase-2, Codec-SUPERB), and mentored research on LLMs for RAG efficiency and their inherent limits.
Feel free to reach out to me at d12942018 [at] ntu.edu.tw, or find me on LinkedIn, Scholar, GitHub, 𝕏.