Ph.D. Candidate, NTU EECS · Taipei, Taiwan
I am a Ph.D. student at National Taiwan University (NTU), advised by Hung-yi Lee and Jyh-Shing Roger Jang. Prior to this, I earned my M.S. and B.S. in Computer Science from NTU (2023) and Taiwan Tech (2020).
My recent research focus areas are speech tokenization, speech generation, and spoken language modeling. Previously, my research spanned three threads: leading work on audio deepfakes (SingGraph, CodecFake+, SASTNet), contributing to large audio-language models and benchmarks (DeSTA 2.5-Audio, Dynamic-SUPERB Phase-2, Codec-SUPERB), and mentoring research on LLMs for RAG efficiency and their inherent limits.
Feel free to reach out to me at d12942018 [at] ntu.edu.tw, or find me on LinkedIn, Scholar, GitHub, 𝕏.