VoxParadox
Adversarial benchmark to evaluate language dominance on audio LLMs' paralinguistic understanding, with controlled linguistic-acoustic contradiction (2k clips, 10 tasks). Proposed PCLM, a novel layer fusion strategy improving baselines by 48% coupled with DPO.