4 52 2

Chih-Kai Yang

zenyn

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

upvoted a collection 18 days ago

Awesome papers from 臺大李宏毅 (Hung-yi Lee)

upvoted a paper 19 days ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

View all activity

Organizations

upvoted a paper 14 days ago

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Paper • 2510.08047 • Published Oct 9 • 7

upvoted a collection 18 days ago

Awesome papers from 臺大李宏毅 (Hung-yi Lee)

Collection

Recent papers authored by Hung-yi Lee. Sorted by ID • 8 items • Updated 19 days ago • 17

upvoted a paper 19 days ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published 23 days ago • 19

commented a paper 19 days ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published 23 days ago • 19 •

upvoted a paper 19 days ago

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published 23 days ago • 17

commented a paper 19 days ago

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published 23 days ago • 17 •

authored 2 papers 21 days ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published 23 days ago • 19

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published 23 days ago • 17

upvoted 3 papers about 1 month ago

authored a paper 3 months ago

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

Paper • 2506.05140 • Published Jun 5

upvoted a paper 4 months ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3 • 18

authored 2 papers 4 months ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3 • 18

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

Paper • 2505.17496 • Published May 23 • 2

upvoted 3 papers 4 months ago

STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models

Paper • 2507.15375 • Published Jul 21 • 30

Mitigating Object Hallucinations via Sentence-Level Early Intervention

Paper • 2507.12455 • Published Jul 16 • 7

Einstein Fields: A Neural Perspective To Computational General Relativity

Paper • 2507.11589 • Published Jul 15 • 8

upvoted a collection 4 months ago

Evaluations of Large Audio-Language Models (LALMs)

Collection

This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 3

updated a collection 4 months ago

Evaluations of Large Audio-Language Models (LALMs)

Collection

This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17 • 3

Chih-Kai Yang

AI & ML interests

Recent Activity

Organizations

zenyn's activity