Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Updated Oct 16, 2025
Deep Learning for Speech
Code for the AAAI 2025 paper “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”.
FastLongSpeech is a framework that extends Large Speech-Language Models to efficient long-speech processing without requiring dedicated long-speech training data.