Neural audio codecs are foundational to speech language models. It is expected to have a low frame rate and decoupled semantic and acoustic information. A lower frame rate codec can reduce the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results