
Codec size is huge #4

Open
MonolithFoundation opened this issue Jan 21, 2025 · 2 comments

Comments

@MonolithFoundation

The current codec model is huge, about 14 GB, and that doesn't even count the LLM.

Is there any plan to release a smaller codec?

@zhenye234
Owner

The codec model itself is actually not that large: it is only about 3 GB, with fewer than 1B parameters: https://huggingface.co/HKUST-Audio/xcodec2/blob/main/pytorch_model.bin For inference, this file is all you need.

The larger file is a checkpoint that also includes the full training state, making it suitable for continued training: https://huggingface.co/HKUST-Audio/xcodec2/blob/main/ckpt/epoch%3D4-step%3D1400000.ckpt Hope this clears things up!
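For anyone curious why the full checkpoint is so much bigger, here is a minimal sketch of the general idea: a Lightning-style `.ckpt` stores optimizer state (e.g. Adam moment tensors) alongside the model weights, so keeping only the `state_dict` shrinks the file considerably. The key names and tensor shapes below are illustrative stand-ins, not the actual xcodec2 layout.

```python
import os
import tempfile

import torch

# Illustrative only: build a toy "full" checkpoint that mimics a
# Lightning-style .ckpt, which carries optimizer state (here, Adam-like
# moment tensors) alongside the model weights. Key names are made up.
weights = {"layer.weight": torch.randn(256, 256)}
full_ckpt = {
    "state_dict": weights,
    "optimizer_states": [{"exp_avg": torch.randn(256, 256),
                          "exp_avg_sq": torch.randn(256, 256)}],
    "epoch": 4,
}

with tempfile.TemporaryDirectory() as tmp:
    full_path = os.path.join(tmp, "full.ckpt")
    slim_path = os.path.join(tmp, "pytorch_model.bin")
    torch.save(full_ckpt, full_path)

    # Keep only the weights needed for inference and re-save.
    loaded = torch.load(full_path, map_location="cpu")
    torch.save(loaded["state_dict"], slim_path)

    full_size = os.path.getsize(full_path)
    slim_size = os.path.getsize(slim_path)

print(f"full: {full_size} bytes, slim: {slim_size} bytes")
```

With two optimizer moment tensors per weight tensor, the full file ends up roughly three times the size of the weights-only file, which matches the 14 GB vs. ~3 GB gap described above in spirit (the exact ratio depends on the optimizer and what else the checkpoint stores).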

@MonolithFoundation
Author

Thank you! Would it be possible to make the 3 GB model even smaller while still maintaining its quality?
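One common way to shrink a checkpoint's on-disk size, sketched below under the assumption that the weights are stored in float32, is to cast them to float16 before saving, which roughly halves the file. Whether the codec's quality survives this for xcodec2 would need to be evaluated; the tensor name here is a hypothetical stand-in.

```python
import os
import tempfile

import torch

# Illustrative: halve on-disk size by casting float32 weights to float16.
# "layer.weight" is a stand-in name, not an actual xcodec2 parameter.
state = {"layer.weight": torch.randn(512, 512)}
half = {k: (v.half() if v.is_floating_point() else v)
        for k, v in state.items()}

with tempfile.TemporaryDirectory() as tmp:
    p32 = os.path.join(tmp, "fp32.bin")
    p16 = os.path.join(tmp, "fp16.bin")
    torch.save(state, p32)
    torch.save(half, p16)
    s32 = os.path.getsize(p32)
    s16 = os.path.getsize(p16)

print(f"fp32: {s32} bytes, fp16: {s16} bytes")
```

More aggressive options such as int8 quantization or distilling a smaller codec would reduce size further, at a greater risk to reconstruction quality.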
