Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
peter65374 authored Sep 30, 2024
1 parent 04e4aa9 commit ce4ccf1
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,7 @@ To the best of our knowledge, the earliest end-to-end voice models originated fr
- AudioLM: Borsos et al. (2023) proposed a language modeling approach to audio generation.[More Info][3]
- SpeechGPT: Zhang et al. (2023) enhanced the cross-modal conversational capabilities of large language models.[More Info][4]
- SpeechFlow:Liu et al. (2024) introduced a speech generation pretraining method using flow matching. [More Info][5]
- SimpleSpeech2: Yang et al. (2024) proposed an efficient speech codec. [More Info][6]

[1]: https://arxiv.org/abs/2402.05755 "SpiRit-LM: Interleaved Spoken and Written Language Model"
[2]: https://arxiv.org/abs/2102.01192 "Generative Spoken Language Modeling from Raw Audio"
Expand Down

0 comments on commit ce4ccf1

Please sign in to comment.