Vai offline con l'app Player FM !
Better & Faster Large Language Models via Multi-token Prediction
Manage episode 415604503 series 3524393
Training language models to predict multiple future tokens at once improves sample efficiency, downstream capabilities, and inference speed without increasing training time, especially beneficial for larger models and generative tasks.
https://arxiv.org/abs//2404.19737
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1047 episodi
Manage episode 415604503 series 3524393
Training language models to predict multiple future tokens at once improves sample efficiency, downstream capabilities, and inference speed without increasing training time, especially beneficial for larger models and generative tasks.
https://arxiv.org/abs//2404.19737
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1047 episodi
Tutti gli episodi
×Benvenuto su Player FM!
Player FM ricerca sul web podcast di alta qualità che tu possa goderti adesso. È la migliore app di podcast e funziona su Android, iPhone e web. Registrati per sincronizzare le iscrizioni su tutti i tuoi dispositivi.