Vai offline con l'app Player FM !
[QA] Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models
Manage episode 415008604 series 3524393
Transformers can use meaningless filler tokens to solve tasks, but learning to use them is challenging. Additional tokens can provide computational benefits independently of token choice.
https://arxiv.org/abs//2404.15758
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1049 episodi
Manage episode 415008604 series 3524393
Transformers can use meaningless filler tokens to solve tasks, but learning to use them is challenging. Additional tokens can provide computational benefits independently of token choice.
https://arxiv.org/abs//2404.15758
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1049 episodi
Все серии
×Benvenuto su Player FM!
Player FM ricerca sul web podcast di alta qualità che tu possa goderti adesso. È la migliore app di podcast e funziona su Android, iPhone e web. Registrati per sincronizzare le iscrizioni su tutti i tuoi dispositivi.