AI & ML interests

Designated by the National Science Foundation (NSF) in 2020, IFML develops the key foundational tools for the next decade of AI innovation.

Sunny111ย 
posted an update about 1 month ago
view post
Post
1619
Are you familiar with reverse residual connections or looping in language models?

Excited to share my Looped-GPT blog post and codebase ๐Ÿš€
https://github.com/sanyalsunny111/Looped-GPT

TL;DR: looping during pre-training improves generalization.

Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens

P.S. This is my first post here โ€” I have ~4 followers and zero expectations for reach ๐Ÿ˜„
ยท