Granite Time Series Models Collection A collection of time series models trained by IBM licensed under Apache 2.0 license. • 9 items • Updated 4 days ago • 47
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25