Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
ykilcher, 1 year ago