How has DeepSeek improved the Transformer architecture? (epoch.ai)
3 points by h8hawk 17 hours ago | 0 comments