Google & Columbia U’s Mnemosyne: Learning to Train Transformers With Transformers | Synced

In the new paper Mnemosyne: Learning to Train Transformers with Transformers, a research team from Google and Columbia University presents Mnemosyne Optimizer, a learning-to-learn system for traini...

By · · 1 min read

Source: Synced | AI Technology & Industry Review

In the new paper Mnemosyne: Learning to Train Transformers with Transformers, a research team from Google and Columbia University presents Mnemosyne Optimizer, a learning-to-learn system for training entire neural network architectures without any task-specific optimizer tuning.