* activations.py contains a mapping from string to activation function * resolves some `gelu` vs `gelu_new` ambiguity