mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-27 00:09:00 +06:00
![]() * hidden layers, huh, what are they good for (absolutely nothing) * Some tests break with 1 hidden layer, use 2 * Use 1 hidden layer in a few slow models * Use num_hidden_layers=2 everywhere * Slightly higher tol for groupvit * Slightly higher tol for groupvit |
||
---|---|---|
.. | ||
__init__.py | ||
test_modeling_flax_gpt2.py | ||
test_modeling_gpt2.py | ||
test_modeling_tf_gpt2.py | ||
test_tokenization_gpt2_tf.py | ||
test_tokenization_gpt2.py |