* [modelcard] Set model name if empty
* no magic
Co-authored-by: Sylvain Gugger <sylvain@huggingface.co>
Co-authored-by: Sylvain Gugger <sylvain@huggingface.co>
* add minimal working gpt2 tokenizer
* graph mode and output equivalence tests working
* not today tensorflow. serialization test passing!
* fix style, documentation, docstrings and all that jazz
* passing consistency checks
* move keras nlp to tf dependencies
* fix tf modeling utils and gpt2 attention to enable compiling
* fix (I hope) keras nlp dependencies
* rever changes on generation
* remove debug prints
* remove redundant tf dummy objects
* add from config, get config and max length settings to address review
* let flake ignore the error on distillation you are welcome
* test from config
* add padding test
* address sgugger review
* Add Donut image processor
* Update src/transformers/image_transforms.py
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>
* Fix docstrings
* Full var names in docstring
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>
* First draft
* Fix backwards compatibility
* More fixes
* More fixes
* Make backbone more general
* Improve backbone
* Improve test
* Fix config checkpoint
* Address comments
* Use model_type
* Address more comments
* Fix special model names
* Remove MaskFormerSwinModel and MaskFormerSwinPreTrainedModel from main init
* Fix typo
* Update backbone
* Apply suggestion
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
* Add hidden states and attentions to backbone outputs
* Update ResNet
* Fix more tests
* Debug test
* Fix test_determinism
* Fix test_save_load
* Remove file
* Disable fx tests
* Test
* Add fx support for backbones
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
* INtegrate safetensos in weight offloading
* Use safetensors checkpoint for offload when available
* Make naming consistent
* Make load faster
* Quality
* Add default
* Changed assert into 7-8 exceptions
* updated syntax error
* updated error
* updated file (Co-autho: Batese2001)
* Successful test on test_modeling_distilbert.py
Successful raising errors and exceptions on the revised code in test_modeling_distilbert.py .
Co-credit: @batese2001
* Delete test_modeling_distilbert.ipynb
* Update modeling_distilbert.py
* Successful raising of exceptions with the conditions that are contrary to defined condition that asserts statements (Co-author: Batese2001)
* Successful raising of exceptions with the conditions that are contrary to defined condition that asserts statements (Co-author: Batese2001)
* committing the reformatted distilbert model
* reformatted distilbert model
* reformatted distilbert model
* reformatted distilbert model
* reformatted distilbert model with black
* Changed comments that explain better about raising exceptions for not having the even number of multi heads
* Changed comments that explain better about raising exceptions for not having the even number of multi heads
* changed based on the feedback
* Changed line 833 based on the suggestion made from @younesbelkada
* Changed line 833 based on the suggestion made from @younesbelkada draft2
* reformatted file
* Update src/transformers/models/distilbert/modeling_distilbert.py
* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* Add a test to ensure int dummy inputs are int64
* Move the test into the existing int64 test and update a lot of existing dummies
* Fix remaining dummies
* Fix remaining dummies
* Test for int64 serving sigs as well
* Update core tests to use tf.int64
* Add better messages to the assertions
* Update all serving sigs to int64
* More sneaky hiding tf.int32s
* Add an optional int32 signature in save_pretrained
* make fixup
* Add Amy's suggestions
* Switch all serving sigs back to tf.int32
* Switch all dummies to tf.int32
* Adjust tests to check for tf.int32 instead of tf.int64
* Fix base dummy_inputs dtype
* Start casting to tf.int32 in input_processing
* Change dtype for unpack_inputs test
* Add proper tf.int32 test
* Make the alternate serving signature int64
* change the way sentinel tokens can retrived
* Fix line length for doc string
* Fix line length for doc string
* Add more stronger test for t5 tokenization
* Format file changes
* Make a stronger test for filtering sentinel tokens
* fix file format issues