transformers/utils
NielsRogge 31ee80d556
Add LayoutLMv3 (#17060)
* Make forward pass work

* More improvements

* Remove unused imports

* Remove timm dependency

* Improve loss calculation of token classifier

* Fix most tests

* Add docs

* Add model integration test

* Make all tests pass

* Add LayoutLMv3FeatureExtractor

* Improve integration test + make fixup

* Add example script

* Fix style

* Add LayoutLMv3Processor

* Fix style

* Add option to add visual labels

* Make more tokenizer tests pass

* Fix more tests

* Make more tests pass

* Fix bug and improve docs

* Fix import of processors

* Improve docstrings

* Fix toctree and improve docs

* Fix auto tokenizer

* Move tests to model folder

* Move tests to model folder

* change default behavior add_prefix_space

* add prefix space for fast

* add_prefix_spcae set to True for Fast

* no space before `unique_no_split` token

* add test to hightligh special treatment of added tokens

* fix `test_batch_encode_dynamic_overflowing` by building a long enough example

* fix `test_full_tokenizer` with add_prefix_token

* Fix tokenizer integration test

* Make the code more readable

* Add tests for LayoutLMv3Processor

* Fix style

* Add model to README and update init

* Apply suggestions from code review

* Replace asserts by value errors

* Add suggestion by @ducviet00

* Add model to doc tests

* Simplify script

* Improve README

* a step ahead to fix

* Update pair_input_test

* Make all tokenizer tests pass - phew

* Make style

* Add LayoutLMv3 to CI job

* Fix auto mapping

* Fix CI job name

* Make all processor tests pass

* Make tests of LayoutLMv2 and LayoutXLM consistent

* Add copied from statements to fast tokenizer

* Add copied from statements to slow tokenizer

* Remove add_visual_labels attribute

* Fix tests

* Add link to notebooks

* Improve docs of LayoutLMv3Processor

* Fix reference to section

Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-24 09:53:45 +02:00
..
test_module Fix from_pretrained with default base_model_prefix (#15814) 2022-02-24 11:43:51 +01:00
tf_ops Check TF ops for ONNX compliance (#10025) 2021-02-15 07:55:10 -05:00
check_config_docstrings.py Add a check on config classes docstring checkpoints (#17012) 2022-04-30 10:40:46 +02:00
check_copies.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
check_dummies.py Fix dummy creation script (#17304) 2022-05-17 12:56:24 -04:00
check_inits.py Add MLFLOW_FLATTEN_PARAMS support in MLflowCallback (#17148) 2022-05-10 14:29:18 -04:00
check_repo.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
check_table.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
check_tf_ops.py Check TF ops for ONNX compliance (#10025) 2021-02-15 07:55:10 -05:00
custom_init_isort.py Fix custom init sorting script (#16864) 2022-04-20 17:05:39 -04:00
documentation_tests.txt Add LayoutLMv3 (#17060) 2022-05-24 09:53:45 +02:00
download_glue_data.py Raise exceptions instead of asserts (#13907) 2021-10-07 12:44:23 +05:30
get_modified_files.py Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
notification_service_doc_tests.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
notification_service.py Fix ci_url might be None (#17332) 2022-05-18 21:49:08 +02:00
prepare_for_doc_test.py [DocTests Speech] Add doc tests for all speech models (#15031) 2022-01-27 14:29:31 +01:00
print_env_pt.py [CI] move env print to util, add pt, nccl versions (#14607) 2021-12-03 08:18:36 -05:00
release.py Rename master to main for notebooks links and leftovers (#16397) 2022-03-25 09:12:23 -04:00
sort_auto_mappings.py Automatically sort auto mappings (#17250) 2022-05-16 13:24:20 -04:00
tests_fetcher.py Update self-push workflow (#17177) 2022-05-13 16:28:00 +02:00
update_metadata.py Replace commit sha by commit url for update jobs (#14852) 2021-12-21 11:17:11 -05:00