transformers/docs/source/en
NielsRogge 31ee80d556
Add LayoutLMv3 (#17060)
* Make forward pass work

* More improvements

* Remove unused imports

* Remove timm dependency

* Improve loss calculation of token classifier

* Fix most tests

* Add docs

* Add model integration test

* Make all tests pass

* Add LayoutLMv3FeatureExtractor

* Improve integration test + make fixup

* Add example script

* Fix style

* Add LayoutLMv3Processor

* Fix style

* Add option to add visual labels

* Make more tokenizer tests pass

* Fix more tests

* Make more tests pass

* Fix bug and improve docs

* Fix import of processors

* Improve docstrings

* Fix toctree and improve docs

* Fix auto tokenizer

* Move tests to model folder

* Move tests to model folder

* change default behavior add_prefix_space

* add prefix space for fast

* add_prefix_space set to True for Fast

* no space before `unique_no_split` token

* add test to highlight special treatment of added tokens

* fix `test_batch_encode_dynamic_overflowing` by building a long enough example

* fix `test_full_tokenizer` with add_prefix_token

* Fix tokenizer integration test

* Make the code more readable

* Add tests for LayoutLMv3Processor

* Fix style

* Add model to README and update init

* Apply suggestions from code review

* Replace asserts by value errors

* Add suggestion by @ducviet00

* Add model to doc tests

* Simplify script

* Improve README

* a step ahead to fix

* Update pair_input_test

* Make all tokenizer tests pass - phew

* Make style

* Add LayoutLMv3 to CI job

* Fix auto mapping

* Fix CI job name

* Make all processor tests pass

* Make tests of LayoutLMv2 and LayoutXLM consistent

* Add copied from statements to fast tokenizer

* Add copied from statements to slow tokenizer

* Remove add_visual_labels attribute

* Fix tests

* Add link to notebooks

* Improve docs of LayoutLMv3Processor

* Fix reference to section

Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-24 09:53:45 +02:00
internal [WIP] Enable reproducibility for distributed trainings (#16907) 2022-05-11 09:37:13 -04:00
main_classes Use Accelerate in from_pretrained for big model inference (#17341) 2022-05-23 14:32:21 -04:00
model_doc Add LayoutLMv3 (#17060) 2022-05-24 09:53:45 +02:00
tasks Update audio examples with MInDS-14 (#16633) 2022-04-08 15:55:42 -05:00
_config.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
_toctree.yml Add LayoutLMv3 (#17060) 2022-05-24 09:53:45 +02:00
accelerate.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
add_new_model.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
add_new_pipeline.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
autoclass_tutorial.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
benchmarks.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
bertology.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
big_models.mdx Make Trainer compatible with sharded checkpoints (#17053) 2022-05-03 09:55:10 -04:00
community.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
contributing.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
converting_tensorflow_models.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
create_a_model.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
custom_models.mdx Update custom_models.mdx (#16964) 2022-04-27 16:46:55 +02:00
debugging.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
fast_tokenizers.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
glossary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
index.mdx Add LayoutLMv3 (#17060) 2022-05-24 09:53:45 +02:00
installation.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
migration.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
model_sharing.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
model_summary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
multilingual.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
notebooks.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
pad_truncation.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
perf_hardware.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perf_train_gpu_many.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perf_train_gpu_one.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
performance.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perplexity.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
philosophy.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
pipeline_tutorial.mdx docs(transformers): fix typo (#17263) 2022-05-16 17:04:30 -04:00
pr_checks.mdx Add a check on config classes docstring checkpoints (#17012) 2022-04-30 10:40:46 +02:00
preprocessing.mdx Fixing the output of code examples in the preprocessing chapter (#17162) 2022-05-10 12:16:28 -04:00
quicktour.mdx Fix doc test quicktour dataset (#16929) 2022-04-25 16:26:59 +02:00
run_scripts.mdx Fix all docs for accelerate install directions (#17145) 2022-05-09 15:45:18 -04:00
sagemaker.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
serialization.mdx Add CvT (#17299) 2022-05-18 17:47:18 +02:00
task_summary.mdx [Doctests] Correct task summary (#16644) 2022-04-11 14:59:35 +02:00
testing.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
tokenizer_summary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
training.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
troubleshooting.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00