transformers/docs/source/en
NielsRogge 836921fdeb
Add UDOP (#22940)
* First draft

* More improvements

* More improvements

* More fixes

* Fix copies

* More improvements

* More fixes

* More improvements

* Convert checkpoint

* More improvements, set up tests

* Fix more tests

* Add UdopModel

* More improvements

* Fix equivalence test

* More fixes

* Redesign model

* Extend conversion script

* Use real inputs for conversion script

* Add image processor

* Improve conversion script

* Add UdopTokenizer

* Add fast tokenizer

* Add converter

* Update README's

* Add processor

* Add fully fledged tokenizer

* Add fast tokenizer

* Use processor in conversion script

* Add tokenizer tests

* Fix one more test

* Fix more tests

* Fix tokenizer tests

* Enable fast tokenizer tests

* Fix more tests

* Fix additional_special_tokens of fast tokenizer

* Fix tokenizer tests

* Fix more tests

* Fix equivalence test

* Rename image to pixel_values

* Rename seg_data to bbox

* More renamings

* Remove vis_special_token

* More improvements

* Add docs

* Fix copied from

* Update slow tokenizer

* Update fast tokenizer design

* Make text input optional

* Add first draft of processor tests

* Fix more processor tests

* Fix decoder_start_token_id

* Fix test_initialization

* Add integration test

* More improvements

* Improve processor, add test

* Add more copied from

* Add more copied from

* Add more copied from

* Add more copied from

* Remove print statement

* Update README and auto mapping

* Delete files

* Delete another file

* Remove code

* Fix test

* Fix docs

* Remove asserts

* Add doc tests

* Include UDOP in exotic model tests

* Add expected tesseract decodings

* Add sentencepiece

* Use same design as T5

* Add UdopEncoderModel

* Add UdopEncoderModel to tests

* More fixes

* Fix fast tokenizer

* Fix one more test

* Remove parallelisable attribute

* Fix copies

* Remove legacy file

* Copy from T5Tokenizer

* Fix rebase

* More fixes, copy from T5

* More fixes

* Fix init

* Use ArthurZ/udop for tests

* Make all model tests pass

* Remove UdopForConditionalGeneration from auto mapping

* Fix more tests

* fixups

* more fixups

* fix the tokenizers

* remove un-necessary changes

* nits

* nits

* replace truncate_sequences_boxes with truncate_sequences for fix-copies

* nit current path

* add a test for input ids

* ids that we should get taken from c9f7a32f57

* nits converting

* nits

* apply ruff

* nits

* nits

* style

* fix slow order of addition

* fix udop fast range as well

* fixup

* nits

* Add docstrings

* Fix gradient checkpointing

* Update code examples

* Skip tests

* Update integration test

* Address comment

* Make fixup

* Remove extra ids from tokenizer

* Skip test

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update year

* Address comment

* Address more comments

* Address comments

* Add copied from

* Update CI

* Rename script

* Update model id

* Add AddedToken, skip tests

* Update CI

* Fix doc tests

* Do not use Tesseract for the doc tests

* Remove kwargs

* Add original inputs

* Update casting

* Fix doc test

* Update question

* Update question

* Use LayoutLMv3ImageProcessor

* Update organization

* Improve docs

* Update forward signature

* Make images optional

* Remove deprecated device argument

* Add comment, add add_prefix_space

* More improvements

* Remove kwargs

---------

Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-03-04 18:49:02 +01:00
..
internal Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
main_classes Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
model_doc Add UDOP (#22940) 2024-03-04 18:49:02 +01:00
tasks Starcoder2 model - bis (#29215) 2024-02-28 01:24:34 +01:00
_config.py [Styling] stylify using ruff (#27144) 2023-11-16 17:43:19 +01:00
_redirects.yml Extended semantic segmentation to image segmentation (#27039) 2023-11-23 15:58:21 +00:00
_toctree.yml Add UDOP (#22940) 2024-03-04 18:49:02 +01:00
accelerate.md Fix typos (#25936) 2023-09-04 11:15:12 +01:00
add_new_model.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
add_new_pipeline.md Fix broken link on page (#28451) 2024-01-11 09:26:13 -08:00
add_tensorflow_model.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
attention.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
autoclass_tutorial.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
benchmarks.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
bertology.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
big_models.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
chat_templating.md Add chat support to text generation pipeline (#28945) 2024-02-16 16:41:01 +00:00
community.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
contributing.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
create_a_model.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
custom_models.md [Docs] Add language identifiers to fenced code blocks (#28955) 2024-02-12 10:48:31 -08:00
custom_tools.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
debugging.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
deepspeed.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
fast_tokenizers.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
fsdp.md [docs] Trainer docs (#28145) 2023-12-20 10:37:23 -08:00
generation_strategies.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
glossary.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
hf_quantizer.md [CI] Quantization workflow (#29046) 2024-02-28 10:09:25 -05:00
hpo_train.md Remove-auth-token (#27060) 2023-11-13 14:20:54 +01:00
index.md Add UDOP (#22940) 2024-03-04 18:49:02 +01:00
installation.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
llm_tutorial_optimization.md F.scaled_dot_product_attention support (#26572) 2023-12-09 05:38:14 +09:00
llm_tutorial.md Generate: All logits processors are documented and have examples (#27796) 2023-12-07 15:11:35 +00:00
model_memory_anatomy.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
model_sharing.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
model_summary.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
multilingual.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
notebooks.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
pad_truncation.md [Doc] Spanish translation of pad_truncation.md (#27890) 2023-12-08 10:32:18 -08:00
peft.md [Peft] modules_to_save support for peft integration (#27466) 2023-11-14 10:32:57 +01:00
perf_hardware.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
perf_infer_cpu.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
perf_infer_gpu_one.md Starcoder2 model - bis (#29215) 2024-02-28 01:24:34 +01:00
perf_torch_compile.md Fix rendering for torch.compile() docs (#25432) 2023-08-10 13:25:00 +02:00
perf_train_cpu_many.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
perf_train_cpu.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
perf_train_gpu_many.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
perf_train_gpu_one.md Fix torch.compile with fullgraph=True when attention_mask input is used (#29211) 2024-02-22 16:40:06 +01:00
perf_train_special.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
perf_train_tpu_tf.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
performance.md [docs] Update CPU/GPU inference docs (#26881) 2023-10-31 09:44:51 -07:00
perplexity.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
philosophy.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
pipeline_tutorial.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
pipeline_webserver.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
pr_checks.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
preprocessing.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
quantization.md Add training version check for AQLM quantizer. (#29142) 2024-02-21 17:09:36 +01:00
quicktour.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
run_scripts.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
sagemaker.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
serialization.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
task_summary.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tasks_explained.md [docs] Spanish translation of tasks_explained.md (#29224) 2024-02-26 08:18:15 -08:00
testing.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
tf_xla.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tflite.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tokenizer_summary.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
torchscript.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
trainer.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
training.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
transformers_agents.md [doc] Always call it Agents for consistency (#25958) 2023-09-05 12:27:20 +01:00
troubleshooting.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00