transformers/utils
NielsRogge 4ef0abb738
Add TAPEX (#16473)
* Add TapexTokenizer

* Improve docstrings and provide option to provide answer

* Remove option for pretokenized inputs

* Add TAPEX to README

* Fix copies

* Remove option for pretokenized inputs

* Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification.

* - Draft a README file for running the script and introducing some background.
- Remove unused code lines in tabfact script.
- Disable the default `pad_to_max_length` option, which is memory-consuming.

* * Support `as_target_tokenizer` function for TapexTokenizer.
* Fix the do_lower_case behaviour of TapexTokenizer.
* Add unit tests for target scenarios and cased/uncased scenarios for both source and target.

* * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function.
* Fix typos in tapex example README.

* * fix the evaluation script - remove the property `task_name`

* * Make the label space clearer for tabfact tasks

* * Using a new fine-tuning script for tapex-base on tabfact.

* * Remove the lowercasing code outside the tokenizer - the tokenizer's do_lower_case option now controls lowercasing
* Ensure the hyperparameters run without running out of memory on a 16GB card, and report the newly reproduced number on wikisql

* * Remove the default tokenizer_name option.
* Provide evaluation command.

* * Add support for the WikiTableQuestions dataset.

* Fix a typo in README.

* * Fix the dataset's key name in WikiTableQuestions

* Run make fixup and move test to folder

* Fix quality

* Apply suggestions from code review

* Apply suggestions from code review

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply some more suggestions from code review

* Improve docstrings

* Overwrite failing test

* Improve comment in example scripts

* Fix rebase

* Add TAPEX to Auto mapping

* Add TAPEX to auto config mappings

* Put TAPEX higher than BART in auto mapping

* Add TAPEX to doc tests

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: SivilTaram <qianlxc@outlook.com>
Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-04-08 10:57:51 +02:00
..
test_module Fix from_pretrained with default base_model_prefix (#15814) 2022-02-24 11:43:51 +01:00
tf_ops Check TF ops for ONNX compliance (#10025) 2021-02-15 07:55:10 -05:00
check_copies.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
check_dummies.py Reorganize file utils (#16264) 2022-03-23 10:26:33 -04:00
check_inits.py Ignore empty subfolders when identifying submodules (#15204) 2022-01-18 09:48:46 -05:00
check_repo.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
check_table.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
check_tf_ops.py Check TF ops for ONNX compliance (#10025) 2021-02-15 07:55:10 -05:00
custom_init_isort.py Sort init import (#10801) 2021-03-19 16:17:13 -04:00
documentation_tests.txt Add TAPEX (#16473) 2022-04-08 10:57:51 +02:00
download_glue_data.py Raise exceptions instead of asserts (#13907) 2021-10-07 12:44:23 +05:30
get_modified_files.py Updates the default branch from master to main (#16326) 2022-03-23 03:46:59 -04:00
notification_service_deprecated.py Fix the push run (#15807) 2022-02-24 19:30:17 +01:00
notification_service_doc_tests.py Add Slack notification support for doc tests (#16253) 2022-03-21 11:33:18 +01:00
notification_service.py Framework split model report (#15825) 2022-02-25 12:00:00 -05:00
prepare_for_doc_test.py [DocTests Speech] Add doc tests for all speech models (#15031) 2022-01-27 14:29:31 +01:00
print_env_pt.py [CI] move env print to util, add pt, nccl versions (#14607) 2021-12-03 08:18:36 -05:00
release.py Rename master to main for notebooks links and leftovers (#16397) 2022-03-25 09:12:23 -04:00
tests_fetcher.py Fix example test and test_fetcher for examples (#16478) 2022-03-29 12:21:19 -04:00
update_metadata.py Replace commit sha by commit url for update jobs (#14852) 2021-12-21 11:17:11 -05:00