transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-17 03:28:22 +06:00

Author	SHA1	Message	Date
Stas Bekman	86b40073e9	[doc] post-porting (#14890 ) found a few oddities: 1. https://huggingface.co/docs/transformers/main_classes/logging#transformers.utils.logging.enable_explicit_format has a :: - this PR fixes it 2. this looks borked too: https://huggingface.co/docs/transformers/main_classes/logging#transformers.utils.logging.set_verbosity has a < but I'm not sure where this one is coming from	2021-12-23 10:19:34 -08:00
Anton Lozhkov	ee55ea692b	Update diarization and WavLM tolerances (#14902 )	2021-12-23 19:53:56 +03:00
Patrick von Platen	ef47d4f848	[AutoTokenizer] Fix incorrect from pretrained (#14900 )	2021-12-23 17:22:33 +01:00
Yih-Dar	8f2cc1c3ab	Add TFCLIPModel (#13967 ) * Start the work for TFCLIPModel * Convert to TF code (TODO: loss + doc) * Clean up * Fix pooled_output for TFCLIPTextTransformer - using tf.gather_nd * assert -> raise error * Expose TFCLIPModel * Deal with dummy_inputs * Add tests * Fix all tests. TODO: manual check weight loading + add more comments * Fix pt tf equivalence test * fixes * update TFCLIPVisionEmbeddings's Conv2D * Fix loss + overwrite test_pt_tf_model_equivalence from common * Add a comment about the change about MainLayer in test_keras_save_load * Set return_loss=True in TFCLIPModelTester + make tests pass * overwrite test_pt_tf_model_equivalence from tf common * fix base_model_prefix * Fix examples * remove unused * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply review suggestions * change self.pre_layrnorm to self.pre_layernorm * apply more review suggestions * return attention probs before dropout (to align with PT) * fix weight init * fix * build doc * fix missing doc * fix for test Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-12-23 11:19:44 -05:00
Yang Dong	2d30443cd3	Set `run_name` in MLflowCallback (#14894 ) * Set run_name in MLflowCallback * Update the docs for `run_name` argument	2021-12-23 10:53:33 -05:00
Leandro von Werra	1d651868d6	add custom stopping criteria to human eval script (#14897 )	2021-12-23 14:59:11 +01:00
lewtun	6b655cc63f	Add ONNX support for MarianMT models (#14586 ) * First commit to add MarianMT to ONNX * Now MarianModel.forward() automatically generates decoder_input_ids, like BartModel.forward() * Adjusted MarianOnnxConfig.inputs and outputs to work with seq2seq-lm feature * Style fix * Added support for other features for already supported models * Partial support for causal and seq2seq models * Partial support for causal and seq2seq models * Add default task for MarianMT ONNX * Remove automatic creation of decoder_input_ids * Extend inputs and outputs for MarianMT ONNX config * Add MarianMT to ONNX unit tests * Refactor * OnnxSeq2SeqConfigWithPast to support seq2seq models * Parameterized the onnx tests * Restored run_mlm.py * Restored run_mlm.py * [WIP] BART update * BART and MBART * Add past_key_values and fix dummy decoder inputs Using a sequence length of 1 in generate_dummy_outputs() produces large discrepancies, presumably due to some hidden optimisations. * Refactor MarianOnnxConfig to remove custom past_key_values logic * Fix quality * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)" This reverts commit `0f4e39c559`. * is_torch_available test to avoid failing imports * sorting parameterize parameters to solve ERROR gw0 gw1 * tests fix * tests fix * GPT2 with past fix * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially * Removed onnx file * Refactor Marian export to account for base changes * Fix copies * Implemented suggestions * Extend support for causal LM * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)" This reverts commit `0f4e39c559`. * is_torch_available test to avoid failing imports * sorting parameterize parameters to solve ERROR gw0 gw1 * tests fix * tests fix * GPT2 with past fix * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially * Removed onnx file * Implemented suggestions * Fixed __init__ to resolve conflict with master * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)" This reverts commit `0f4e39c559`. * is_torch_available test to avoid failing imports * sorting parameterize parameters to solve ERROR gw0 gw1 * tests fix * tests fix * GPT2 with past fix * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially * Removed onnx file * Implemented suggestions * Fixed __init__ to resolve conflict with master * Remove commented import * Remove ONNX model * Remove redundant class method * Tidy up imports * Fix quality * Refactor dummy input function * Add copied from statements to Marian config functions * Remove false copied from comments * Fix copy from comment Co-authored-by: Massimiliano Bruni <massimiliano.bruni@hcl.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2021-12-23 13:35:56 +01:00
Henrik Holm	6a7b9da2ae	Add 'with torch.no_grad()' to integration test forward pass (#14808 )	2021-12-23 04:23:39 -05:00
Alex Hedges	d8c09c6541	Fix AttributeError from PreTrainedTokenizerFast.decoder (#14691 )	2021-12-23 04:19:25 -05:00
Yih-Dar	4210579522	Fix doc examples: ... takes no keyword arguments (#14701 ) * Fix doc examples: ... takes no keyword arguments * fix copies Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-12-23 04:07:21 -05:00
lewtun	355dc0ce67	Fix installation instructions for BART ONNX example (#14885 )	2021-12-23 04:05:32 -05:00
Sylvain Gugger	207594be81	Convert rst files (#14888 ) * Convert all tutorials and guides * Convert all remaining rst to mdx * Track and fix bad links	2021-12-22 16:14:35 -05:00
Matt	b0c7d2ec58	Keras metric callback (#14867 ) * Working on splitting out labels * First working version * Fixed concatenation of outputs and labels * val_dataset -> eval_dataset * Only pass input arrays in tokenizer.model_input_names * Only pass input arrays in tokenizer.model_input_names * Only remove unexpected keys when predict_with_generate is True * Adding proper docstring * Adding example to docstring * Add a proper ROUGE metric example * Add a proper ROUGE metric example * Add version checking * Update src/transformers/keras_callbacks.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Remove requirement for tokenizer with predict_with_generate Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-12-22 20:35:39 +00:00
Patrick von Platen	fa39ff9fc4	Docs for v4.16.0dev0	2021-12-22 20:39:44 +01:00
Patrick von Platen	05fa1a7ac1	Release: v4.15.0	2021-12-22 18:43:15 +01:00
Sylvain Gugger	87a033d9fa	Properly indent return block (#14887 )	2021-12-22 12:28:45 -05:00
Michael Benayoun	13504dcbea	Onnx enable tasks for supported models (part 2) (#14700 ) * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)" This reverts commit `0f4e39c559`. * is_torch_available test to avoid failing imports * sorting parameterize parameters to solve ERROR gw0 gw1 * tests fix * tests fix * GPT2 with past fix * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially * Removed onnx file * Implemented suggestions * Fixed __init__ to resolve conflict with master * Remove commented import	2021-12-22 14:43:11 +01:00
Mario Šaško	1045a36c1f	Fix pytorch image classification example (#14883 ) * Update example * Remove skip in tests	2021-12-22 14:42:19 +01:00
NielsRogge	7df4b90c76	Fix Perceiver docs (#14879 )	2021-12-22 14:18:03 +01:00
Sylvain Gugger	e37bc579fc	Fix typo in error message	2021-12-22 08:19:36 -05:00
charon____	17efc806b4	IterableDatasetShard should use per device batch size instead of real batch size (#14714 )	2021-12-22 07:52:07 -05:00
guillaume-be	2a56edb321	Updated deberta attention (#14625 ) * Removed unused p2p attention handling * Updated DeBERTa configuration * Updated TF DeBERTa attention * Rolled back accidental comment deletion Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-12-22 07:36:08 -05:00
Ryokan RI	824fd44fc3	Feature/fix slow test in mluke (#14749 ) * make MLukeTokenizerTest fast * make LukeTokenizerTest fast * add entry to _toctree.yaml	2021-12-22 06:35:59 -05:00
SaulLu	c94c1b8967	update the arguments `add_prefix_space` and `trim_offsets` in `backend_tokenizer.post_processor` of `RobertaTokenizerFast` (#14752 ) * add tests * change post-processor, pre-tokenizer and decoder (can't update decoder) * update test (remove decoder which doesn't depend on trim and add_prefix) * just update the post_processor * fix change * `trim_offsets` has no influence on `pre_tokenizer` * remove a test that need some input from the `tokenizers` lib maintainers * format * add new test offsets roberta * polish comments	2021-12-22 10:51:55 +01:00
Lysandre Debut	ec3567fe20	Convert model files from rst to mdx (#14865 ) * First pass * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-12-22 03:27:30 -05:00
Sylvain Gugger	d0422de563	Fix doc mistakes (#14874 ) * Remove double returns * Last fixes * Quality * Last fix for Lxmert	2021-12-21 18:54:41 -05:00
Sylvain Gugger	e846a56ca4	Fix `FlaxMarianMTModel` return block. (#14873 ) * Fixes in marian doc * Another time * Add return block in FlaxMarianMTModel	2021-12-21 17:57:37 -05:00
Sylvain Gugger	a6b7b47a39	Fixes in marian doc (#14872 ) * Fixes in marian doc * Another time	2021-12-21 17:17:02 -05:00
Mishig Davaadorj	eec9c8bbd7	Fix FLAX_MULTIPLE_CHOICE_SAMPLE typo (#14871 )	2021-12-21 16:54:10 -05:00
Sylvain Gugger	e51c7b5872	Skip failing test	2021-12-21 15:15:17 -05:00
Sylvain Gugger	27b3031de2	Mass conversion of documentation from rst to Markdown (#14866 ) * Convert docstrings of all configurations and tokenizers * Processors and fixes * Last modeling files and fixes to models * Pipeline modules * Utils files * Data submodule * All the other files * Style * Missing examples * Style again * Fix copies * Say bye bye to rst docstrings forever	2021-12-21 15:06:33 -05:00
Stas Bekman	185876392c	[doc porting] several docs (#14858 ) * [doc porting] 2 docs * [doc porting] 2 docs * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/main_classes/deepspeed.mdx * cleanup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-12-21 09:55:25 -08:00
Stas Bekman	033c3ed95a	[examples/summarization] deal with None in data records (#14816 ) * [examples/summarization] deal with None in data records * rewrite to use a simpler (slower) variant	2021-12-21 09:17:28 -08:00
Sylvain Gugger	c075fb7855	Replace commit sha by commit url for update jobs (#14852 ) * Replace commit sha by commit url for update jobs * Typo * Update .github/workflows/build_documentation.yml Co-authored-by: Julien Chaumond <julien@huggingface.co> * Apply review comments Co-authored-by: Julien Chaumond <julien@huggingface.co>	2021-12-21 11:17:11 -05:00
Leandro von Werra	5722d05831	Add custom `stopping_criteria` and `logits_processor` to `generate` (#14779 ) * add custom `stopping_criteria` and `logits_processor` to `generate` * add tests for custom `stopping_criteria` and `logits_processor` * fix typo in RAG * address reviewer comments * improve custom logits processor/stopping criteria error message * fix types in merge function signature * change default for custom list from `None` to empty list * fix rag generate * add string split suggestion Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-12-21 16:47:41 +01:00
Zed	0062058399	Fix the value error typo of AdamW's betas' valid values checking (#14780 ) * Fix the value error typo of AdamW's betas value check * error fixed	2021-12-21 09:44:09 -05:00
Patrick von Platen	7ae6f07004	[ASR example] Improve example + add more examples (#14848 ) * up * load up * up	2021-12-21 13:12:22 +01:00
Sylvain Gugger	97ec17f73b	Only create the model card on process 0 (#14857 )	2021-12-21 06:34:47 -05:00
Patrick von Platen	b513ec8bbd	[Bart] better error message (#14854 )	2021-12-21 11:57:42 +01:00
Sylvain Gugger	7af80f6618	Convert docstrings of modeling files (#14850 ) * Convert file_utils docstrings to Markdown * Test on BERT * Return block indent * Temporarily disable doc styler * Remove from quality checks as well * Remove doc styler mess * Remove check from circleCI * Fix typo * Convert file_utils docstrings to Markdown * Test on BERT * Return block indent * Temporarily disable doc styler * Remove from quality checks as well * Remove doc styler mess * Remove check from circleCI * Fix typo * Let's go on all other model files * Add templates too * Styling and quality	2021-12-21 05:37:32 -05:00
Sylvain Gugger	2a33734606	Make the onnx submodule init lazy (#14855 ) * Use lazy init for onnx submodule * Remove debug statements	2021-12-21 03:11:25 -05:00
Stas Bekman	b6ec956976	[logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS (#14669 ) * [logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS * reword	2021-12-20 20:48:38 -08:00
Stas Bekman	c1125dc2ba	[doc] typo (#14849 ) fix small typo	2021-12-20 12:20:21 -05:00
Sylvain Gugger	33f36c869f	Add a main_input_name attribute to all models (#14803 ) * Add a main_input_name attribute to all models * Fix tests * Wtf Vs Code? * Update src/transformers/models/imagegpt/modeling_imagegpt.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Style * Fix copies Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-12-20 11:19:08 -05:00
Henrik Holm	0940e9b242	Add 'with torch.no_grad()' to integration test forward pass (#14820 )	2021-12-20 09:28:17 -05:00
Henrik Holm	b37cf7dee4	Add 'with torch.no_grad()' to integration test forward pass (#14821 )	2021-12-20 09:25:34 -05:00
Patrick von Platen	952a77b05d	[Perceiver] Skip multi-gpu tests for now (#14813 ) * [Perceiver] Skip multi-gpu tests for now * Update tests/test_modeling_perceiver.py * up * up	2021-12-20 15:22:50 +01:00
Derek Chia	8a818c26cb	Fix dead link to benchmarks.ipynb (#14842 ) Notebook has been updated here https://github.com/huggingface/notebooks/tree/master/examples/benchmark.ipynb	2021-12-20 09:08:05 -05:00
Kamal Raj	1b0ca7d270	Update CONTRIBUTING.md (#14835 ) fix cmd typo	2021-12-20 08:42:03 -05:00
Chang Lan	1531b31978	Add an argument to set bucket_cap_mb for PyTorch DDP (#14756 ) * [trainer] Set bucket_cap_mb for DDP from arguments * Put find_unused_parameters into kwargs	2021-12-20 08:41:40 -05:00

... 4 5 6 7 8 ...

8821 Commits