* First pass
* Make conversion script work
* Improve conversion script
* Fix bug, conversion script working
* Improve conversion script, implement BEiTFeatureExtractor
* Make conversion script work based on URL
* Improve conversion script
* Add tests, add documentation
* Fix bug in conversion script
* Fix another bug
* Add support for converting masked image modeling model
* Add support for converting masked image modeling
* Fix bug
* Add print statement for debugging
* Fix another bug
* Make conversion script finally work for masked image modeling models
* Move id2label for datasets to JSON files on the hub
* Make sure ids are read in as integers
* Add integration tests
* Make style & quality
* Fix test, add BEiT to README
* Apply suggestions from @sgugger's review
* Apply suggestions from code review
* Make quality
* Replace nielsr by microsoft in tests, add docs
* Rename BEiT to Beit
* Minor fix
* Fix docs of BeitForMaskedImageModeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Return raw outputs in TextClassificationPipeline
* Style
* Support for problem type
* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply Nicolas' comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
The help for `ModelArguments.gradient_checkpointing` should be
"If True, use gradient checkpointing to save memory
at the expense of slower backward pass."
not "Whether to freeze the feature extractor layers of the model."
(which is duplicated from the `freeze_feature_extractor` arg)
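A minimal sketch of what the corrected field could look like, assuming a dataclass-based `ModelArguments` like the ones in the examples scripts; the surrounding field, defaults, and layout here are illustrative, not the actual script.

```python
from dataclasses import dataclass, field


@dataclass
class ModelArguments:
    # Illustrative neighboring field; the real examples script has more arguments.
    freeze_feature_extractor: bool = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    # Corrected help text, no longer duplicated from `freeze_feature_extractor`.
    gradient_checkpointing: bool = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory at the expense of slower backward pass."
        },
    )
```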
* Update feature extraction pipeline.
* Leaving 1 small model for actual values check.
* Fixes tests
- Better support for tokenizer with no pad token
- Increasing PegasusModelTesterConfig for pipelines
- Tests of feature extraction are more permissive + don't test multimodal models + encoder-decoder.
* Fixing model loading with incorrect shape (+ model with HEAD).
* Update tests/test_pipelines_common.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Revert modeling_utils modification.
* Some corrections.
* Update tests/test_pipelines_common.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update tests/test_pipelines_feature_extraction.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Syntax.
* Fixing text-classification tests.
* Don't modify this file.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Raise an issue if the pytorch version is < 1.8.0
* Attempt to add a test to ensure it correctly raises.
* Missing docstring.
* Second attempt, patch with string absolute import.
* Let's do the call before checking it was called ...
* Use the correct function ... 🤦
* Raise ImportError and AssertionError respectively when unable to find torch and torch version is not sufficient.
* Correct path mock patching
* Relax constraint for torch_onnx_dict_inputs to ge instead of eq.
* Style.
* Split each version requirement for torch.
* Let's compare version directly.
* Import torch_version after checking pytorch is installed.
* @require_torch
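A minimal sketch of the version gate described in the commits above, assuming the ONNX export path and a 1.8.0 minimum; the function and constant names are illustrative, not the actual `transformers` code.

```python
from packaging.version import Version, parse

# Illustrative constant: dict inputs for torch.onnx.export are assumed to need torch >= 1.8.0.
TORCH_ONNX_DICT_INPUTS_MINIMUM_VERSION = Version("1.8.0")


def ensure_torch_onnx_requirements(minimum_version: Version = TORCH_ONNX_DICT_INPUTS_MINIMUM_VERSION):
    # Raise ImportError when torch is not installed at all ...
    try:
        import torch
    except ImportError:
        raise ImportError("Exporting to ONNX requires PyTorch to be installed.")

    # ... and AssertionError when the installed version is too old.
    # The version is only read after we know torch is importable, and the
    # comparison is ">=" (ge) rather than "==" (eq).
    torch_version = parse(torch.__version__)
    if not torch_version >= minimum_version:
        raise AssertionError(
            f"ONNX export with dict inputs requires torch >= {minimum_version}, got {torch_version}."
        )
```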
While `Iterable[Iterable[int]]` is a nicer annotation (it's covariant!), the defensive statements parsing out `bad_words_ids` in `__init__(...)` force the caller to pass in `List[List[int]]`. I've changed the annotation to make that clear.
Change `score` -> `scores` because the argument is not positional-only, so you need consistently named parameters for the subclasses. The subclasses appear to favor `scores` over `score`.
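A minimal sketch of both points, using a simplified processor class rather than the actual `transformers` generation code; the class name and body are illustrative.

```python
from typing import List

import torch


class BadWordsProcessorSketch:
    # The defensive checks only accept nested lists, so the annotation should
    # say List[List[int]] rather than the looser Iterable[Iterable[int]].
    def __init__(self, bad_words_ids: List[List[int]]):
        if not isinstance(bad_words_ids, list) or len(bad_words_ids) == 0:
            raise ValueError("`bad_words_ids` has to be a non-empty list")
        if any(not isinstance(word_ids, list) for word_ids in bad_words_ids):
            raise ValueError("`bad_words_ids` has to be a list of lists")
        self.bad_words_ids = bad_words_ids

    # Callers may pass this argument by keyword, so subclasses must agree on
    # the parameter name: `scores`, not `score`.
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        return scores
```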
* Fixed train_test_split test_size argument
* `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899)
* set max_length and num_beams only when non None
* fix instance variables
* fix code style
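A minimal sketch of the "only when non None" fallback mentioned above; the function name, defaults, and config values are illustrative, not the exact `Seq2SeqTrainer` code.

```python
def resolve_generation_kwargs(max_length=None, num_beams=None, config_max_length=20, config_num_beams=1):
    # Fall back to the model config values only when the caller did not pass a
    # value, so an explicit argument always wins over the config default.
    max_length = max_length if max_length is not None else config_max_length
    num_beams = num_beams if num_beams is not None else config_num_beams
    return {"max_length": max_length, "num_beams": num_beams}


# Example: only max_length is overridden; num_beams falls back to the config.
print(resolve_generation_kwargs(max_length=64))  # {'max_length': 64, 'num_beams': 1}
```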
* [FLAX] Minor fixes in CLM example (#12914)
* readme: fix retrieval of vocab size for flax clm example
* examples: fix flax clm example when using training/evaluation files
* Fix module path for symbolic_trace example
Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
* Better heuristic for token-classification pipeline.
Relooking at the problem actually makes things much simpler:
when we look at ids from a tokenizer, we have no way in **general**
to recover whether some substring is part of a word or not.
However, within the pipeline, with offsets we still have access to the
original string, so we can simply check whether the previous character
of a token (if it exists) is actually a space. This will obviously be
wrong for tokenizers that contain spaces within tokens, or tokenizers
whose offsets include spaces too (I don't think there are many).
This heuristic is hopefully fully backward compatible and can still
handle non-word-based tokenizers (see the sketch after this list of commits).
* Updating test with real values.
* We still need the older "correct" heuristic to prevent fusing
punctuation.
* Adding a real warning when important.
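A minimal sketch of the space-lookup heuristic described above; the function name and signature are illustrative, not the actual token-classification pipeline code.

```python
def starts_new_word(sentence: str, token_start: int) -> bool:
    """Guess whether the token whose offset begins at `token_start` starts a new word.

    With offsets we can look back at the original string: if the previous
    character is a space, the token starts a new word. This is wrong for
    tokenizers that keep spaces inside tokens or include them in the offsets.
    """
    if token_start == 0:
        return True
    return sentence[token_start - 1] == " "


# Example: "New York" with tokens at offsets 0 ("New") and 4 ("York").
print(starts_new_word("New York", 0))  # True
print(starts_new_word("New York", 4))  # True (previous character is a space)
```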