transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sanchit Gandhi	fde901877a	Freeze Feature Encoder in FlaxSpeechEncoderDecoder (#15997 ) * Freeze Feature Encoder in FlaxSpeechEncoderDecoder * add backprop test	2022-03-10 09:59:19 +01:00
Pavel Belevich	65f9653ed0	Fix warning message in ElectraForCausalLM (#16023 )	2022-03-09 17:27:15 -05:00
Suraj Patil	a69e185074	add doctests for bart like seq2seq models (#15987 ) * boom boom * enable doctest for few seq2seq models * add seq2seq models in documentation_tests.txt * fix docstring blenderbot * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix seq classif doc sample * don't check loss for seq classif examples * +IGNORE_OUTPUT => +IGNORE_RESULT * fix _SEQ_CLASS_EXPECTED_OUTPUT_SHAPE * fix some docs * more fixes * last fix (hopefully) * fix big bird gen example * fix mbart gen example Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 20:30:38 +01:00
Sanchit Gandhi	b256f3518d	Add FlaxBartForCausalLM (#15995 ) * add causal lm * add CausalLM tests * Add FlaxBartForCausalLM * Add EncoderDecoder model tests * change docstring * make repo-consistency * suggested changes * remove jax ops * correction * rename pre-trained decoder model	2022-03-09 19:53:01 +01:00
lewtun	50dd314d93	Add ONNX export for ViT (#15658 ) * Add ONNX support for ViT * Refactor to use generic preprocessor * Add vision dep to tests * Extend ONNX slow tests to ViT * Add dummy image generator * Use model_type to determine modality * Add deprecation warnings for tokenizer argument * Add warning when overwriting the preprocessor * Add optional args to docstrings * Add minimum PyTorch version to OnnxConfig * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case * Add reasonable value for default atol Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 17:36:59 +01:00
Yih-Dar	b7fa1e3dee	Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest (#15989 ) * Use tiny model for TFRembertEncoderDecoderModelTest.get_pretrained_model() Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 17:16:25 +01:00
Shotaro Ishihara	8feede229c	Fix broken code blocks in README.md (#15967 ) at transformers/examples/pytorch/contrastive-image-text	2022-03-09 17:07:52 +01:00
Francesco Saverio Zuppichini	1e8f37992f	done (#16012 )	2022-03-09 15:51:56 +01:00
Basile Van Hoorick	38bce1d4cf	Make `pos` optional to avoid crashing `PerceiverModel` operation (#15972 ) Updates `PerceiverAudioPreprocessor` `forward()` implementation to match most other preprocessors / postprocessors	2022-03-09 15:48:52 +01:00
Sylvain Gugger	cec89e1a0e	Simplify release utils (#15921 ) * Simplify release utils * Quality	2022-03-09 08:47:58 -05:00
Lysandre Debut	e493a3a5e2	Fix github actions comment (#16009 ) * Add issue number * Dev	2022-03-09 08:39:03 -05:00
Joao Gante	e7f34ccd4f	Swag example: Update doc format (#16014 )	2022-03-09 13:25:34 +00:00
Yih-Dar	3ea046995e	Removed an outdated check about hdf5_version (#16011 ) * removed an outdated check about hdf5_version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 14:21:23 +01:00
Patrick von Platen	c1aaa43935	[Doctests] Move doctests to new GPU & Fix bugs (#15969 ) * test * up * up * Empty test commit * up * update tests * up * fix some vision models * correct * correct docs * Trigger notification * finalize * check * correct quicktour * Apply suggestions from code review * improve doctests * Trigger Build * next try * next try * and again * Output current clone information * Output current clone information * Correct path * add tf round again * revert to daily job Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2022-03-09 13:09:56 +01:00
Nicolas Patry	f4e4ad34cc	Add `ForInstanceSegmentation` models to `image-segmentation` pipelines (#15937 ) * Adding ForInstanceSegmentation to pipelines. * Last fix `category_id` renamed to `label_id`. * Can't be none no more. * No `is_thing_map` anymore.	2022-03-09 10:19:05 +01:00
David Hall	5b7dcc7342	Seed _get_train_sampler's generator with arg seed to improve reproducibility (#15961 ) * Seed get_train_sampler's generator with arg seed to improve reproducibility and make the world_size<=1 code path more similar to the others * move test file into trainer test explicitly * dumb typo * make style lint happy * per discussion, switch to data_seed * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-08 13:45:41 -05:00
Joao Gante	70203b5937	TF generate refactor - past without encoder outputs (#15944 ) * Remove packed past from generation_tf_utils * update models with the new past format * update template accordingly	2022-03-08 14:46:44 +00:00
Joao Gante	62d847602a	Update TF multiple choice example (#15868 )	2022-03-08 13:16:34 +00:00
Patrick von Platen	ab2f8d12a7	add hf hub to env version command (#15981 )	2022-03-08 14:03:03 +01:00
Yih-Dar	72983303c5	Fix TFEncoderDecoderModelTest - Pytorch device (#15979 ) * fix device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-08 13:37:20 +01:00
Sylvain Gugger	f5a080dd10	Do a pull in case docs were updated during build (#15922 )	2022-03-08 07:19:41 -05:00
Yeb Havinga	91fb62d01c	Speedup training by using numpy instead of jnp for batch shuffling (#15963 ) Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>	2022-03-08 12:18:38 +01:00
Nicolas Patry	ea07064a5c	Returning outputs only when asked for for MaskFormer. (#15936 ) * Returning outputs only when asked for for MaskFormer. * Adding `output_auxiliary_logits` to the config.	2022-03-08 11:17:57 +01:00
NielsRogge	b19f3e69a0	[Tests] Fix ViTMAE integration test (#15949 ) * Fix test across both cpu and gpu * Fix typo	2022-03-08 10:49:44 +01:00
NielsRogge	9879a1d5f0	Fix LayoutLMv2 test (#15939 ) * Fix LayoutLMv2 test * Update black	2022-03-08 10:49:30 +01:00
Yih-Dar	8b9ae45549	Set scale_embedding to False in some TF tests (#15952 ) * set scale_embedding to False to avoid large (> 1e-5) output differences between PT/TF Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-07 22:14:33 +01:00
Steven Liu	38cc35069c	Update training scripts docs (#15931 ) * 📝 first draft * 🖍 apply feedback * 🖍 remove examples from toctree * 🗑 remove examples from docs/source	2022-03-07 13:29:14 -06:00
Sylvain Gugger	c87cfd653c	Better error message when inputs are empty	2022-03-07 13:29:16 -05:00
Francesco Saverio Zuppichini	e9fa7cd5d7	Make is_thing_map in Feature Extractor post_process_panoptic_segmentation defaults to all instances (#15954 ) * is_thing_map defaults to all instances * better naming * control flow * resolving conversations	2022-03-07 19:10:32 +01:00
Sanchit Gandhi	2596f95e84	Fix Embedding Module Bug in Flax Models (#15920 )	2022-03-07 18:17:45 +01:00
Sanchit Gandhi	1a62b25caf	Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder (#15938 ) * Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder * remove jnp.ndarray type suggestion * assert frozen grads are precisely zero	2022-03-07 18:10:15 +01:00
Konstantin Dobler	544fd9876b	Support modern list type hints in HfArgumentParser (#15951 ) * Support modern list type hint in HfArgumentParser * Fix formatting with black	2022-03-07 10:22:48 -05:00
Suraj Patil	60b81dfa6f	remove re-defination of FlaxWav2Vec2ForCTCModule (#15965 )	2022-03-07 14:58:44 +01:00
Chan Woo Kim	ef9c3ca348	[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555 ) * added the test and fix * had left out a comment	2022-03-07 09:10:18 +01:00
Francesco Saverio Zuppichini	9932ee4b4b	made MaskFormerModelTest faster (#15942 )	2022-03-04 19:11:48 +01:00
NielsRogge	e8efaecb87	Move dependency to call method (#15941 )	2022-03-04 18:53:54 +01:00
Chan Woo Kim	5c6f57ee75	Constrained Beam Search [With Disjunctive Decoding] (#15761 ) * added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements * finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing. * fixed bug found in constrained beam search that used beam_idx that were not global across all the batches * disjunctive constraint working 100% correctly * passing all tests * Accidentally included mlruns * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * complete overhaul of type complexities and other nits * strict type checks in generate() * fixing second round of feedback by narsil * fixed failing generation test because of type check overhaul * generation test fail fix * fixing test fails Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-04 18:18:34 +01:00
Francesco Saverio Zuppichini	040c11f6da	Tests for MaskFormerFeatureExtractor's post_process*** methods (#15929 ) * proper tests for post_process*** methods in feature extractor * mask th == 0 * Update tests/maskformer/test_feature_extraction_maskformer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-04 18:04:19 +01:00
Yih-Dar	f0aacc140b	Do not change the output from tuple to list - to match PT's version (#15918 ) * Do not change the output from tuple to list - to match PT's version * Fix the same issues for 5 other models and the template Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-04 17:50:24 +01:00
Patrick von Platen	10b76987fc	[FlaxT5 Example] fix flax t5 example pretraining (#15835 )	2022-03-04 17:04:43 +01:00
Javier de la Rosa	01485ceec3	Add missing support for Flax XLM-RoBERTa (#15900 ) * Adding Flax XLM-RoBERTa * Add Flax to __init__ * Adding doc and dummy objects * Add tests * Add Flax XLM-R models autodoc * Fix tests * Add Flask XLM-RoBERTa to TEST_FILES_WITH_NO_COMMON_TESTS * Update src/transformers/models/xlm_roberta/modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Update tests/xlm_roberta/test_modeling_flax_xlm_roberta.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * Remove test on large Flask XLM-RoBERTa * Add tokenizer to the test Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-03-04 14:36:28 +01:00
Nicolas Patry	89c7d9cfba	Making MaskFormerForInstanceSegmentation. (#15934 ) Small adjustments. Adding in type hint. Last fix ? Only include the default dict thing, not the pipelines.	2022-03-04 13:56:15 +01:00
Nicolas Patry	7ade7c1794	Updating the slow tests: (#15893 ) Linked to https://github.com/huggingface/transformers/pull/15826	2022-03-04 12:32:19 +01:00
ParkSangJun	6b104c5bb0	Support CLIPTokenizerFast for CLIPProcessor (#15913 ) * Fix to support fast tokenizer with `CLIPProcessor` * Update CLIPProcessor test for fast tokenizer * Fix Docstring Style * Rename into meaningful Variable name in test code	2022-03-04 11:57:09 +01:00
Sanchit Gandhi	b71474895d	Update README.md	2022-03-04 09:58:45 +01:00
Nicolas Patry	a6e3b17981	Re-enabling all fast pipeline tests. (#15924 )	2022-03-04 09:53:00 +01:00
Patrick von Platen	a7df656f03	Update README.md (#15926 )	2022-03-04 00:22:38 +01:00
davidleonfdez	c0281feb50	Fix #15898 (#15928 )	2022-03-03 14:41:03 -05:00
NielsRogge	9251427c38	Add vision models to doc tests (#15905 ) * Add vision models to doc tests * Apply suggestions from code review * Add more models Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-03 19:46:31 +01:00
Francesco Saverio Zuppichini	742273a52a	fix for the output from post_process_panoptic_segmentation (#15916 )	2022-03-03 19:35:48 +01:00

1 2 3 4 5 ...

9160 Commits