transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	19597998f6	Don't compute metrics in LM examples on TPU (#16029 )	2022-03-10 07:44:51 -05:00
Sylvain Gugger	10591399d6	Build the doc in a seperate folder then move it (#16020 ) * Build the doc in a seperate folder then move it * Allow job * Is this it? * Dislike comments? * Copy instead of move * Removing version built * Typos * No variable * Take _versions.yml into account * Finish main job and add dev job * Forgot the run * Fix syntax error * Execute builder from the repo * Typo	2022-03-10 07:44:29 -05:00
Yih-Dar	2f463effb3	Fix TFDebertaV2ConvLayer in TFDebertaV2Model (#16031 ) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-10 12:23:46 +01:00
Sanchit Gandhi	1da84ae02c	Fix Bug in Flax-Speech-Encoder-Decoder Test (#16041 ) * Fix Bug in Flax-Speech-Encoder-Decoder Test * change thresholds for CPU precision	2022-03-10 12:09:29 +01:00
Suraj Patil	b2a1c994cb	[README] fix url for Preprocessing tutorial (#16042 )	2022-03-10 12:09:05 +01:00
NielsRogge	8d83ebdf18	[Tests] Add attentions_option to ModelTesterMixin (#15909 ) * Add attentions_option to common tester * Fix tests, apply suggestion * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-10 12:00:30 +01:00
Patrick von Platen	6ce11c2c0f	[Docs] Improve PyTorch, Flax generate API (#15988 ) * Move generate docs * up * Update docs/source/_toctree.yml * correct * correct some stuff * correct tests * more fixes * finish generate * add to doc stest * finish * finalize * add warning to generate method	2022-03-10 11:54:45 +01:00
André Storhaug	0951d31788	Fix dependency error message in ServeCommand (#16033 ) "uvicorn" is misspelled as "unicorn".	2022-03-10 11:35:26 +01:00
NielsRogge	0835119bf3	Add Document Image Transformer (DiT) (#15984 ) * Add conversion script * Improve script * Fix bug * Add option to push to hub * Add support for classification models * Update model name * Upload feature extractor files first * Remove hash checking * Fix config * Add id2label * Add import * Fix id2label file name * Fix expected shape * Add model to README * Improve docs * Add integration test and fix CI * Fix code style * Add missing init * Add model to SPECIAL_MODULE_TO_TEST_MAP Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-10 11:34:44 +01:00
Sanchit Gandhi	6c9010ef63	Update README.md	2022-03-10 10:20:37 +01:00
Sanchit Gandhi	fde901877a	Freeze Feature Encoder in FlaxSpeechEncoderDecoder (#15997 ) * Freeze Feature Encoder in FlaxSpeechEncoderDecoder * add backprop test	2022-03-10 09:59:19 +01:00
Pavel Belevich	65f9653ed0	Fix warning message in ElectraForCausalLM (#16023 )	2022-03-09 17:27:15 -05:00
Suraj Patil	a69e185074	add doctests for bart like seq2seq models (#15987 ) * boom boom * enable doctest for few seq2seq models * add seq2seq models in documentation_tests.txt * fix docstring blenderbot * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix seq classif doc sample * don't check loss for seq classif examples * +IGNORE_OUTPUT => +IGNORE_RESULT * fix _SEQ_CLASS_EXPECTED_OUTPUT_SHAPE * fix some docs * more fixes * last fix (hopefully) * fix big bird gen example * fix mbart gen example Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 20:30:38 +01:00
Sanchit Gandhi	b256f3518d	Add FlaxBartForCausalLM (#15995 ) * add causal lm * add CausalLM tests * Add FlaxBartForCausalLM * Add EncoderDecoder model tests * change docstring * make repo-consistency * suggested changes * remove jax ops * correction * rename pre-trained decoder model	2022-03-09 19:53:01 +01:00
lewtun	50dd314d93	Add ONNX export for ViT (#15658 ) * Add ONNX support for ViT * Refactor to use generic preprocessor * Add vision dep to tests * Extend ONNX slow tests to ViT * Add dummy image generator * Use model_type to determine modality * Add deprecation warnings for tokenizer argument * Add warning when overwriting the preprocessor * Add optional args to docstrings * Add minimum PyTorch version to OnnxConfig * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case * Add reasonable value for default atol Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 17:36:59 +01:00
Yih-Dar	b7fa1e3dee	Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest (#15989 ) * Use tiny model for TFRembertEncoderDecoderModelTest.get_pretrained_model() Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 17:16:25 +01:00
Shotaro Ishihara	8feede229c	Fix broken code blocks in README.md (#15967 ) at transformers/examples/pytorch/contrastive-image-text	2022-03-09 17:07:52 +01:00
Francesco Saverio Zuppichini	1e8f37992f	done (#16012 )	2022-03-09 15:51:56 +01:00
Basile Van Hoorick	38bce1d4cf	Make `pos` optional to avoid crashing `PerceiverModel` operation (#15972 ) Updates `PerceiverAudioPreprocessor` `forward()` implementation to match most other preprocessors / postprocessors	2022-03-09 15:48:52 +01:00
Sylvain Gugger	cec89e1a0e	Simplify release utils (#15921 ) * Simplify release utils * Quality	2022-03-09 08:47:58 -05:00
Lysandre Debut	e493a3a5e2	Fix github actions comment (#16009 ) * Add issue number * Dev	2022-03-09 08:39:03 -05:00
Joao Gante	e7f34ccd4f	Swag example: Update doc format (#16014 )	2022-03-09 13:25:34 +00:00
Yih-Dar	3ea046995e	Removed an outdated check about hdf5_version (#16011 ) * removed an outdated check about hdf5_version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 14:21:23 +01:00
Patrick von Platen	c1aaa43935	[Doctests] Move doctests to new GPU & Fix bugs (#15969 ) * test * up * up * Empty test commit * up * update tests * up * fix some vision models * correct * correct docs * Trigger notification * finalize * check * correct quicktour * Apply suggestions from code review * improve doctests * Trigger Build * next try * next try * and again * Output current clone information * Output current clone information * Correct path * add tf round again * revert to daily job Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2022-03-09 13:09:56 +01:00
Nicolas Patry	f4e4ad34cc	Add `ForInstanceSegmentation` models to `image-segmentation` pipelines (#15937 ) * Adding ForInstanceSegmentation to pipelines. * Last fix `category_id` renamed to `label_id`. * Can't be none no more. * No `is_thing_map` anymore.	2022-03-09 10:19:05 +01:00
David Hall	5b7dcc7342	Seed _get_train_sampler's generator with arg seed to improve reproducibility (#15961 ) * Seed get_train_sampler's generator with arg seed to improve reproducibility and make the world_size<=1 code path more similar to the others * move test file into trainer test explicitly * dumb typo * make style lint happy * per discussion, switch to data_seed * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-08 13:45:41 -05:00
Joao Gante	70203b5937	TF generate refactor - past without encoder outputs (#15944 ) * Remove packed past from generation_tf_utils * update models with the new past format * update template accordingly	2022-03-08 14:46:44 +00:00
Joao Gante	62d847602a	Update TF multiple choice example (#15868 )	2022-03-08 13:16:34 +00:00
Patrick von Platen	ab2f8d12a7	add hf hub to env version command (#15981 )	2022-03-08 14:03:03 +01:00
Yih-Dar	72983303c5	Fix TFEncoderDecoderModelTest - Pytorch device (#15979 ) * fix device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-08 13:37:20 +01:00
Sylvain Gugger	f5a080dd10	Do a pull in case docs were updated during build (#15922 )	2022-03-08 07:19:41 -05:00
Yeb Havinga	91fb62d01c	Speedup training by using numpy instead of jnp for batch shuffling (#15963 ) Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>	2022-03-08 12:18:38 +01:00
Nicolas Patry	ea07064a5c	Returning outputs only when asked for for MaskFormer. (#15936 ) * Returning outputs only when asked for for MaskFormer. * Adding `output_auxiliary_logits` to the config.	2022-03-08 11:17:57 +01:00
NielsRogge	b19f3e69a0	[Tests] Fix ViTMAE integration test (#15949 ) * Fix test across both cpu and gpu * Fix typo	2022-03-08 10:49:44 +01:00
NielsRogge	9879a1d5f0	Fix LayoutLMv2 test (#15939 ) * Fix LayoutLMv2 test * Update black	2022-03-08 10:49:30 +01:00
Yih-Dar	8b9ae45549	Set scale_embedding to False in some TF tests (#15952 ) * set scale_embedding to False to avoid large (> 1e-5) output differences between PT/TF Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-07 22:14:33 +01:00
Steven Liu	38cc35069c	Update training scripts docs (#15931 ) * 📝 first draft * 🖍 apply feedback * 🖍 remove examples from toctree * 🗑 remove examples from docs/source	2022-03-07 13:29:14 -06:00
Sylvain Gugger	c87cfd653c	Better error message when inputs are empty	2022-03-07 13:29:16 -05:00
Francesco Saverio Zuppichini	e9fa7cd5d7	Make is_thing_map in Feature Extractor post_process_panoptic_segmentation defaults to all instances (#15954 ) * is_thing_map defaults to all instances * better naming * control flow * resolving conversations	2022-03-07 19:10:32 +01:00
Sanchit Gandhi	2596f95e84	Fix Embedding Module Bug in Flax Models (#15920 )	2022-03-07 18:17:45 +01:00
Sanchit Gandhi	1a62b25caf	Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder (#15938 ) * Backprop Test for Freeze FlaxWav2Vec2 Feature Encoder * remove jnp.ndarray type suggestion * assert frozen grads are precisely zero	2022-03-07 18:10:15 +01:00
Konstantin Dobler	544fd9876b	Support modern list type hints in HfArgumentParser (#15951 ) * Support modern list type hint in HfArgumentParser * Fix formatting with black	2022-03-07 10:22:48 -05:00
Suraj Patil	60b81dfa6f	remove re-defination of FlaxWav2Vec2ForCTCModule (#15965 )	2022-03-07 14:58:44 +01:00
Chan Woo Kim	ef9c3ca348	[Bug Fix] Beam search example in docs fails & a fix (integrating `max_length` in `BeamScorer.finalize()`) (#15555 ) * added the test and fix * had left out a comment	2022-03-07 09:10:18 +01:00
Francesco Saverio Zuppichini	9932ee4b4b	made MaskFormerModelTest faster (#15942 )	2022-03-04 19:11:48 +01:00
NielsRogge	e8efaecb87	Move dependency to call method (#15941 )	2022-03-04 18:53:54 +01:00
Chan Woo Kim	5c6f57ee75	Constrained Beam Search [With Disjunctive Decoding] (#15761 ) * added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements * finished adding what is sort of an opinionated implementation of disjunctive generation, but it revealed errors in inner beam search logic during testing. * fixed bug found in constrained beam search that used beam_idx that were not global across all the batches * disjunctive constraint working 100% correctly * passing all tests * Accidentally included mlruns * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/generation_beam_constraints.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * complete overhaul of type complexities and other nits * strict type checks in generate() * fixing second round of feedback by narsil * fixed failing generation test because of type check overhaul * generation test fail fix * fixing test fails Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-04 18:18:34 +01:00
Francesco Saverio Zuppichini	040c11f6da	Tests for MaskFormerFeatureExtractor's post_process*** methods (#15929 ) * proper tests for post_process*** methods in feature extractor * mask th == 0 * Update tests/maskformer/test_feature_extraction_maskformer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-04 18:04:19 +01:00
Yih-Dar	f0aacc140b	Do not change the output from tuple to list - to match PT's version (#15918 ) * Do not change the output from tuple to list - to match PT's version * Fix the same issues for 5 other models and the template Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-04 17:50:24 +01:00
Patrick von Platen	10b76987fc	[FlaxT5 Example] fix flax t5 example pretraining (#15835 )	2022-03-04 17:04:43 +01:00

1 2 3 4 5 ...

9170 Commits