transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Steven Liu	5b4c97d09d	Update troubleshoot guide (#16001 ) * 📝 first draft * 🖍 apply feedback * 🖍 apply feedback	2022-03-11 13:05:44 -06:00
Kevin Bondzio	9442b3ce31	Add soft length regulation for sequence generation (#15245 ) * add possibility to softly regulate length when using sampling method in model.generate() function * fix test config, fix formatting * fix rag integration, fix docstyling * fix wrong docstring * change param to tuple, add test * fix old param in rag_model, remove unused import * change test according to new param * fix formatting * fix test case * fix doc style * move start_length calculation to Logitprocessor * add possibility to softly regulate length when using sampling method in model.generate() function * fix rag integration, fix docstyling * fix test config, fix formatting * change param to tuple, add test * fix old param in rag_model, remove unused import * add possibility to softly regulate length when using sampling method in model.generate() function * change param to tuple, add test * fix old param in rag_model, remove unused import * remove unused import * fix small errors * fix test * add possibility to softly regulate length when using sampling method in model.generate() function * fix test config, fix formatting * fix rag integration, fix docstyling * change param to tuple, add test * fix old param in rag_model, remove unused import * change test according to new param * fix test case * move start_length calculation to Logitprocessor * add possibility to softly regulate length when using sampling method in model.generate() function * fix rag integration, fix docstyling * fix test config, fix formatting * change param to tuple, add test * fix old param in rag_model, remove unused import * add possibility to softly regulate length when using sampling method in model.generate() function * fix test config, fix formatting * fix rag integration, fix docstyling * add possibility to softly regulate length when using sampling method in model.generate() function * fix rag integration, fix docstyling * change param to tuple, add test * fix old param in rag_model, remove unused import * fix small errors * Update src/transformers/generation_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/generation_utils.py * Update src/transformers/generation_utils.py * fix docstring, add type ind model rag * fix docstrings * introduce seq_length variable for cleaner code * fix black formatting * add input_ids_seq_length to modeling_rag * add input_ids_seq_length to test * retrigger checks * retrigger checks Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.local> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.fritz.box>	2022-03-11 19:36:44 +01:00
Patrick von Platen	322c8533d7	Run daily test without time-out at least once (#16077 )	2022-03-11 18:04:17 +01:00
feifang24	7e00247fad	check for key 'torch.dtype' in nested dicts in config (#16065 )	2022-03-11 12:00:11 -05:00
Matt	5d2fed2e8c	Adding type hints for TFRoBERTa (#16057 ) * Adding type annotations for TFRoBERTa * Add type hints to TFRobertaModel too	2022-03-11 16:13:47 +00:00
Matt	bb69d154c5	Add type annotations for BERT and copies (#16074 ) * Add type annotations for BERT and copies * make fixup	2022-03-11 16:13:29 +00:00
Sylvain Gugger	f7708e1bed	Force default brnahc name via the config	2022-03-11 10:09:15 -05:00
Sylvain Gugger	ecf989ca73	Trigger doc build	2022-03-11 09:20:05 -05:00
Lysandre Debut	0868fdef85	Fix torch-scatter version (#16072 )	2022-03-11 09:03:27 -05:00
Funtowicz Morgan	5b369dc5d8	Remove assertion over possible activation functions in DistilBERT (#16066 ) * Remove assertion over possible activation functions * Same for TF and Flax	2022-03-11 14:27:59 +01:00
Sylvain Gugger	f5741bcd02	Move QDQBert in just PyTorch block (#16062 )	2022-03-11 07:58:02 -05:00
Yih-Dar	b6bdb943b2	Fix a TF test name (LayoutLMModelTest) (#16061 ) * fix name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-11 11:22:36 +01:00
David S. Batista	96ac7549cb	updating fine-tune classifier documentation (#16063 )	2022-03-10 16:21:56 -05:00
lewtun	6b09328368	Fix duplicate arguments passed to dummy inputs in ONNX export (#16045 ) * Fix duplicate arguments passed to dummy inputs in ONNX export * Fix M2M100 ONNX config * Ensure we check PreTrained model only if torch is available * Remove TensorFlow tests for models without PyTorch parity	2022-03-10 20:19:45 +01:00
Suraj Patil	ba21001f4c	support new marian models (#15831 ) * support not sharing embeddings * update modeling * update tokenizer * fix conversion script * always use self.shared * boom boom * begin tests * update tests * fix resize_decoder_token_embeddings * address Patrick's comments * style * update conversion script * fix conversion script * fix tokenizer * better name target vocab * add integration test for tokenizer with two vocabs * style * address Patrick's comments * add integration test for model	2022-03-10 19:41:56 +01:00
Lysandre Debut	e66743e6c9	DeBERTa/DeBERTa-v2/SEW Support for torch 1.11 (#16043 ) * Support for torch 1.11 * Address Sylvain's comment	2022-03-10 09:01:05 -05:00
Sanchit Gandhi	741e49305d	Fix Bug in Flax Seq2Seq Models (#16021 ) * Fix Bug in Flax Seq2Seq Models * incorporate suggested changes	2022-03-10 14:58:05 +01:00
Joao Gante	b7018abf3c	TF: Unpack model inputs through a decorator (#15907 ) * MVP * apply decorator to TFBertModel * finish updating bert * update rembert (copy-linked to bert) * update roberta (copy-linked to bert); Fix args * Now working for non-text modalities	2022-03-10 13:31:35 +00:00
Sylvain Gugger	19597998f6	Don't compute metrics in LM examples on TPU (#16029 )	2022-03-10 07:44:51 -05:00
Sylvain Gugger	10591399d6	Build the doc in a seperate folder then move it (#16020 ) * Build the doc in a seperate folder then move it * Allow job * Is this it? * Dislike comments? * Copy instead of move * Removing version built * Typos * No variable * Take _versions.yml into account * Finish main job and add dev job * Forgot the run * Fix syntax error * Execute builder from the repo * Typo	2022-03-10 07:44:29 -05:00
Yih-Dar	2f463effb3	Fix TFDebertaV2ConvLayer in TFDebertaV2Model (#16031 ) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-10 12:23:46 +01:00
Sanchit Gandhi	1da84ae02c	Fix Bug in Flax-Speech-Encoder-Decoder Test (#16041 ) * Fix Bug in Flax-Speech-Encoder-Decoder Test * change thresholds for CPU precision	2022-03-10 12:09:29 +01:00
Suraj Patil	b2a1c994cb	[README] fix url for Preprocessing tutorial (#16042 )	2022-03-10 12:09:05 +01:00
NielsRogge	8d83ebdf18	[Tests] Add attentions_option to ModelTesterMixin (#15909 ) * Add attentions_option to common tester * Fix tests, apply suggestion * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-10 12:00:30 +01:00
Patrick von Platen	6ce11c2c0f	[Docs] Improve PyTorch, Flax generate API (#15988 ) * Move generate docs * up * Update docs/source/_toctree.yml * correct * correct some stuff * correct tests * more fixes * finish generate * add to doc stest * finish * finalize * add warning to generate method	2022-03-10 11:54:45 +01:00
André Storhaug	0951d31788	Fix dependency error message in ServeCommand (#16033 ) "uvicorn" is misspelled as "unicorn".	2022-03-10 11:35:26 +01:00
NielsRogge	0835119bf3	Add Document Image Transformer (DiT) (#15984 ) * Add conversion script * Improve script * Fix bug * Add option to push to hub * Add support for classification models * Update model name * Upload feature extractor files first * Remove hash checking * Fix config * Add id2label * Add import * Fix id2label file name * Fix expected shape * Add model to README * Improve docs * Add integration test and fix CI * Fix code style * Add missing init * Add model to SPECIAL_MODULE_TO_TEST_MAP Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-03-10 11:34:44 +01:00
Sanchit Gandhi	6c9010ef63	Update README.md	2022-03-10 10:20:37 +01:00
Sanchit Gandhi	fde901877a	Freeze Feature Encoder in FlaxSpeechEncoderDecoder (#15997 ) * Freeze Feature Encoder in FlaxSpeechEncoderDecoder * add backprop test	2022-03-10 09:59:19 +01:00
Pavel Belevich	65f9653ed0	Fix warning message in ElectraForCausalLM (#16023 )	2022-03-09 17:27:15 -05:00
Suraj Patil	a69e185074	add doctests for bart like seq2seq models (#15987 ) * boom boom * enable doctest for few seq2seq models * add seq2seq models in documentation_tests.txt * fix docstring blenderbot * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix seq classif doc sample * don't check loss for seq classif examples * +IGNORE_OUTPUT => +IGNORE_RESULT * fix _SEQ_CLASS_EXPECTED_OUTPUT_SHAPE * fix some docs * more fixes * last fix (hopefully) * fix big bird gen example * fix mbart gen example Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 20:30:38 +01:00
Sanchit Gandhi	b256f3518d	Add FlaxBartForCausalLM (#15995 ) * add causal lm * add CausalLM tests * Add FlaxBartForCausalLM * Add EncoderDecoder model tests * change docstring * make repo-consistency * suggested changes * remove jax ops * correction * rename pre-trained decoder model	2022-03-09 19:53:01 +01:00
lewtun	50dd314d93	Add ONNX export for ViT (#15658 ) * Add ONNX support for ViT * Refactor to use generic preprocessor * Add vision dep to tests * Extend ONNX slow tests to ViT * Add dummy image generator * Use model_type to determine modality * Add deprecation warnings for tokenizer argument * Add warning when overwriting the preprocessor * Add optional args to docstrings * Add minimum PyTorch version to OnnxConfig * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case * Add reasonable value for default atol Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 17:36:59 +01:00
Yih-Dar	b7fa1e3dee	Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest (#15989 ) * Use tiny model for TFRembertEncoderDecoderModelTest.get_pretrained_model() Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 17:16:25 +01:00
Shotaro Ishihara	8feede229c	Fix broken code blocks in README.md (#15967 ) at transformers/examples/pytorch/contrastive-image-text	2022-03-09 17:07:52 +01:00
Francesco Saverio Zuppichini	1e8f37992f	done (#16012 )	2022-03-09 15:51:56 +01:00
Basile Van Hoorick	38bce1d4cf	Make `pos` optional to avoid crashing `PerceiverModel` operation (#15972 ) Updates `PerceiverAudioPreprocessor` `forward()` implementation to match most other preprocessors / postprocessors	2022-03-09 15:48:52 +01:00
Sylvain Gugger	cec89e1a0e	Simplify release utils (#15921 ) * Simplify release utils * Quality	2022-03-09 08:47:58 -05:00
Lysandre Debut	e493a3a5e2	Fix github actions comment (#16009 ) * Add issue number * Dev	2022-03-09 08:39:03 -05:00
Joao Gante	e7f34ccd4f	Swag example: Update doc format (#16014 )	2022-03-09 13:25:34 +00:00
Yih-Dar	3ea046995e	Removed an outdated check about hdf5_version (#16011 ) * removed an outdated check about hdf5_version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-09 14:21:23 +01:00
Patrick von Platen	c1aaa43935	[Doctests] Move doctests to new GPU & Fix bugs (#15969 ) * test * up * up * Empty test commit * up * update tests * up * fix some vision models * correct * correct docs * Trigger notification * finalize * check * correct quicktour * Apply suggestions from code review * improve doctests * Trigger Build * next try * next try * and again * Output current clone information * Output current clone information * Correct path * add tf round again * revert to daily job Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2022-03-09 13:09:56 +01:00
Nicolas Patry	f4e4ad34cc	Add `ForInstanceSegmentation` models to `image-segmentation` pipelines (#15937 ) * Adding ForInstanceSegmentation to pipelines. * Last fix `category_id` renamed to `label_id`. * Can't be none no more. * No `is_thing_map` anymore.	2022-03-09 10:19:05 +01:00
David Hall	5b7dcc7342	Seed _get_train_sampler's generator with arg seed to improve reproducibility (#15961 ) * Seed get_train_sampler's generator with arg seed to improve reproducibility and make the world_size<=1 code path more similar to the others * move test file into trainer test explicitly * dumb typo * make style lint happy * per discussion, switch to data_seed * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-08 13:45:41 -05:00
Joao Gante	70203b5937	TF generate refactor - past without encoder outputs (#15944 ) * Remove packed past from generation_tf_utils * update models with the new past format * update template accordingly	2022-03-08 14:46:44 +00:00
Joao Gante	62d847602a	Update TF multiple choice example (#15868 )	2022-03-08 13:16:34 +00:00
Patrick von Platen	ab2f8d12a7	add hf hub to env version command (#15981 )	2022-03-08 14:03:03 +01:00
Yih-Dar	72983303c5	Fix TFEncoderDecoderModelTest - Pytorch device (#15979 ) * fix device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-08 13:37:20 +01:00
Sylvain Gugger	f5a080dd10	Do a pull in case docs were updated during build (#15922 )	2022-03-08 07:19:41 -05:00
Yeb Havinga	91fb62d01c	Speedup training by using numpy instead of jnp for batch shuffling (#15963 ) Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>	2022-03-08 12:18:38 +01:00

1 2 3 4 5 ...

9188 Commits