transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Minwoo Lee	86a154722f	Fix omitted lazy import for xlm-prophetnet (#13052 ) * Fix omitted lazy import for xlm-prophetnet * Update src/transformers/models/xlm_prophetnet/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix style using black Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-08-13 12:24:53 +02:00
Nicolas Patry	d58926ab1d	Moving fill-mask pipeline to new testing scheme (#12943 ) * Fill mask pipelines test updates. * Model eval !! * Adding slow test with actual values. * Making all tests pass (skipping quite a bit.) * Doc styling. * Better doc cleanup. * Making an explicit test with no pad token tokenizer. * Typo.	2021-08-13 12:04:18 +02:00
Yih-Dar	a04d4bf2d7	Fix flax gpt2 hidden states (#13109 ) * Fix inconsistency of the last element in hidden_states between PyTorch/Flax GPT2(Neo) (#13102) * Fix missing elements in outputs tuple * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Fix local variable 'all_hidden_states' referenced before assignment * Fix by returning tuple containing None values * Fix quality Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2021-08-13 14:15:53 +05:30
Will Frey	d8fb278a2c	Create py.typed (#12893 ) * Create py.typed This creates a [py.typed as per PEP 561](https://www.python.org/dev/peps/pep-0561/#packaging-type-information) that should be distributed to mark that the package includes (inline) type annotations. * Update setup.py Include py.typed as package data * Update setup.py Call `setup(...)` with `zip_safe=False`.	2021-08-13 04:12:59 -04:00
Sylvain Gugger	b0a917c48a	Fix CircleCI nightly tests (#13113 )	2021-08-13 08:57:30 +02:00
Gunjan Chhablani	bda1cb0236	Fix VisualBERT docs (#13106 ) * Fix VisualBERT docs * Show example notebooks as lists * Fix style	2021-08-13 11:44:04 +05:30
Bill Schnurr	e46ad22cd6	Improve type checker performance (#13094 ) * conditional declare `TOKENIZER_MAPPING_NAMES` within a `if TYPE_CHECKING` block so that type checkers dont need to evaluate the RHS of the assignment. this improves performance of the pylance/pyright type checkers * Update src/transformers/models/auto/tokenization_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * adding missing import * format Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-08-12 18:45:54 +02:00
Sylvain Gugger	b9962b8656	Ci last fix (#13103 ) * Only report failures on failures * Fix typo * Put it everywhere	2021-08-12 10:45:06 -04:00
Suraj Patil	f5cd27694a	[FlaxCLIP] allow passing params to image and text feature methods (#13099 ) * allow passing params to image and text feature method * ifx for hybrid clip as well	2021-08-12 18:35:01 +05:30
Sylvain Gugger	9a498c37a2	Rely on huggingface_hub for common tools (#13100 ) * Remove hf_api module and use hugginface_hub * Style * Fix to test_fetcher * Quality	2021-08-12 14:59:02 +02:00
Patrick von Platen	6900dded49	[Flax/JAX] Run jitted tests at every commit (#13090 ) * up * up * up	2021-08-12 14:49:46 +02:00
Yih-Dar	773d386041	Change a parameter name in FlaxBartForConditionalGeneration.decode() (#13074 ) * Change FlaxBartForConditionalGeneration.decode() argument: deterministic -> train * Also change the parameter name to train for flax marian and mbart Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2021-08-12 17:49:48 +05:30
Sylvain Gugger	f176fbf588	Fix doc building error	2021-08-12 05:49:02 -04:00
Sylvain Gugger	be323d5152	Reactive test fecthers on scheduled test with proper git install (#13097 ) * Reactive test fecthers on scheduled test with proper git install * Proper fetch-depth	2021-08-12 11:38:14 +02:00
Sylvain Gugger	ea8ffe36d3	Proper import for unittest.mock.patch (#13085 )	2021-08-12 11:23:00 +02:00
Kamal Raj	d329b63369	Deberta tf (#12972 ) * TFDeberta moved weights to build and fixed name scope added missing , bug fixes to enable graph mode execution updated setup.py fixing typo fix imports embedding mask fix added layer names avoid autmatic incremental names +XSoftmax cleanup added names to layer disable keras_serializable Distangled attention output shape hidden_size==None using symbolic inputs test for Deberta tf make style Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Update src/transformers/models/deberta/modeling_tf_deberta.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> removed tensorflow-probability removed blank line * removed tf experimental api +torch_gather tf implementation from @Rocketknight1 * layername DeBERTa --> deberta * copyright fix * added docs for TFDeberta & make style * layer_name change to fix load from pt model * layer_name change as pt model * SequenceClassification layername change, to same as pt model * switched to keras built-in LayerNormalization * added `TFDeberta` prefix most layer classes * updated to tf.Tensor in the docstring	2021-08-12 05:01:26 -04:00
Gunjan Chhablani	c4e1586db8	Fix VisualBert Embeddings (#13017 )	2021-08-12 03:57:34 -04:00
Lysandre Debut	53b38d6269	Doctests job (#13088 ) * Doctests * Limit to 4 decimals * Try with separate PT/TF tests * Remove test for TF * Ellips the predictions * Doctest continue on failure Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-08-12 03:42:25 -04:00
Ibraheem Moosa	3f52c685c1	Fix classifier dropout in AlbertForMultipleChoice (#13087 ) Classification head of AlbertForMultipleChoice uses `hidden_dropout_prob` instead of `classifier_dropout_prob`. This is not desirable as we cannot change classifer head dropout probability without changing the dropout probabilities of the whole model.	2021-08-12 03:37:31 -04:00
Lysandre Debut	c89180a9de	Install git (#13091 ) * Install git * Add TF tests * And last TF test * Add in commented code too Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-08-11 18:09:41 +02:00
Gunjan Chhablani	c71f73f438	Add VisualBERT demo notebook (#12263 ) * Initialize VisualBERT demo * Update demo * Add commented URL * Update README * Update README	2021-08-11 10:10:59 -04:00
Sylvain Gugger	83424ade1a	[Doctest] Setup, quicktour and task_summary (#13078 ) * Fix doctests for quicktour * Adapt causal LM exemple * Remove space * Fix until summarization * End of task summary * Style * With last changes in quicktour	2021-08-11 13:45:25 +02:00
Sylvain Gugger	bfc885091b	Fix last one	2021-08-10 13:48:26 -04:00
Ibraheem Moosa	29dada00c4	Use original key for label in DataCollatorForTokenClassification (#13057 ) * Use original key for label in DataCollatorForTokenClassification DataCollatorForTokenClassification accepts either `label` or `labels` as key for label in it's input. However after padding the label it assigns the padded labels to key `labels`. If originally `label` was used as key than the original upadded labels still remains in the batch. Then at line 192 when we try to convert the batch elements to torch tensor than these original unpadded labels cannot be converted as the labels for different samples have different lengths. * Fixed style.	2021-08-10 18:39:48 +02:00
Sylvain Gugger	95e2e14f9d	Revert to all tests whil we debug what's wrong (#13072 )	2021-08-10 18:37:01 +02:00
Sylvain Gugger	477480ce2a	Trigger GPU tests	2021-08-10 10:26:06 -04:00
Sylvain Gugger	0dad5d825d	Fix fallback of test_fetcher (#13071 )	2021-08-10 16:17:06 +02:00
Sylvain Gugger	4dd857244c	Merge branch 'master' of github.com:huggingface/transformers	2021-08-10 09:40:38 -04:00
Sylvain Gugger	bd5593b6c4	Try fecthing the last two commits	2021-08-10 09:40:16 -04:00
Sylvain Gugger	9e9b8f1d99	Roll out the test fetcher on push tests (#13055 ) * Use test fetcher for push tests as well * Force diff with last commit for circleCI on master * Fix syntax error * Style * Schedule nightly tests	2021-08-10 14:54:52 +02:00
Sylvain Gugger	2e0d767ab2	Pin sacrebleu	2021-08-10 06:27:49 -04:00
Sylvain Gugger	0454e4bd8b	Fix ModelOutput instantiation form dictionaries (#13067 ) * Fix ModelOutput instantiation form dictionaries * Style	2021-08-10 12:20:04 +02:00
Aleksey Korshuk	3157fa3c53	docs: add HuggingArtists to community notebooks (#13050 ) * Adding HuggingArtists to Community Notebooks * Adding HuggingArtists to Community Notebooks * Adding HuggingArtists to Community Notebooks * docs: add HuggingArtists to community notebooks Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-08-10 09:36:44 +02:00
Kevin Canwen Xu	ab7551cd7f	Add try-except for torch_scatter (#13040 ) * Add try-catch for torch_scatter * Update modeling_tapas.py	2021-08-10 15:29:35 +08:00
SaulLu	76cadb7943	replace tgt_lang by tgt_text (#13061 )	2021-08-09 22:47:05 +05:30
Lysandre	a8bf2fa76e	Documentation for patch v4.9.2	2021-08-09 16:14:17 +02:00
Lysandre Debut	5008e08885	Add to ONNX docs (#13048 ) * Add to ONNX docs * Add MBART example * Update docs/source/serialization.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-08-09 09:51:49 -04:00
Lysandre Debut	6f5ab9daf1	Add MBART to models exportable with ONNX (#13049 ) * Add MBART to models exportable with ONNX * unittest mock * Add tests * Misc fixes	2021-08-09 08:56:04 -04:00
Patrick von Platen	13a9c9a354	[Flax] Refactor gpt2 & bert example docs (#13024 ) * fix_torch_device_generate_test * remove @ * improve docs for clm * speed-ups * correct t5 example as well * push final touches * Update examples/flax/language-modeling/README.md * correct docs for mlm * Update examples/flax/language-modeling/README.md Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-08-09 13:37:50 +02:00
abhishek thakur	3ff2cde5ca	tfhub.de -> tfhub.dev (#12565 )	2021-08-09 08:11:17 +02:00
Patrick von Platen	24cbf6bc5a	Update README.md	2021-08-08 17:11:19 +02:00
lewtun	7390d9de63	Use min version for huggingface-hub dependency (#12961 ) * Use min version for huggingface-hub dependency * Update dependency version table	2021-08-08 09:06:05 -05:00
Sylvain Gugger	7fcee113c1	Tpu tie weights (#13030 ) * Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests	2021-08-06 20:41:39 +02:00
Lysandre Debut	1bf38611a4	Put smaller ALBERT model (#13028 )	2021-08-06 12:41:33 -04:00
Michael Benayoun	dc420b0eb1	T5 with past ONNX export (#13014 ) T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model Authored-by: Michael Benayoun <michael@huggingface.co>	2021-08-06 15:46:26 +02:00
Michael Benayoun	ee11224611	FX submodule naming fix (#13016 ) Changed the way dynamically inserted submodules are named and the method used to insert them Authored-by: Michael Benayoun <michael@huggingface.co>	2021-08-06 15:37:29 +02:00
Sylvain Gugger	9870093f7b	[WIP] Disentangle auto modules from other modeling files (#13023 ) * Initial work * All auto models * All tf auto models * All flax auto models * Tokenizers * Add feature extractors * Fix typos * Fix other typo * Use the right config * Remove old mapping names and update logic in AutoTokenizer * Update check_table * Fix copies and check_repo script * Fix last test * Add back name * clean up * Update template * Update template * Forgot a ) * Use alternative to fixup * Fix TF model template * Address review comments * Address review comments * Style	2021-08-06 13:12:30 +02:00
Patrick von Platen	2e4082364e	[Flax T5] Speed up t5 training (#13012 ) * fix_torch_device_generate_test * remove @ * update * up * fix * remove f-stings * correct readme * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-08-06 11:21:37 +02:00
Patrick von Platen	60e448c87e	[Flax] Correct pt to flax conversion if from base to head (#13006 ) * finish PR * add tests * correct tests * finish * correct other flax tests * better naming * correct naming * finish * apply sylvains suggestions	2021-08-05 18:38:50 +02:00
Nils Reimers	33929448a1	Replace // operator with / operator + long() (#13013 )	2021-08-05 15:55:14 +02:00

... 2 3 4 5 6 ...

7895 Commits