transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 13:20:12 +06:00

Author	SHA1	Message	Date
Matt	61cf2ea9c0	Fix incorrect output shapes for TF/PT LED (#13882 ) * Fix issues with LED model * Style pass * Bugfixes * correct attentions as well Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-10-07 17:30:15 +01:00
Nicolas Patry	0ddadbf0a8	Fixing question-answering with long contexts (#13873 ) * Tmp. * Fixing BC for question answering with long context. * Capping model_max_length to avoid tf overflow. * Bad workaround bugged roberta. * Fixing name.	2021-10-05 16:08:58 +02:00
Lysandre Debut	c3d9ac7607	Expose get_config() on ModelTesters (#12812 ) * Expose get_config() on ModelTesters * Typo	2021-07-21 04:13:11 -04:00
Lysandre Debut	959d448b3f	Fix led torchscript (#12735 ) * Don't test LED on torchscript * Typo	2021-07-15 11:48:50 -04:00
Daniel Stancl	e3ff165aa5	Fix cross-attention head mask for Torch encoder-decoder models (#10605 ) * Fix cross-attention head mask for Torch BART models * Fix head masking for cross-attention module for the following models: BART, Blenderbot, Blenderbot_small, M2M_100, Marian, MBart, Pegasus * Enable test_headmasking for M2M_100 model * Fix cross_head_mask for FSMT, LED and T5 * This commit fixes `head_mask` for cross-attention modules in the following models: FSMT, LED, T5 * It also contains some smaller changes in doc so that it is be perfectly clear the shape of `cross_head_mask` is the same as of `decoder_head_mask` * Update template * Fix template for BartForCausalLM * Fix cross_head_mask for Speech2Text models * Fix cross_head_mask in templates * Fix args order in BartForCausalLM template * Fix doc in BART templates * Make more explicit naming * `cross_head_mask` -> `cross_attn_head_mask` * `cross_layer_head_mask` -> `cross_attn_layer_head_mask` * Fix doc * make style quality * Fix speech2text docstring	2021-04-23 18:58:06 +02:00
Sylvain Gugger	ba8b1f4754	Add support for multiple models for one config in auto classes (#11150 ) * Add support for multiple models for one config in auto classes * Use get_values everywhere * Prettier doc	2021-04-08 18:41:36 -04:00
Daniel Stancl	71bdc076dd	Add head_mask and decoder_head_mask to PyTorch LED (#9856 ) * Add {decoder_,}head_mask to LED * Fix create_custom_forward signatue in encoder * Add head_mask to longformer * Add head_mask to longformer to fix dependencies of LED on Longformer. * Not working yet * Add mising one input in longofrmer_modeling.py * make fix-copies	2021-02-02 11:06:52 -08:00
Patrick von Platen	c8ea582ed6	reduce led memory (#9723 )	2021-01-21 05:16:15 -05:00
Patrick von Platen	a400fe8931	[LED Test] fix common inputs pt for flaky pt-tf led test (#9459 ) * fix common inputs pt flakey led * fix other tests correspondingly	2021-01-07 12:29:03 +01:00
Patrick von Platen	b8462b5b2a	[GenerationOutputs] Fix GenerationOutputs Tests (#9443 ) * fix generation models * fix led * fix docs * add is_decoder * fix last docstrings * make style * fix t5 cross attentions * correct t5	2021-01-06 19:37:02 +01:00
Patrick von Platen	189387e9b2	LED (#9278 ) * create model * add integration * save current state * make integration tests pass * add one more test * add explanation to tests * remove from bart * add padding * remove unnecessary test * make all tests pass * re-add cookie cutter tests * finish PyTorch * fix attention test * Update tests/test_modeling_common.py * revert change * remove unused file * add string to doc * save intermediate * make tf integration tests pass * finish tf * fix doc * fix docs again * add led to doctree * add to auto tokenizer * added tips for led * make style * apply jplus statements * correct tf longformer * apply lysandres suggestions * apply sylvains suggestions * Apply suggestions from code review	2021-01-05 13:14:30 +01:00

11 Commits