transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sanchit Gandhi	cd9274d010	[FlaxBert] Add ForCausalLM (#16995 ) * [FlaxBert] Add ForCausalLM * make style * fix output attentions * Add RobertaForCausalLM * remove comment * fix fx-to-pt model loading * remove comment * add modeling tests * add enc-dec model tests * add big_bird * add electra * make style * make repo-consitency * add to docs * remove roberta test * quality * amend cookiecutter * fix attention_mask bug in flax bert model tester * tighten pt-fx thresholds to 1e-5 * add 'copied from' statements * amend 'copied from' statements * amend 'copied from' statements * quality	2022-05-03 11:26:19 +02:00
Patrick von Platen	31616b8d61	[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… (#16990 ) * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length * correct t5 tokenizer * correct t5 tokenizer * fix test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * finish Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-02 21:27:34 +02:00
Sylvain Gugger	1073f00d4e	Clean up setup.py (#17045 ) * Clean up setup.py * Trigger CI * Upgrade Python used	2022-05-02 12:58:17 -04:00
Lysandre Debut	30ca529902	Make the sacremoses dependency optional (#17049 ) * Make sacremoses optional * Pickle	2022-05-02 12:47:47 -04:00
Lysandre Debut	bb2e088be7	Allow all imports from transformers (#17050 )	2022-05-02 12:47:39 -04:00
NielsRogge	1ac698744c	Add YOLOS (#16848 ) * First draft * Add YolosForObjectDetection * Make forward pass work * Add mid position embeddings * Add interpolation of position encodings * Add expected values * Add YOLOS to tests * Add integration test * Support tiny model as well * Support all models in conversion script * Remove mid_pe_size attribute * Make more tests pass * Add model to README and fix config * Add copied from statements * Rename base_model_prefix to vit * Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP * Apply suggestions from code review * Apply more suggestions from code review * Convert remaining checkpoints * Improve docstrings * Add YolosFeatureExtractor * Add feature extractor to docs * Add corresponding tests * Fix style * Fix docs * Apply suggestion from code review * Fix bad rebase * Fix some more bad rebase * Fix missing character * Improve docs and variable names Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-02 18:30:55 +02:00
Zachary Mueller	f275e593bf	Fix no_trainer examples to properly calculate the number of samples (#17046 ) * Update all examples to properly calculate progress bar	2022-05-02 11:56:25 -04:00
Zachary Mueller	35d48db881	Update no_trainer examples to use new logger (#17044 ) * Propagate and fix imports	2022-05-02 11:56:15 -04:00
calpt	daecae1f1c	[Trainer] Move logic for checkpoint loading into separate methods for easy overriding (#17043 )	2022-05-02 10:40:37 -04:00
NielsRogge	2de2c9ecca	Clean up vision tests (#17024 ) * Clean up tests * Make fixup Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-02 16:28:58 +02:00
Sylvain Gugger	4be8b95a9f	Disable Flax GPU tests on push (#17042 )	2022-05-02 10:25:53 -04:00
yujun	bdd690a74d	add torch.no_grad when in eval mode (#17020 ) * add torch.no_grad when in eval mode * make style quality	2022-05-02 07:49:19 -04:00
Martin Pömsl	9586e222af	Fix typo in RetriBERT docstring (#17018 )	2022-05-02 07:48:20 -04:00
Sanchit Gandhi	93b802c43e	[Flax(Speech)EncoderDecoder] Fix bug in `decoder_module` (#17036 ) * [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module` * [FlaxEncoderDecoder] Fix bug in `decoder_module`	2022-05-02 13:06:45 +02:00
Sylvain Gugger	1ae182d9a6	Fix style	2022-05-02 06:19:31 -04:00
Michael Benayoun	2c2a2169b6	Fx with meta (#16836 ) * Add meta proxy * Uses meta data to trace data dependent control-flow * Remove commented class * Handles torch creating functions * Added type annotation to fix tracing * Tracing works for everything but T5 and GPT-J * Almost all previously supported models pass * All architectures can be traced except T5 * Intermediate commit to have a trace of the comparison operators for HFProxy * Everything works, except loss computation * Everything works * Removed unused import * Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version * Fix torch_matmul_override * Change attributes reference to deepcopy * Remove breakpoint and add torch_index_override * Small fix * Fix typo * Replace asserts by explicit exceptions	2022-05-02 11:46:52 +02:00
Sanchit Gandhi	ff846e9b28	[FlaxGenerate] Fix bug in decoder_start_token_id (#17035 )	2022-05-02 11:05:27 +02:00
Manan Dey	eb877f1fd0	update docs of length_penalty (#17022 )	2022-05-02 11:01:18 +02:00
Omar U. Espejel	da47c264f9	Add translating guide (#17004 ) * Add translating guide	2022-04-30 17:43:38 -05:00
Yih-Dar	ede5e04191	Add a check on config classes docstring checkpoints (#17012 ) * Add the check * add missing ckpts * add a list to ignore * call the added check script * better regex pattern Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-30 10:40:46 +02:00
Sylvain Gugger	7152ed2bae	Result of new doc style with fixes (#17015 ) * Result of new doc style with fixes * Add last two files * Bump hf-doc-builder	2022-04-29 17:42:15 -04:00
Sylvain Gugger	18df440709	Replace dict/BatchEncoding instance checks by Mapping (#17014 ) * Replace dict/BatchEncoding instance checks by Mapping * Typo	2022-04-29 17:20:52 -04:00
Nicolas Patry	b8dffd1f3e	Revert "Updating variable names. (#16445 )" (#17011 ) This reverts commit `4f3a14e3c2`.	2022-04-29 12:26:45 -04:00
Nicolas Patry	4f3a14e3c2	Updating variable names. (#16445 )	2022-04-29 17:44:28 +02:00
tarzan	20fb5d51ea	Update README_zh-hans.md (#16977 )	2022-04-29 11:05:03 -04:00
Pavel Belevich	63fbed5c59	Make create_extended_attention_mask_for_decoder static method (#16893 )	2022-04-29 10:57:09 -04:00
Joao Gante	fb0ae12947	TF: XLA bad words logits processor and list of processors (#16974 )	2022-04-29 15:54:58 +01:00
Zachary Mueller	57e6464ac9	Update all require decorators to use skipUnless when possible (#16999 )	2022-04-29 08:55:38 -04:00
Yih-Dar	e952e049b4	use scale=1.0 in floats_tensor called in speech model testers (#17007 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-29 14:41:33 +02:00
Sylvain Gugger	e6f00a11d7	Update README to latest release (#16997 )	2022-04-28 14:17:44 -04:00
Zachary Mueller	3486a92a57	Fix savedir for by epoch (#16996 )	2022-04-28 13:49:45 -04:00
Yih-Dar	5af5735f62	set eos_token_id to None to generate until max length (#16989 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-28 19:47:38 +02:00
amyeroberts	01562dac7e	Rename a class to reflect framework pattern AutoModelXxx -> TFAutoModelXxx (#16993 )	2022-04-28 18:11:54 +01:00
conan1024hao	1be8d56ec6	Add parameter --config_overrides for run_mlm_wwm.py (#16961 ) * dd parameter --config_overrides for run_mlm_wwm.py * linter	2022-04-28 10:44:55 -04:00
Yih-Dar	1f9e862507	Update check_models_are_tested to deal with Windows path (#16973 ) * fix * Apply suggestions from code review Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-04-28 15:31:57 +02:00
Dat Quoc Nguyen	dced262409	Update tokenization_bertweet.py (#16941 ) The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e. not consistent with the vocab).	2022-04-27 16:54:31 -04:00
Yih-Dar	992996e9ca	Add -e flag to some GH workflow yml files (#16959 ) * Add -e flag * add check * create new keys * run python setup.py build install * add comments * change to develop Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-27 21:44:21 +02:00
Yih-Dar	596afb4297	Fix check_all_models_are_tested (#16970 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-27 21:18:29 +02:00
Sylvain Gugger	691cdbb7d7	Fix doc notebooks links (#16969 ) * Fix doc notebooks links * Remove missing section	2022-04-27 14:59:53 -04:00
Zachary Mueller	60e1d883f1	Fixup no_trainer save logic (#16968 ) * Fixup all examples	2022-04-27 14:46:49 -04:00
Sylvain Gugger	c79bbc3ba5	Fix multiple deletions of the same files in save_pretrained (#16947 ) * Fix multiple deletions of the same files in save_pretrained * Add is_main_process argument	2022-04-27 12:28:42 -04:00
Sylvain Gugger	bfbec17765	Fix add-new-model-like when model doesn't support all frameworks (#16966 )	2022-04-27 11:15:25 -04:00
Mishig Davaadorj	cf8a7c2490	Update custom_models.mdx (#16964 ) BertModelForSequenceClassification -> BertForSequenceClassification	2022-04-27 16:46:55 +02:00
Antoni Baum	5896b3ecce	Fix `distributed_concat` with scalar tensor (#16963 ) * Fix `distributed_concat` with scalar tensor * Update trainer_pt_utils.py	2022-04-27 10:26:22 -04:00
NielsRogge	084c38c59d	[HF Argparser] Fix parsing of optional boolean arguments (#16946 ) * Add fix * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-04-27 15:00:45 +02:00
Leonid Boytsov	c82e017aa9	Misc. fixes for Pytorch QA examples: (#16958 ) 1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed). 2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion). 3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions. 4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.	2022-04-27 08:51:39 -04:00
Yih-Dar	49d5bcb0f3	Fix HubertRobustTest PT/TF equivalence test on GPU (#16943 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-27 10:50:03 +02:00
NielsRogge	479fdc4925	Add semantic script, trainer (#16834 ) * Add first draft * Improve script and README * Improve README * Apply suggestions from code review * Improve script, add link to resulting model * Add corresponding test * Adjust learning rate	2022-04-27 10:12:18 +02:00
Anton Lozhkov	a4a88fa09f	[Research] Speed up evaluation for XTREME-S (#16785 ) * Avoid repeated per-lang filtering * Language groups and logits preprocessing * Style	2022-04-27 08:34:21 +02:00
Yongliang Shen	2d91e3c304	use original loaded keys to find mismatched keys (#16920 )	2022-04-26 17:29:52 -04:00

1 2 3 4 5 ...

9691 Commits