transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 21:00:08 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	f128ccb997	Clean README in post release job as well. (#17519 )	2022-06-02 07:44:03 -04:00
Sylvain Gugger	7535d92e71	Pin protobouf that breaks TensorBoard in PyTorch (#17440 )	2022-05-26 09:56:55 -04:00
Sylvain Gugger	56f50590d5	Use Accelerate in `from_pretrained` for big model inference (#17341 ) * Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2022-05-23 14:32:21 -04:00
Sylvain Gugger	3fd7de49f4	Pin dill to fix examples (#17368 ) * Pin dill for now * Try this version? * force install * Actually use dep in testing * Try a larger pin	2022-05-20 11:00:58 -04:00
Sylvain Gugger	afe5d42d8d	Black preview (#17217 ) * Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black	2022-05-12 16:25:55 -04:00
Lysandre Debut	5294fa12ee	Dev version	2022-05-12 11:04:23 -04:00
Stas Bekman	f861504466	[Deepspeed] add many more models to the model zoo test (#12695 ) * model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup	2022-05-10 08:22:42 -07:00
Zachary Mueller	2fbb237967	Add the auto_find_batch_size capability from Accelerate into Trainer (#17068 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop	2022-05-09 12:29:18 -04:00
lewtun	4bb1d0ec84	Skip RoFormer ONNX test if rjieba not installed (#16981 ) * Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI	2022-05-04 10:04:10 +02:00
Sylvain Gugger	1073f00d4e	Clean up setup.py (#17045 ) * Clean up setup.py * Trigger CI * Upgrade Python used	2022-05-02 12:58:17 -04:00
Lysandre Debut	30ca529902	Make the sacremoses dependency optional (#17049 ) * Make sacremoses optional * Pickle	2022-05-02 12:47:47 -04:00
Sylvain Gugger	7152ed2bae	Result of new doc style with fixes (#17015 ) * Result of new doc style with fixes * Add last two files * Bump hf-doc-builder	2022-04-29 17:42:15 -04:00
Sylvain Gugger	e6f00a11d7	Update README to latest release (#16997 )	2022-04-28 14:17:44 -04:00
Sylvain Gugger	dee6f01636	Pin Jax to last working release (#16808 ) * Pin Jax to last working release * Try lower * Try lower	2022-04-16 21:15:19 -04:00
Stas Bekman	ce2fef2ad2	[trainer / deepspeed] fix hyperparameter_search (#16740 ) * [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI	2022-04-14 17:24:38 -07:00
Lysandre Debut	a180efe7fd	Dev version	2022-04-06 11:08:12 -04:00
Lysandre Debut	53a4d6b115	Pin tokenizers version <0.13 (#16539 ) * Pin tokenizers version <0.13 * Style	2022-04-01 11:53:18 -04:00
Sylvain Gugger	473709fc76	Use doc builder styler (#16412 ) * Config update * Use doc-builder styler * Cleanup * Adapt import * We need it there too!	2022-03-28 07:45:18 -04:00
Yih-Dar	9d88be5778	bump cookiecutter version (#16387 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-24 11:08:31 -04:00
Lysandre Debut	eca77f4719	Updates the default branch from master to main (#16326 ) * Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-23 03:46:59 -04:00
Suraj Patil	b25b92ac4f	update jax version and re-enable some tests (#16254 )	2022-03-18 16:45:39 +01:00
Stas Bekman	580dd87c55	[Deepspeed] add support for bf16 mode (#14569 ) * [WIP] add support for bf16 mode * prep for bf16 * prep for bf16 * fix; zero2/bf16 is ok * check bf16 is available * test fixes * enable zero3_bf16 * config files * docs * split stage_dtype; merge back to non-dtype-specific config file * fix doc * cleanup * cleanup * bfloat16 => bf16 to match the PR changes * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/ * test fixes/skipping * move * fix * Update docs/source/main_classes/deepspeed.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * backticks * cleanup * cleanup * cleanup * new version * add note about grad accum in bf16 Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-11 17:53:53 -08:00
Sylvain Gugger	79d28e80b6	v4.18.0.dev.0	2022-03-03 10:19:58 -05:00
Lysandre Debut	54f0db4066	Add PT + TF automatic builds (#15860 ) * Add PT + TF automatic builds * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Wrap up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-01 08:55:11 -05:00
SaulLu	e93763d420	fix CLIP fast tokenizer and change some properties of the slow version (#15067 ) Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-02-18 10:21:30 +01:00
Alex Hedges	e1cbc073bf	Require tokenizers>=0.11.1 (#15266 ) `tokenizers` version that supports the feature to choose the direction of truncation	2022-02-15 11:46:12 +01:00
Lysandre Debut	7732d0fe7a	Upgrade black to version ~=22.0 (#15565 ) * Upgrade black to version ~=22.0 * Check copies * Fix code	2022-02-09 09:28:57 -05:00
Stas Bekman	297602c7f4	[deepspeed] saving checkpoint fallback when fp16 weights aren't saved (#14948 ) * [deepspeed] saving checkpoint fallback when fp16 weights aren't saved * Bump required deepspeed version to match usage when saving checkpoints * update version Co-authored-by: Mihai Balint <balint.mihai@gmail.com>	2022-01-28 11:05:47 -08:00
Lysandre	eab338104d	Docs for version v4.16.0	2022-01-27 13:11:51 -05:00
Lysandre	f87db5e412	Release: v4.16.0	2022-01-27 13:06:33 -05:00
Patrick von Platen	497346d07e	[ASR pipeline] correct with lm pipeline (#15200 ) * [ASR pipeline] correct with lm pipeline * improve error	2022-01-18 15:36:22 +01:00
Sylvain Gugger	6f8e644f09	Mark bad tokenizers version (#15188 )	2022-01-17 15:20:58 -05:00
Joao Gante	ebc4edfe7a	update from keras2onnx to tf2onnx (#15162 )	2022-01-14 17:35:39 +00:00
Nicolas Patry	08cb5718ec	Enabling `tokenizers` upgrade. (#14941 ) * Enabling `tokenizers` upgrade. * Moved ugly comment. * Tokenizers==0.11.1 needs an update to keep borrow checker happy in highly contiguous calls. * Support both 0.11.1 and 0.11.0	2021-12-30 17:30:58 +01:00
Patrick von Platen	fa39ff9fc4	Docs for v4.16.0dev0	2021-12-22 20:39:44 +01:00
Patrick von Platen	05fa1a7ac1	Release: v4.15.0	2021-12-22 18:43:15 +01:00
Patrick von Platen	c4a96cecbc	Wav2Vec2 meets phonemes (#14353 ) * up * add tokenizer * improve more * finish tokenizer * finish * adapt speech recognition script * adapt convert * more fixes * more fixes * update phonemizer wav2vec2 * better naming * fix more tests * more fixes swedish * correct tests * finish * improve script * remove file * up * lets get those 100 model architectures until the end of the month * make fix-copies * correct more * correct script * more fixes * more fixes * add to docs * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace assert * fix copies * fix docs * new try docs * boom boom * update * add phonemizer to audio tests * make fix-copies * up * upload models * some changes * Update tests/test_tokenization_wav2vec2_phoneme.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * more fixes * remove @ Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2021-12-17 19:56:44 +01:00
Sylvain Gugger	5061a9fd55	Post sphinx-clean up and contributing guide updates (#14790 ) * Clean up sphinx * Update contributing guide * Update docs README * No example title * Fix copies * Update CONTRIBUTING.md Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-12-16 09:29:26 -05:00
Lysandre	7c9c41f43c	Docs for v4.14.0	2021-12-15 18:29:53 +01:00
Lysandre	960d8cb41d	Release: v4.14.0	2021-12-15 18:20:35 +01:00
Lysandre	ab31b3e41b	Docs for v4.14.0dev0	2021-12-09 17:09:23 +01:00
Lysandre	4da3a696e4	Release: v4.13.0	2021-12-09 16:55:21 +01:00
Patrick von Platen	961732c276	[Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339 ) * up * up * up * make it cleaner * correct * make styhahalal * add more tests * finish * small fix * make style * up * tryout to solve cicrle ci * up * fix more tests * fix more tests * apply sylvains suggestions * fix import * correct docs * add pyctcdecode only to speech tests * fix more tests * add tf, flax and pt tests * add pt * fix last tests * fix more tests * Apply suggestions from code review * change lines * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * correct tests * correct tests * add doc string Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2021-12-08 12:07:54 +01:00
Stas Bekman	956a483173	[deepspeed] zero inference (#14253 ) * [deepspeed] zero inference * only z3 makes sense for inference * fix and style * docs * rework * fix test * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * responding to suggestions Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-11-23 14:09:15 -08:00
Lysandre Debut	331c3d2aa0	Add GitPython to quality tools (#14459 ) * Update setup.py * Update setup.py * Update setup.py * Remove GitPython install	2021-11-19 08:43:48 -05:00
Sylvain Gugger	b567510cff	Debug doc (#14424 ) * Create branch for tests * Pin first upgrade * Really pin * Polish fix	2021-11-16 18:58:07 -05:00
Sylvain Gugger	46d0cdae40	Support for TF >= 2.7 (#14345 )	2021-11-09 18:49:29 -05:00
Suraj Patil	85a4bda4f4	bump flax version (#14343 )	2021-11-09 22:15:22 +05:30
Sylvain Gugger	a14d62b0b1	Pin TF until tests are fixed (#14283 ) * Pin TF until tests are fixed * Also pin TF CPU	2021-11-04 21:15:42 -04:00
Matt	b90a48f654	Removing Keras version pinning (#14280 ) * Removing Keras version pinning * make fixup	2021-11-04 17:58:28 +00:00

1 2 3 4 5 ...

318 Commits