transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	3e2dd7f92d	Poc to use safetensors (#19175 ) * Poc to use safetensors * Typo * Final version * Add tests * Save with the right name! * Update tests/test_modeling_common.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Support for sharded checkpoints * Test from Hub part 1 * Test from hub part 2 * Fix regular checkpoint sharding * Bump for fixes Co-authored-by: Julien Chaumond <julien@huggingface.co>	2022-09-30 10:58:04 -04:00
Lucain	902d30b31a	Use `hf_raise_for_status` instead of deprecated `_raise_for_status` (#19244 ) * Use instead of from huggingface_hub * bump huggingface_hub to 0.10.0 + make deps_table_update	2022-09-29 08:58:39 -04:00
Nicolas Patry	d5848a574a	Allowing users to use the latest `tokenizers` release ! (#19139 ) * Allowing users to use the latest `tokenizers` release ! * Upgrading the versions table too.	2022-09-21 17:46:04 +02:00
Quentin Lhoest	66154a6c87	suppoer deps from github (#19141 )	2022-09-21 16:15:31 +02:00
Sylvain Gugger	114295c010	Refuse Datasets 2.5.0 while waiting for a patch	2022-09-21 09:37:53 -04:00
Sylvain Gugger	820cb97a3f	Organize test jobs (#19058 ) * Tests conditional run * Syntax * Deps * Try early exit * Another way * Test with no tests to run * Test all * Typo * Try this way * With tests to run * Mostly finished * Typo * With a modification in one file only * No change, no tests * Final cleanup * Address review comments	2022-09-16 09:19:51 -04:00
Sylvain Gugger	f7ce4f1ff7	Fix custom tokenizers test (#19052 ) * Fix CI for custom tokenizers * Add nightly tests * Run CI, run! * Fix paths * Typos * Fix test	2022-09-15 11:31:09 -04:00
Lysandre	16913b3c92	Dev version	2022-09-14 14:58:20 -04:00
Sylvain Gugger	3774010161	Automate check for new pipelines and metadata update (#19029 ) * Automate check for new pipelines and metadata update * Add Datasets to quality extra	2022-09-14 14:06:49 -04:00
Sylvain Gugger	a2a3afbc8d	PyTorch >= 1.7.0 and TensorFlow >= 2.4.0 (#19016 )	2022-09-14 07:19:02 -04:00
NielsRogge	59407bbeb3	Add Deformable DETR (#17281 ) * First draft * More improvements * Improve model, add custom CUDA code * Import torch before * Add script that imports custom layer * Add everything in new ops directory * Import custom layer in modeling file * Fix ARCHIVE_MAP typo * Creating the custom kernel on the fly. * Import custom layer in modeling file * More improvements * Fix CUDA loading * More improvements * Improve conversion script * Improve conversion script * Make it work until encoder_outputs * Make forward pass work * More improvements * Make logits match original implementation * Make implementation also support single_scale model * Add support for single_scale and dilation checkpoint * Add support for with_box_refine model * Support also two stage model * Improve tests * Fix more tests * Make more tests pass * Upload all models to the hub * Clean up some code * Improve decoder outputs * Rename intermediate hidden states and reference points * Improve model outputs * Move tests to dedicated folder * Improve model outputs * Fix retain_grad test * Improve docs * Clean up and make test_initialization pass * Improve variable names * Add copied from statements * Improve docs * Fix style * Improve docs * Improve docs, move tests to model folder * Fix rebase * Remove DetrForSegmentation from auto mapping * Apply suggestions from code review * Improve variable names and docstrings * Apply some more suggestions from code review * Apply suggestion from code review * better docs and variables names * hint to num_queries and two_stage confusion * remove asserts and code refactor * add exception if two_stage is True and with_box_refine is False * use f-strings * Improve docs and variable names * Fix code quality * Fix rebase * Add require_torch_gpu decorator * Add pip install ninja to CI jobs * Apply suggestion of @sgugger * Remove DeformableDetrForObjectDetection from auto mapping * Remove DeformableDetrModel from auto mapping * Add model to toctree * Add model back to mappings, skip model in pipeline tests * Apply @sgugger's suggestion * Fix imports in the init * Fix copies * Add CPU implementation * Comment out GPU function * Undo previous change * Apply more suggestions * Remove require_torch_gpu annotator * Fix quality * Add logger.info * Fix logger * Fix variable names * Fix initializaztion * Add missing initialization * Update checkpoint name * Add model to doc tests * Add CPU/GPU equivalence test * Add Deformable DETR to pipeline tests * Skip model for object detection pipeline Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Nouamane Tazi <nouamane98@gmail.com> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2022-09-14 11:45:21 +02:00
Joao Gante	1182b945a6	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
Sylvain Gugger	a26114777e	Revert "TF: unpin maximum TF version (#18917 )" (#18972 ) This reverts commit `d8cf3b2087`.	2022-09-10 09:11:46 -04:00
Joao Gante	d8cf3b2087	TF: unpin maximum TF version (#18917 )	2022-09-10 13:33:01 +01:00
Bram Vanroy	855dcae8bb	update black target version (#18955 ) * update black target version * add comment as per https://github.com/huggingface/transformers/pull/18955#issuecomment-1242081649 * revert change Will only update to 3.7 after black 2023 upgrade in January	2022-09-09 17:30:05 -04:00
Sylvain Gugger	38c3cd52fb	Clean up utils.hub using the latest from hf_hub (#18857 ) * Clean up utils.hub using the latest from hf_hub * Adapt test * Address review comment * Fix test	2022-09-02 10:30:06 -04:00
Albert Villanova del Moral	fafbb57df1	Pin rouge_score (#18247 ) * Pin rouge_score * Pin also in dependency_versions_table * Update excluded versions * Revert "Update excluded versions" This reverts commit `0d0362df30`. * Revert "Revert "Update excluded versions"" This reverts commit `66c47af8a6`.	2022-09-01 12:04:49 +02:00
Albert Villanova del Moral	a26c752353	Unpin fsspec (#18846 )	2022-09-01 10:20:15 +02:00
Sylvain Gugger	74690b62a1	Pin ffspec (#18837 ) * Pin ffspec * Typo	2022-08-31 19:04:04 +02:00
Joao Gante	fea4636cfa	Pin max tf version (#18818 )	2022-08-31 10:07:53 +02:00
Yih-Dar	ec8d26248f	unpin resampy (#18527 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 17:44:10 +02:00
Sylvain Gugger	70b0d4e193	Fix compatibility with 1.12 (#17925 ) * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 09:53:08 -04:00
Sylvain Gugger	c7849d9efc	Remove py.typed (#18485 )	2022-08-05 09:12:19 -04:00
Omar Sanseviero	a507908cd3	Update pinned hhub version (#18448 ) * Update pinned hhub version * Make style	2022-08-03 08:37:42 -04:00
Sylvain Gugger	941d233153	Fix ROUGE add example check and update README (#18398 ) * Fix ROUGE add example check and update README * Stay consistent in values	2022-08-01 11:14:49 -04:00
Sylvain Gugger	af1e6b4d87	Add evaluate to test dependencies (#18396 )	2022-08-01 08:55:44 -04:00
Lysandre	c89a592e87	Dev version	2022-07-27 17:13:57 +02:00
Sylvain Gugger	9bd3968509	Fix slow CI by pinning resampy (#18077 ) * Fix slow CI by pinning resampy * Actually put it in the speech dependencies	2022-07-08 10:51:24 -04:00
Sanchit Gandhi	ec07eccc7d	[Flax] Bump to v0.4.1 (#17966 )	2022-07-05 15:17:17 +01:00
Sylvain Gugger	5a3d0cbdda	Pin PyTorch while we fix compatibility with 1.12	2022-06-28 15:07:26 -04:00
Lysandre Debut	1dfa03f12b	Pin black to 22.3.0 to benefit from a stable --preview flag (#17918 )	2022-06-28 04:32:18 -04:00
Matt	ee0d001de7	Add a TF in-graph tokenizer for BERT (#17701 ) * Add a TF in-graph tokenizer for BERT * Add from_pretrained * Add proper truncation, option handling to match other tokenizers * Add proper imports and guards * Add test, fix all the bugs exposed by said test * Fix truncation of paired texts in graph mode, more test updates * Small fixes, add a (very careful) test for savedmodel * Add tensorflow-text dependency, make fixup * Update documentation * Update documentation * make fixup * Slight changes to tests * Add some docstring examples * Update tests * Update tests and add proper lowercasing/normalization * make fixup * Add docstring for padding! * Mark slow tests * make fixup * Fall back to BertTokenizerFast if BertTokenizer is unavailable * Fall back to BertTokenizerFast if BertTokenizer is unavailable * make fixup * Properly handle tensorflow-text dummies	2022-06-27 12:06:21 +01:00
Sourab Mangrulkar	21a772426d	Migrate HFDeepSpeedConfig from trfrs to accelerate (#17623 ) * Migrate HFDeepSpeedConfig from trfrs to accelerate * add `accelerate` to testing dep * addressing comments * addressing comments Using `_shared_state` and avoiding object creation. This is necessary as `notebook_launcher` in `launcers.py` checks `len(AcceleratorState._shared_state)>0` to throw an error. * resolving comments 1. Use simple API from accelerate to manage the deepspeed config integration 2. Update the related documentation * reverting changes and addressing comments * docstring correction * addressing nits * addressing nits * addressing nits 3 * bumping up the accelerate version to 0.10.0 * resolving import * update setup.py to include deepspeed dependencies * Update dependency_versions_table.py * fixing imports * reverting changes to CI dependencies for "run_tests_pipelines_tf" tests These changes didn't help with resolving the failures and I believe this needs to be addressed in another PR. removing `accelerate` as hard dependency Resolves issues related to CI Tests * adding `accelerate` as dependency for building docs resolves failure in Build PR Documentation test * adding `accelerate` as dependency in "dev" to resolve doc build issue * resolving comments 1. adding `accelerate` to extras["all"] 2. Including check for accelerate too before import HFDeepSpeedConfig from there Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * resolving comments Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-17 23:29:35 +05:30
Sylvain Gugger	7c6ec195ad	v4.21.0.dev0	2022-06-16 12:20:53 -04:00
Stas Bekman	2f59ad1609	[trainer/deepspeed] load_best_model (reimplement re-init) (#17151 ) * [trainer/deepspeed] load_best_model * to sync with DS PR #1947 * simplify * rework load_best_model test * cleanup * bump deepspeed>=0.6.5 Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>	2022-06-02 09:14:21 -07:00
Sylvain Gugger	f128ccb997	Clean README in post release job as well. (#17519 )	2022-06-02 07:44:03 -04:00
Sylvain Gugger	7535d92e71	Pin protobouf that breaks TensorBoard in PyTorch (#17440 )	2022-05-26 09:56:55 -04:00
Sylvain Gugger	56f50590d5	Use Accelerate in `from_pretrained` for big model inference (#17341 ) * Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2022-05-23 14:32:21 -04:00
Sylvain Gugger	3fd7de49f4	Pin dill to fix examples (#17368 ) * Pin dill for now * Try this version? * force install * Actually use dep in testing * Try a larger pin	2022-05-20 11:00:58 -04:00
Sylvain Gugger	afe5d42d8d	Black preview (#17217 ) * Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black	2022-05-12 16:25:55 -04:00
Lysandre Debut	5294fa12ee	Dev version	2022-05-12 11:04:23 -04:00
Stas Bekman	f861504466	[Deepspeed] add many more models to the model zoo test (#12695 ) * model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup	2022-05-10 08:22:42 -07:00
Zachary Mueller	2fbb237967	Add the auto_find_batch_size capability from Accelerate into Trainer (#17068 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop	2022-05-09 12:29:18 -04:00
lewtun	4bb1d0ec84	Skip RoFormer ONNX test if rjieba not installed (#16981 ) * Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI	2022-05-04 10:04:10 +02:00
Sylvain Gugger	1073f00d4e	Clean up setup.py (#17045 ) * Clean up setup.py * Trigger CI * Upgrade Python used	2022-05-02 12:58:17 -04:00
Lysandre Debut	30ca529902	Make the sacremoses dependency optional (#17049 ) * Make sacremoses optional * Pickle	2022-05-02 12:47:47 -04:00
Sylvain Gugger	7152ed2bae	Result of new doc style with fixes (#17015 ) * Result of new doc style with fixes * Add last two files * Bump hf-doc-builder	2022-04-29 17:42:15 -04:00
Sylvain Gugger	e6f00a11d7	Update README to latest release (#16997 )	2022-04-28 14:17:44 -04:00
Sylvain Gugger	dee6f01636	Pin Jax to last working release (#16808 ) * Pin Jax to last working release * Try lower * Try lower	2022-04-16 21:15:19 -04:00
Stas Bekman	ce2fef2ad2	[trainer / deepspeed] fix hyperparameter_search (#16740 ) * [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI	2022-04-14 17:24:38 -07:00

353 Commits