transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-19 04:28:26 +06:00

Author	SHA1	Message	Date
Quentin Meeus	5b72b3412b	Remove CLI spams with Whisper FeatureExtractor (#21267 ) * Remove CLI spams with Whisper FeatureExtractor Whisper feature extractor representation includes the MEL filters, a list of list that is represented as ~16,000 lines. This needlessly spams the command line. I added a `__repr__` method that replaces this list with a string "<array of shape (80, 201)>" * Remove mel_filters from to_dict output Credits to @ArthurZucker * remove unused import * update feature extraction tests for the changes in to_dict	2023-02-10 09:15:16 -05:00
Eugene Zapolsky	129011c20b	adding a tip for deepspeed integration in multi-node environment (#21459 ) * adding note concerning use_node_local_storage * overriding checkpoint.use_node_local_storage if save_on_each_node == True * add more content * add more content * improve * style --------- Co-authored-by: Stas Bekman <stas@stason.org>	2023-02-10 09:12:56 -05:00
Katie Le	21a2d900ec	Added with torch.no_grad() to Camembert integration test (#21544 ) add with torch.no_grad() to Camembert integration test Co-authored-by: Bibi <Bibi@katies-mac.local>	2023-02-10 10:58:29 +01:00
Younes Belkada	f83942684d	[`pipeline`] A simple fix for half-precision & 8bit models (#21479 ) * v1 fix * adapt from suggestions * make style * fix tests * add gpu tests * update docs * fix other tests * Apply suggestions from code review Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * better fix * make fixup * better example * revert changes * proposal * more elegant solution * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-10 10:26:17 +01:00
Sylvain Gugger	97d3390fc8	Skip failing test for now	2023-02-09 20:11:26 -05:00
Katie Le	23c146c38b	Added with torch.no_grad() to XLM-Roberta integration test (#21547 ) * added with torch.no_grad() to the integration tests and applied make style * added with torch.no_grad() to xlm roberta forward pass --------- Co-authored-by: Bibi <Bibi@katies-mac.local>	2023-02-09 21:49:54 +01:00
Sylvain Gugger	04b2f13c37	🚨🚨🚨 Enforce single model initialization (#21431 ) * Enforce single model initialization * Add OneFormer example for problem 3 * Do it the Stas way * Actually rename the uses... * Rewrite test * Try to change the test this way * Fix all init slow/fast tests * Break connection * Fix more tests * Fix test for initialization * Remove custom test * Quality * Fix last failing tests * The end?	2023-02-09 15:46:26 -05:00
Sylvain Gugger	2020ac4bd6	Fix from_pretrained API with config and state_dict (#21542 )	2023-02-09 15:44:02 -05:00
Sylvain Gugger	1efe9c0b24	Fix inclusion of non py files in package (#21546 ) * Fix inclusion of non py files in package * No need for the **	2023-02-09 14:15:10 -05:00
Sylvain Gugger	7927732ff8	Align BLIP-2 winit with others	2023-02-09 12:03:27 -05:00
NielsRogge	d7f1e7c009	Add BLIP-2 (#21441 ) * First draft * More improvements * More improvements * Improve conversion script * Convert all weights * Make forward pass work * Make logits match * More improvements * More improvements * More improvements * Use get_input_embeddings * Improve some more * Improve model tests * Improve model tests * More improvements * Fix processor * Update files * Update prepare_inputs_for_generation * More improvements * Fix copies * More fixes * Make fixup * More improvements * Add support for seq2seq language model * More improvements * Fix test * More improvements * Improve conversion script * Remove some todo's * Fix README's * Improve conversion script * Fix generation * Fix style and remove Blip2Model * Fix model outputs * More improvements * Set eos_token_id in config * Fix quality * Small improvements * Add processor tests * More improvements * Apply suggestions * Apply suggestions * Add integration test * Update image URL * Add integration test * Fix model_type * Update style * Improve docs * Add doc tests * Fix copies * Remove tests which are passing * Improve some more * Add tests for seq2seq language models * Minor fix * Convert more checkpoints * finalize CI * Fix blip and blip2 processors * add `accelerate` support for `blip2` * clean up * make style * Update conversion script * Update conversion script some more * Update organization * revert toc file * add blip-2 to toc file * Some more improvements * Fix docstring * Improve docs --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-02-09 16:52:11 +01:00
lee1jun	b31cee6727	fix typo in run_speech_recognition_ctc.py (#21528 ) Update run_speech_recognition_ctc.py There should be `# limitations under the License` line at the end of the documentation section.	2023-02-09 09:46:40 -05:00
Joao Gante	0d33381fad	Tag tests as slow ⌛ (#21537 ) begone slow tests	2023-02-09 14:46:15 +00:00
Victor Sonck	3a726777ca	Fix ClearML Integration to run in ClearML pipelines and external Tasks. (#21531 ) * Added clearml pipeline fix for when task is already initialized * Correctly initialize	2023-02-09 09:28:55 -05:00
Motoki Wu	17109ecfb8	Fix missing unfinished_sequences (#21529 ) fix missing unfinished_sequences	2023-02-09 09:06:22 -05:00
Joao Gante	2edf9a857b	Generate: TF `.generate()` can now be exported with dynamic length (#21474 )	2023-02-09 12:52:30 +00:00
Joao Gante	e69f9715eb	Generate: make TF `.generate()` signature == PT `.generate()` signature (#21525 )	2023-02-09 11:10:13 +00:00
Yih-Dar	c35bb6de54	Add `__len__` method to `_LazyAutoMapping` (#21522 ) Add `__len__` method to `_LazyAutoMapping` Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-08 20:35:14 +01:00
Motoki Wu	9960506cbe	Fix multiple `eos_token_id`s in model.generate(...) (#21461 ) * add tests with multiple eos_token_ids * make math.prod instead of sum * make fixup * fix long and also use np.prod since math.prod does not exist <python 3.8 * make fixup * add prod util * use prod util instead of np.prod * make fixup * previous .long location * use tensor ops * remove prod * remove prod * update device * make fixup * fix none	2023-02-08 13:48:46 -05:00
Nicolas Patry	06d940efc3	Fixing backward compatibility `image_processor` in pipeline. (#21513 )	2023-02-08 19:36:20 +01:00
Stas Bekman	8ea994d3c5	[tests] add missing `report_to none` (#21505 ) [tests] report_to none	2023-02-08 09:32:40 -08:00
Thomas Wang	98d5b72727	Update OPT conversion script to work for OPT-IML (#21519 )	2023-02-08 18:31:10 +01:00
Matthijs Hollemans	fe616f35c8	no more dummies for speech processors (#21517 )	2023-02-08 11:41:54 -05:00
Joao Gante	1d9c26a4b8	Generate: TF `compute_transition_scores` (#21341 )	2023-02-08 16:36:43 +00:00
Stefan Schweter	d3046dad80	[Doc] Minor URL fixes in PyTorch Text Classification Readme (#21511 ) docs: fix some references in PyTorch text classification readme	2023-02-08 09:39:52 -05:00
dependabot[bot]	e024cd715e	Bump cryptography from 36.0.2 to 39.0.1 in /examples/research_projects/decision_transformer (#21507 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 36.0.2 to 39.0.1. - [Release notes](https://github.com/pyca/cryptography/releases) - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/36.0.2...39.0.1) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-02-08 09:25:06 -05:00
Guillaume Klein	ca905ba28e	Exclude the madeup words from M2M100Tokenizer.vocab_size (#20976 )	2023-02-08 09:19:06 -05:00
Katie Le	cc1d0685b3	Wrap RemBert integration test forward passes with torch.no_grad() (#21503 ) added with torch.no_grad() to the integration tests and applied make style Co-authored-by: Bibi <Bibi@katies-mac.local>	2023-02-08 14:00:52 +01:00
Sylvain Gugger	5b67ab9924	Fix import in Accelerate for find_exec_bs (#21501 )	2023-02-07 16:45:59 -05:00
Prajwal Kailas	eb1771ef1f	Check for mapping/dict in distributed_concat function (#21500 ) check for mapping/dict in distributed_concat function Co-authored-by: prajwal967 <user.email>	2023-02-07 16:45:37 -05:00
Stefan Schweter	7e51a441e4	Add XLM-V to Model Doc (#21498 ) * doc: introduce new section for XLM-V model * doc: mention more details for XLM-V integration * docs: paper abstract in italics, model identifier for base model added * doc: mention new XLM-V support * auto: add XLM-V mapping * doc: run make fix-copies ;)	2023-02-07 16:43:19 -05:00
Adrian Sager La Ganga	a3034c7004	Add inverse sqrt learning rate scheduler (#21495 ) * added inverse sqrt lr scheduler * Updated get_scheduler in src/transformers/optimization.py * Updated src/transformers/__init__.py * Added inverse sqrt lr scheduler test * Updated docs/source/en/main_classes/optimizer_schedules.mdx * Ran style and quality scripts * Fix get_inverse_sqrt_schedule docstring * Comment implementation URL	2023-02-07 15:00:50 -05:00
Stas Bekman	b9af152efb	[tokenizer] sanitize saved config (#21483 ) * [tokenizer] sanitize saved config * rm config["name_or_path"] test	2023-02-07 10:51:45 -08:00
Sylvain Gugger	67d074874d	Cleanup quality (#21493 ) * Remove mentions of flake8/isort * Clean up inits * Deal with all other inits * Last special rule for dummy files	2023-02-07 12:27:31 -05:00
raghavanone	571fa585b6	Add limit_all_gathers option to fsdp_config and fix forward_prefetch bug (#21489 ) * Add limit_all_gathers option to fsdp_config and fix forward_prefetch bug * Fix black issue * Fix ruff failure * Incorporate PR feedbacks * Incorporate PR feedbacks * Incorporate PR feedbacks	2023-02-07 12:27:06 -05:00
Yih-Dar	479322bfaa	A new test to check config attributes being used (#21453 ) * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions * Update allowed cases - part 1 * Update allowed cases - part 2 * final --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-07 17:49:30 +01:00
Arthur	9e7f84a556	[OPT] Adds `GPT2TokenizerFast` to the list of tokenizer to use for OPT. (#20823 ) * Add ("opt", ("GPT2Tokenizer", "GPT2TokenizerFast" if is_tokenizers_available() else None)), * skip failing test * Add ("opt", ("GPT2Tokenizer", "GPT2TokenizerFast" if is_tokenizers_available() else None)), * skip failing test	2023-02-07 17:35:28 +01:00
raghavanone	8a303f527f	Sanity check the type of id2label and label2id arguments of from_pretrained for TokenClassification models (#21490 ) * Sanity check the type of id2label and label2id arguments of from_pretrained for TokenClassification models * Incorporate PR feedbacks * Incorporate PR feedbacks	2023-02-07 10:44:43 -05:00
Matt	28ec07d8ad	Typos/fixes to link syntax (#21450 ) * Typos/fixes to link syntax * Trying section headers * Add header formatting for Rule #3	2023-02-07 15:19:19 +00:00
Jeroen Van Der Donckt	bbe98ea9c3	🖊️ fix typo in pytorch semantic segmentation readme (#21492 )	2023-02-07 09:39:24 -05:00
Iulian Taiatu	8581fbaa6d	changed "ot" to "to" (#21488 )	2023-02-07 09:31:32 -05:00
Younes Belkada	fa0ae17958	[`Doc`] Fix int8 docs (#21487 ) fix int8 docs	2023-02-07 15:09:27 +01:00
Joao Gante	1e4cf8bb44	Generate: TF can now generate from embeddings in encoder-decoder models (#21475 )	2023-02-07 11:18:23 +00:00
Arthur	12eb528b5a	[CI ] Remove `past` in favor of `past_key_values` (#21443 ) * fix past renamed to past_key_value * update more `past` that were skipped * fixup * remove changes made to rag * refactor `_reorder_cache` to use `past_key_values` * fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache	2023-02-07 09:51:35 +01:00
Sylvain Gugger	5b49376202	Deprecate parallelize API (#21448 ) * Deprecate parallelize API * Add documentation * Fix copies	2023-02-06 19:39:13 -05:00
Sylvain Gugger	cc8407522a	Fix epoch number when resuming training (#21478 )	2023-02-06 19:34:34 -05:00
dependabot[bot]	35f93f299f	Bump oauthlib from 3.2.1 to 3.2.2 in /examples/research_projects/decision_transformer (#21481 ) Bump oauthlib in /examples/research_projects/decision_transformer Bumps [oauthlib](https://github.com/oauthlib/oauthlib) from 3.2.1 to 3.2.2. - [Release notes](https://github.com/oauthlib/oauthlib/releases) - [Changelog](https://github.com/oauthlib/oauthlib/blob/master/CHANGELOG.rst) - [Commits](https://github.com/oauthlib/oauthlib/compare/v3.2.1...v3.2.2) --- updated-dependencies: - dependency-name: oauthlib dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-02-06 18:27:14 -05:00
Sylvain Gugger	6f79d26442	Update quality tooling for formatting (#21480 ) * Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies	2023-02-06 18:10:56 -05:00
lewtun	b7bb2b59f7	Add tips for generation with Int8 models (#21424 ) * Add tips for generation with Int8 models * Empty commit to trigger CI * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/en/perf_infer_gpu_one.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-06 20:25:40 +01:00
Joao Gante	10056d898e	OPT: BLIP2-ready `prepare_inputs_for_generation` (#21477 )	2023-02-06 18:19:17 +00:00


15053 Commits