transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-23 14:29:01 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	b0520f594c	Skip failing tests	2022-07-11 10:16:54 -04:00
Yulv-git	95113d1365	Fix some typos. (#17560 ) * Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * Fix typo. Signed-off-by: Yulv-git <yulvchi@qq.com> * make fixup.	2022-07-11 05:00:13 -04:00
Yih-Dar	6f0723a9be	Restore original task in test_warning_logs (#17985 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-01 20:44:27 +02:00
Aaron Pham	49cd736a28	feat: add pipeline registry abstraction (#17905 ) * feat: add pipeline registry abstraction - added `PipelineRegistry` abstraction - updates `add_new_pipeline.mdx` (english docs) to reflect the api addition - migrate `check_task` and `get_supported_tasks` from transformers/pipelines/__init__.py to transformers/pipelines/base.py#PipelineRegistry.{check_task,get_supported_tasks} Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> * fix: update with upstream/main chore: Apply suggestions from sgugger's code review Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * chore: PR updates - revert src/transformers/dependency_versions_table.py from upstream/main - updates pipeline registry to use global variables Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> * tests: add tests for pipeline registry Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> * tests: add test for output warning. Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> * chore: fmt and cleanup unused imports Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> * fix: change imports to top of the file and address comments Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-30 12:11:08 -04:00
Patrick von Platen	e4d2588573	[Pipelines] Add revision tag to all default pipelines (#17667 ) * trigger test failure * upload revision poc * Update src/transformers/pipelines/base.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * up * add test * correct some stuff * Update src/transformers/pipelines/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * correct require flag Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-30 16:37:18 +02:00
Mishig Davaadorj	77b76672e2	Fix img seg tests (load checkpoints from `hf-internal-testing`) (#17939 ) * Revert "Skip failing test until they are fixed." This reverts commit `8f400775fc`. * Use `tiny-detr` checkpts from `hf-internal-testing`	2022-06-29 10:19:37 -04:00
Sylvain Gugger	8f400775fc	Skip failing test until they are fixed.	2022-06-29 09:11:29 -04:00
Nicolas Patry	776855c752	Fixing a regression with `return_all_scores` introduced in #17606 (#17906 ) Fixing a regression with `return_all_scores` introduced in #17606 - The legacy test actually tested `return_all_scores=False` (the actual default) instead of `return_all_scores=True` (the actual weird case). This commit adds the correct legacy test and fixes it. Tmp legacy tests. Actually fix the regression (also contains lists) Less diffed code.	2022-06-28 17:24:45 -04:00
Daniel Stancl	a72f1c9f5b	Add `LongT5` model (#16792 ) * Initial commit * Make some fixes * Make PT model full forward pass * Drop TF & Flax implementation, fix copies etc * Add Flax model and update some corresponding stuff * Drop some TF things * Update config and flax local attn * Add encoder_attention_type to config * . * Update docs * Do some cleansing * Fix some issues -> make style; add some docs * Fix position_bias + mask addition + Update tests * Fix repo consistency * Fix model consistency by removing flax operation over attn_mask * [WIP] Add PT TGlobal LongT5 * . * [WIP] Add flax tglobal model * [WIP] Update flax model to use the right attention type in the encoder * Fix flax tglobal model forward pass * Make the use of global_relative_attention_bias * Add test suites for TGlobal model * Fix minor bugs, clean code * Fix pt-flax equivalence though not convinced with correctness * Fix LocalAttn implementation to match the original impl. + update READMEs * Few updates * Update: [Flax] improve large model init and loading #16148 * Add ckpt conversion script accoring to #16853 + handle torch device placement * Minor updates to conversion script. * Typo: AutoModelForSeq2SeqLM -> FlaxAutoModelForSeq2SeqLM * gpu support + dtype fix * Apply some suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * * Remove (de)parallelize stuff * Edit shape comments * Update README.md * make fix-copies * Remove caching logic for local & tglobal attention * Apply another batch of suggestions from code review * Add missing checkpoints * Format converting scripts * Drop (de)parallelize links from longT5 mdx * Fix converting script + revert config file change * Revert "Remove caching logic for local & tglobal attention" This reverts commit 2a619828f6ddc3e65bd9bb1725a12b77fa883a46. * Stash caching logic in Flax model * Make side relative bias used always * Drop caching logic in PT model * Return side bias as it was * Drop all remaining model parallel logic * Remove clamp statements * Move test files to the proper place * Update docs with new version of hf-doc-builder * Fix test imports * Make some minor improvements * Add missing checkpoints to docs * Make TGlobal model compatible with torch.onnx.export * Replace some np.ndarray with jnp.ndarray * Fix TGlobal for ONNX conversion + update docs * fix _make_global_fixed_block_ids and masked neg value * update flax model * style and quality * fix imports * remove load_tf_weights_in_longt5 from init and fix copies * add slow test for TGlobal model * typo fix * Drop obsolete is_parallelizable and one warning * Update __init__ files to fix repo-consistency * fix pipeline test * Fix some device placements * [wip]: Update tests -- need to generate summaries to update expected_summary * Fix quality * Update LongT5 model card * Update (slow) summarization tests * make style * rename checkpoitns * finish * fix flax tests Co-authored-by: phungvanduy <pvduy23@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: patil-suraj <surajp815@gmail.com>	2022-06-13 22:36:58 +02:00
Sijun He	66336dc183	Add Visual Question Answering (VQA) pipeline (#17286 ) * wip * rebase * all tests pass * rebase * ready for PR * address comments * fix styles * add require_torch to pipeline test * remove remote image to improve CI consistency * address comments; fix tf/flax tests * address comments; fix tf/flax tests * fix tests; add alias * repo consistency tests * Update src/transformers/pipelines/visual_question_answering.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * address comments * Update src/transformers/pipelines/visual_question_answering.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * merge * Update src/transformers/models/auto/modeling_auto.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * merge Co-authored-by: Sijun He <sijunhe@Sijuns-MacBook-Pro.local> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-13 07:49:44 -04:00
Nicolas Patry	c38f4e1f1c	Running a pipeline of `float16`. (#17637 ) When we're preparing the tensors for CPU for postprocessing, we need to upgrade the `float16` to `float32` since CPUs don't have instructions for `[b]float16`.	2022-06-09 19:04:42 +02:00
Nicolas Patry	2351729f7d	Adding `top_k` argument to `text-classification` pipeline. (#17606 ) * Adding `top_k` and `sort` arguments to `text-classification` pipeline. - Deprecate `return_all_scores` as `top_k` is more uniform with other pipelines, and a superset of what `return_all_scores` can do. BC is maintained though. `return_all_scores=True` -> `top_k=None` `return_all_scores=False` -> `top_k=1` - Using `top_k` will imply sorting the results, but using no argument will keep the results unsorted for backward compatibility. * Remove `sort`. * Fixing the test. * Remove bad doc.	2022-06-09 18:33:10 +02:00
Nicolas Patry	2b282296f1	Adding `batch_size` test to QA pipeline. (#17330 )	2022-05-19 14:28:12 -04:00
Nicolas Patry	a4386d7e40	[BC] Fixing usage of text pairs (#17324 ) * [BC] Fixing usage of text pairs The BC is actually preventing users from misusing the pipeline since users could have been willing to send text pairs and the pipeline would instead understand the thing as a batch returning bogus results. The correct usage of text pairs is preserved in this PR even when that makes the code clunky. Adds support for {"text":..,, "text_pair": ...} inputs for both dataset iteration and more explicit usage to pairs. * Updating the doc. * Update src/transformers/pipelines/text_classification.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/text_classification.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/pipelines/test_pipelines_text_classification.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * quality. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-05-19 10:29:16 +02:00
Nicolas Patry	2cb2ea3fa1	Accepting real pytorch device as arguments. (#17318 ) * Accepting real pytorch device as arguments. * is_torch_available.	2022-05-18 10:06:24 -04:00
Sylvain Gugger	afe5d42d8d	Black preview (#17217 ) * Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black	2022-05-12 16:25:55 -04:00
Nicolas Patry	6d80c92c77	LogSumExp trick `question_answering` pipeline. (#17143 ) * LogSumExp trick `question_answering` pipeline. * Adding a failing test.	2022-05-10 10:03:55 +02:00
Yih-Dar	a59eb349c5	fix missing "models" in pipeline test module (#17090 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-05 16:12:01 +02:00
Nicolas Patry	6620f60c0a	Long QuestionAnsweringPipeline fix. (#16778 ) * Temporary commit witht the long QA fix. * Adding slow tests covering this fix. * Removing fast test as it doesn't fail anyway.	2022-04-21 09:59:25 +02:00
Nicolas Patry	e13a91fe60	Fixing return type tensor with `num_return_sequences>1`. (#16828 ) * Fixing return type tensor with `num_return_sequences>1`. * Nit.	2022-04-20 16:11:51 +02:00
Nicolas Patry	195fbbb6cf	Enabling `Tapex` in table question answering pipeline. (#16663 ) * Enabling `Tapex` in table question answering pipeline. * Questions are independant for Tapex, making the test respect that. * Missing extra space.	2022-04-14 09:06:14 +02:00
Nicolas Patry	a192f61e08	Change the chunk_iter function to handle (#16730 ) * Change the chunk_iter function to handle the subtle cases where the last chunk gets ignored since all the data is in the `left_strided` data. We need to remove the right striding on the previous item. * Remove commented line.	2022-04-12 18:25:02 +02:00
Nicolas Patry	ecb4662d17	Attention mask is important in the case of batching... (#16222 ) * Attention mask is important in the case of batching... * Improve the fix. * Making the sentence different enough that they exhibit different predictions.	2022-03-18 10:02:12 +01:00
Nicolas Patry	f4e4ad34cc	Add `ForInstanceSegmentation` models to `image-segmentation` pipelines (#15937 ) * Adding ForInstanceSegmentation to pipelines. * Last fix `category_id` renamed to `label_id`. * Can't be none no more. * No `is_thing_map` anymore.	2022-03-09 10:19:05 +01:00
Nicolas Patry	7ade7c1794	Updating the slow tests: (#15893 ) Linked to https://github.com/huggingface/transformers/pull/15826	2022-03-04 12:32:19 +01:00
Nicolas Patry	a6e3b17981	Re-enabling all fast pipeline tests. (#15924 )	2022-03-04 09:53:00 +01:00
Nicolas Patry	3822e4a563	Enabling MaskFormer in pipelines (#15917 ) * Enabling MaskFormer in ppipelines No AutoModel though :( * Ooops local file.	2022-03-03 16:31:41 +01:00
Nicolas Patry	b693cbf99c	The tests were not updated after the addition of `torch.diag` (#15890 ) in the scoring (which is more correct)	2022-03-03 15:33:49 +01:00
Nicolas Patry	6e57a56987	Adding timestamps for CTC with LM in ASR pipeline. (#15863 ) * Adding timestamps for CTC with LM in ASR pipeline. * iRemove print. * Nit change.	2022-03-02 10:49:05 +01:00
Nicolas Patry	97f9b8a27b	Fixing the timestamps with chunking. (#15843 ) * Fixing the timestamps with chunking. * The changes modified (and fixed) the striding tests. * Adding a tokenizer test. * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Defense -> comment. * Update src/transformers/models/wav2vec2/tokenization_wav2vec2.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-28 21:00:21 +01:00
Nicolas Patry	ad0d7d1745	Adding the option to return_timestamps on pure CTC ASR models. (#15792 ) * Adding the option to return_timestamps on pure CTC ASR models. * Remove `math.prod` which was introduced in Python 3.8 * int are not floats. * Reworking the PR to support "char" vs "word" output. * Fixup! * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/pipelines/automatic_speech_recognition.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Quality. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-25 14:06:45 +01:00
Sylvain Gugger	074645e32a	Fix semantic segmentation pipeline test (#15826 )	2022-02-25 09:21:29 +01:00
Lysandre Debut	29c10a41d0	[Test refactor 1/5] Per-folder tests reorganization (#15725 ) * Per-folder tests reorganization Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Stas Bekman <stas@stason.org>	2022-02-23 15:46:28 -05:00

1 2 3

133 Commits