transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-07 14:50:07 +06:00

Author	SHA1	Message	Date
amyeroberts	3412f5979d	Use PyAV instead of Decord in examples (#21572 ) * Use PyAV instead of Decord * Get frame indices * Fix number of frames * Update src/transformers/models/videomae/image_processing_videomae.py * Fix up * Fix copies * Update timesformer doctests * Update docstrings	2023-03-02 12:30:38 +00:00
Yih-Dar	db572b3854	Use torch `1.13.1` in push/schedule CI (#21421 ) Use torch 1.13.1 in push/scheduled CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-02 14:58:52 +01:00
Yih-Dar	94db82573e	Fix (DeepSpeed) docker image build issue (#21002 ) * Fix docker image build issue * remove comment * Add comment * Update docker/transformers-pytorch-deepspeed-latest-gpu/Dockerfile Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2023-01-04 21:28:33 +01:00
Yih-Dar	1543cee7c8	Recompile `apex` in `DeepSpeed` CI image (#20788 ) Recompile apex in DeepSpeed CI image Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-15 21:35:27 +01:00
Yih-Dar	b1706f6908	Install video dependency for pipeline CI (#20777 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-15 18:47:05 +01:00
Yih-Dar	94f8e21c70	Install `torch-tensorrt 1.3.0` for DeepSpeed CI (#20764 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-14 17:30:36 +01:00
Yih-Dar	d994473b05	Uninstall `torch_tensorrt` in `DeepSpeed` CI image for now (#20758 ) Uninstall torch_tensorrt for now Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-13 22:25:47 +01:00
Yih-Dar	d4bf9ee1ff	Update CI to torch 1.13.0 (#20687 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-12 20:04:56 +01:00
Yih-Dar	147fa37fb1	pin TF 2.11 in docker files (#20642 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-07 15:46:48 +01:00
Yih-Dar	f68796bd60	Fix `natten` installation in docker file (#20632 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-06 22:23:06 +01:00
Yih-Dar	91182e3a70	Install `tensorflow_probability` for TF pipeline CI (#20586 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-05 16:07:25 +01:00
Yih-Dar	8639cfb4c2	Install `natten` with CUDA version (#20546 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-05 15:08:32 +01:00
Yih-Dar	dd6fb1319b	Add `natten` for CI (#20511 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-30 19:49:34 +01:00
Yih-Dar	f10cdba22e	Pin TF 2.10.1 for Push CI (#20319 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-18 18:24:35 +01:00
Bartosz Szmelczynski	78a471ff71	Fix tapas scatter (#20149 ) * First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 01:04:26 -05:00
raghavanone	7829c890db	Change the import of kenlm from github to pypi (#19770 ) * Change the import of kenlm from github to pypi * Change the import of kenlm from github to pypi in circleci config * Fix code quality issues * Fix isort issue, add kenlm in extras for audio * Add kenlm to deps * Add kenlm to deps * Commit 'make fixup' changes * Remove version from kenlm deps * commit make fixup changes * Remove manual installation of kenlm * Remove manual installation of kenlm * Remove manual installation of kenlm	2022-10-26 17:06:46 +02:00
Yih-Dar	15fd39ea0e	Install tf2onnx dev version (#19755 ) * pin tf2onnx<=1.12.0 * Install tf2onnx main * Pin to a specific commit Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-20 20:24:39 +02:00
Yih-Dar	d7dc774a79	Fix `TFGroupViT` CI (#19461 ) * Fix TFGroupViT CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-11 14:29:15 +02:00
Yih-Dar	16242e1bf0	Run `torchdynamo` tests (#19056 ) * Enable torchdynamo tests * make style Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-15 11:10:16 -07:00
Joao Gante	1182b945a6	TF: TF 2.10 unpin + related onnx test skips (#18995 )	2022-09-12 19:30:27 +01:00
Sylvain Gugger	a26114777e	Revert "TF: unpin maximum TF version (#18917 )" (#18972 ) This reverts commit `d8cf3b2087`.	2022-09-10 09:11:46 -04:00
Joao Gante	d8cf3b2087	TF: unpin maximum TF version (#18917 )	2022-09-10 13:33:01 +01:00
Yih-Dar	6690ba3f4d	pin TF 2.9.1 for self-hosted CIs (#18925 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-07 19:46:14 +02:00
Yih-Dar	ecdf9b06bc	Remove cached torch_extensions on CI runners (#18868 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-02 18:17:58 +02:00
Yih-Dar	84beb8a49b	Unpin detectron2 (#18727 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-23 11:10:07 +02:00
Yih-Dar	30992ef0d9	[Hotfix] pin detectron2 5aeb252 to avoid test fix (#18701 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-20 00:37:38 +02:00
Younes Belkada	6d175c1129	[bnb] Minor modifications (#18631 ) * bnb minor modifications - refactor documentation - add troubleshooting README - add PyPi library on DockerFile * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * put in one block - put bash instructions in one block * update readme - refactor a bit hardware requirements * change text a bit * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * apply suggestions Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * add link to paper * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update tests/mixed_int8/README.md * Apply suggestions from code review * refactor a bit * add instructions Turing & Amperer Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * add A6000 * clarify a bit * remove small part * Update tests/mixed_int8/README.md Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2022-08-17 00:48:10 +02:00
Yih-Dar	510c2a0b32	Change scheduled CIs to use torch 1.12.1 (#18644 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-16 13:41:37 +02:00
Younes Belkada	4a51075a96	`bitsandbytes` - `Linear8bitLt` integration into `transformers` models (#17901 ) * first commit * correct replace function * add final changes - works like charm! - cannot implement tests yet - tested * clean up a bit * add bitsandbytes dependencies * working version - added import function - added bitsandbytes utils file * small fix * small fix - fix import issue * fix import issues * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit - move bitsandbytes utils to utils - change comments on functions * reformat docstring - reformat docstring on init_empty_weights_8bit * Update src/transformers/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * revert bad formatting * change to bitsandbytes * refactor a bit - remove init8bit since it is useless * more refactoring - fixed init empty weights issue - added threshold param * small hack to make it work * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * revmoe the small hack * modify utils file * make style + refactor a bit * create correctly device map * add correct dtype for device map creation * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions - remove with torch.grad - do not rely on Python bool magic! * add docstring - add docstring for new kwargs * add docstring - comment `replace_8bit_linear` function - fix weird formatting * - added more documentation - added new utility function for memory footprint tracking - colab demo to add * few modifs - typo doc - force cast into float16 when load_in_8bit is enabled * added colab link * add test architecture + docstring a bit * refactor a bit testing class * make style + refactor a bit * enhance checks - add more checks - start writing saving test * clean up a bit * male style * add more details on doc * add more tests - still needs to fix 2 tests * replace by "or" - could not fix it from GitHub GUI Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor a bit testing code + add readme * make style * fix import issue * Update src/transformers/modeling_utils.py Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com> * add few comments * add more doctring + make style * more docstring * raise error when loaded in 8bit * make style * add warning if loaded on CPU * add small sanity check * fix small comment * add bitsandbytes on dockerfile * Improve documentation - improve documentation from comments * add few comments * slow tests pass on the VM but not on the CI VM * Fix merge conflict * make style * another test should pass on a multi gpu setup * fix bad import in testing file * Fix slow tests - remove dummy batches - no more CUDA illegal memory errors * odify dockerfile * Update docs/source/en/main_classes/model.mdx * Update Dockerfile * Update model.mdx * Update Dockerfile * Apply suggestions from code review * few modifications - lm head can stay on disk/cpu - change model name so that test pass * change test value - change test value to the correct output - torch bmm changed to baddmm in bloom modeling when merging * modify installation guidelines * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace `n`by `name` * merge `load_in_8bit` and `low_cpu_mem_usage` * first try - keep the lm head in full precision * better check - check the attribute `base_model_prefix` instead of computing the number of parameters * added more tests * Update src/transformers/utils/bitsandbytes.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Merge branch 'integration-8bit' of https://github.com/younesbelkada/transformers into integration-8bit * improve documentation - fix typos for installation - change title in the documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>	2022-08-10 09:13:36 +02:00
NielsRogge	82bb682643	[VideoMAE] Add model to doc tests (#18523 ) * Add videomae to doc tests * Add pip install decord Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-08 19:28:51 +02:00
Yih-Dar	f681437203	Enable Past CI (#17919 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-05 18:08:36 +02:00
Yih-Dar	b089cca347	PyTorch 1.12.0 for scheduled CI (#17949 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-29 19:32:19 +02:00
Yih-Dar	9fe2403bc5	Use explicit torch version in deepspeed CI (#17942 ) * use explicit torch version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-29 18:20:34 +02:00
Yih-Dar	ca169dbdf1	Enable PyTorch nightly build CI (#17335 ) * nightly build pytorch CI * fix working dir * change time and event name Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-17 16:42:27 +02:00
Yih-Dar	df15703b42	Fix doc builder Dockerfile (#17435 ) * Fix doc builder Dockerfile Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-14 09:58:48 +02:00
Yih-Dar	da0bed5f4a	Pre-build DeepSpeed (#17607 ) * pre-build deepspeed Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-09 23:02:33 +02:00
Yih-Dar	264128cb9d	Explicit versions in docker files (#17586 ) * Update docker file Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-08 15:04:22 +02:00
Joao Gante	78c695eb62	CLI: add stricter automatic checks to `pt-to-tf` (#17588 ) * Stricter pt-to-tf checks; Update docker image for related tests * check all attributes in the output Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-08 10:45:10 +01:00
Yih-Dar	9aa230aa2f	Use latest stable PyTorch/DeepSpeed for Push & Scheduled CI (#17417 ) * update versions Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-07 11:53:05 +02:00
Yih-Dar	7198b63362	install dev. version of accelerate (#17243 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-13 13:47:09 -04:00
Stas Bekman	ce2fef2ad2	[trainer / deepspeed] fix hyperparameter_search (#16740 ) * [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI	2022-04-14 17:24:38 -07:00
Sylvain Gugger	867f3950fa	Rename master to main for notebooks links and leftovers (#16397 )	2022-03-25 09:12:23 -04:00
Lysandre Debut	c1000e703b	Dcoker images runtime -> devel (#16141 ) * Runtime -> Devel * Torch before DeepSpeed	2022-03-14 12:37:20 -04:00
Lysandre Debut	26426923b7	No self-hosted runner for dev documentation (#15710 )	2022-03-01 14:05:54 -05:00
Lysandre Debut	7ff9d450cd	Scatter should run on CUDA (#15872 )	2022-03-01 11:47:17 -05:00
Lysandre Debut	54f0db4066	Add PT + TF automatic builds (#15860 ) * Add PT + TF automatic builds * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Wrap up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-03-01 08:55:11 -05:00
Lysandre Debut	a0e3480699	[Test refactor 5/5] Build docker images (#15729 )	2022-02-23 15:48:19 -05:00
Sylvain Gugger	dabeb15292	Examples reorg (#11350 ) * Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-21 11:11:20 -04:00
Adrien David-Sivelle	98fb718577	Docker GPU Images: Add NVIDIA/apex to the cuda images with pytorch (#7598 ) - Use cuda:10.2 image instead of 10.1 (to address version mismatch warning with pytorch) - Use devel version that is built on the runtime and includes headers and development tools (was otherwise failing to build apex)	2020-10-06 15:23:32 +02:00
zcain117	1b8a7ffcfd	Add setup for TPU CI to run every hour. (#6219 ) * Add setup for TPU CI to run every hour. * Re-organize config.yml Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-08-07 11:17:07 -04:00

1 2

56 Commits