transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 14:20:04 +06:00

Author	SHA1	Message	Date
Suzana Ilić	b440b8d1ce	Added talks (#12415 )	2021-06-29 16:01:16 +01:00
Shamane Siri	5257818e68	minor fixes in original RAG training (#12395 )	2021-06-29 13:39:48 +01:00
Patrick von Platen	31c3e7e75b	[Flax] Add T5 pretraining script (#12355 ) * fix_torch_device_generate_test * remove @ * add length computatan * finish masking * finish * upload * fix some bugs * finish * fix dependency table * correct tensorboard * Apply suggestions from code review * correct processing * slight change init * correct some more mistakes * apply suggestions * improve readme * fix indent * Apply suggestions from code review Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * correct tokenizer * finish * finish * finish * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co> Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>	2021-06-28 20:11:29 +01:00
Patrick von Platen	27b6ac4611	Update README.md	2021-06-28 17:22:10 +01:00
Patrick von Platen	89b57a6669	[Flax community event] Add more description to readme (#12398 ) * fix_torch_device_generate_test * remove @ * boom boom * correct typos * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review Co-authored-by: Suzana Ilić <io.suzanai@gmail.com> * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Suzana Ilić <io.suzanai@gmail.com>	2021-06-28 17:18:42 +01:00
Stas Bekman	4a872caef4	remove extra white space from log format (#12360 )	2021-06-25 13:20:14 -07:00
Vasudev Gupta	332a245861	Add FlaxBigBird QuestionAnswering script (#12233 ) * port bigbird script * adapt script a bit * change location * adapt more * save progress * init commit * style * dataset script tested * readme add	2021-06-25 18:05:48 +01:00
Patrick von Platen	aa550c4a11	Update README.md	2021-06-25 11:55:51 +01:00
Marc van Zee	f2c4ce7e33	Add flax/jax quickstart (#12342 )	2021-06-24 17:04:18 +01:00
Patrick von Platen	44739c8180	[Flax/JAX] Add how to propose projects markdown (#12311 ) * fix_torch_device_generate_test * remove @ * finish * make style	2021-06-23 14:50:35 +01:00
Patrick von Platen	64029abe4c	[Flax] Main doc for event orga (#12305 ) * fix_torch_device_generate_test * remove @ * push * finish * some typos * add more info on communication * add suggestions	2021-06-22 18:02:52 +01:00
Vishal Burman	b53bc55ba9	Fix for making student ProphetNet for Seq2Seq Distillation (#12130 ) * make_student.py: fix to make student ProphetNet * reformat	2021-06-21 09:36:44 -04:00
Stas Bekman	88e84186e5	[style] consistent nn. and nn.functional: part 4 `examples` (#12156 ) * consistent nn. and nn.functional: p4 examples * restore	2021-06-14 12:28:24 -07:00
Stas Bekman	61e191987d	rm require_version_examples (#12088 )	2021-06-09 11:02:52 -07:00
Anton Lozhkov	d472bd7b18	Wav2Vec2 Pretraining (#11306 ) * Working quantizer forward * Working quantizer forward * Clean up unused model parts, test reproducibility * Working quantizer forward * Clean up unused model parts, test reproducibility * Remove custom outputs from the shared ones * correct conversion * correct bug * add first pretrain script * save intermediate * static shapes * save intermediate * finish first pretrain script version * more refactor * remove wanddb * refactor more * improve test * correct perplexity compute bug * finish model implementation * add to docs * finish docs * finish pretraining script * finish pretraining script * remove wandb * finish PR for merge * finish config * finish * make deepspeed work * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions * fix flaky test Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-09 18:40:56 +01:00
Stas Bekman	d14e0af274	sync LayerDrop for Wav2Vec2Encoder + tests (#12076 )	2021-06-09 13:21:03 +01:00
Stas Bekman	11d86d3de4	[Deepspeed Wav2vec2] integration (#11638 ) * wip * wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044 * cleanup * workaround * working 5/8 modes * solve fp32 distributed zero3 * style * sync * sync * rework * deprecation * cleanup * https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged * clean up * add a guide * more prose * more prose * fix * more prose * sub_group_size was too big * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor * bug fix * make the true check explicit * new deepspeed release Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-08 12:32:03 -07:00
Mario Šaško	f5eec0d8e9	Replace legacy tensor.Tensor with torch.tensor/torch.empty (#12027 ) * Replace legacy torch.Tensor constructor with torch.{tensor, empty} * Remove torch.Tensor in examples	2021-06-08 13:58:38 +01:00
Shamane Siri	e33085d648	updated the original RAG implementation to be compatible with latest Pytorch-Lightning (#11806 ) * updated the original RAG implementation to be compatible with the latest PL version * updated the requirements.txt file * execute make style * code quality test * code quality * conflix resolved in requirement.txt * code quality * changed the MyDDP class name to CustomDDP	2021-06-08 13:42:49 +01:00
dependabot[bot]	6db3a87de2	Bump urllib3 from 1.25.8 to 1.26.5 in /examples/research_projects/lxmert (#11983 ) Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.25.8...1.26.5) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-06-02 03:40:20 -04:00
Shamane Siri	9ec0f01b6c	RAG-2nd2end-revamp (#11893 ) * initial * code quality test * code quality * added test functions in test_modeling_rag.py and test_retrieval_rag.py to test end2end retreiver * minor change in test_modeling_rag * fixed tests * Update examples/research_projects/rag-end2end-retriever/README.md typo corrected as suggested by lhoestq Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * Update examples/research_projects/rag-end2end-retriever/finetune_rag.py type change suggested by lhoestq Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * Update src/transformers/models/rag/retrieval_rag.py Adding this change as mentioned by lhoestq. Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> * completed the minor changes suggested by the reviewers Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>	2021-06-01 07:32:26 +01:00
Philip May	77f4c46b50	remove defaults to None if optional (#11703 )	2021-05-12 09:11:10 -04:00
Quentin Lhoest	1a0b41781d	Update requirements.txt (#11634 )	2021-05-10 11:19:52 +05:30
Tommy Chiang	7e406f4a65	[Examples] Fix invalid links after reorg (#11650 )	2021-05-10 11:16:48 +05:30
Manuel Romero	58c789e3d2	Update README.md (#11489 ) Add link to code	2021-04-30 04:29:59 -04:00
Jaimeen Ahn	0661abc545	Variable Correction for Consistency in Distillation Example (#11444 ) As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively, the correction makes the example work	2021-04-26 13:30:48 -04:00
Patrick von Platen	32dbb2d954	make style (#11442 )	2021-04-26 13:50:34 +02:00
Sudharsan S T	f25444cb22	Close open files to suppress ResourceWarning (#11240 ) Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>	2021-04-14 10:31:04 -04:00
Nithin Holla	653076ca30	Save the Wav2Vec2 processor before training starts (#10910 ) Co-authored-by: nithin19 <nithin@amberscript.com>	2021-04-14 14:52:06 +03:00
Stas Bekman	c9035e4537	fix: The 'warn' method is deprecated (#11105 ) * The 'warn' method is deprecated * fix test	2021-04-07 09:20:06 -04:00
Stas Bekman	3d39226a51	s\|Pretrained\|PreTrained\| (#11048 )	2021-04-04 18:08:42 -07:00
versis	335c0ca35c	fixed typo: logging instead of logger (#11025 )	2021-04-02 09:22:22 -04:00
Yih-Dar	e031162a6b	fix md file to avoid evaluation crash (#10962 )	2021-03-30 21:26:22 +03:00
Stas Bekman	05c966f24b	[vulnerability] dep fix (#10954 ) Fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pygments/open @LysandreJik	2021-03-29 17:25:47 -04:00
Stas Bekman	3c27d246e5	[vulnerability] fix dependency (#10914 ) this PR fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/PyYAML/open	2021-03-26 09:06:11 -04:00
Stas Bekman	8fb4671811	[vulnerability] in example deps fix (#10817 ) Takes care of: https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/jinja2/open @LysandreJik Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-22 09:05:24 -04:00
dependabot[bot]	dbfe379514	Bump jinja2 from 2.11.2 to 2.11.3 in /examples/research_projects/lxmert (#10818 ) Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.2 to 2.11.3. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/master/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/2.11.2...2.11.3) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-03-22 08:54:50 -04:00
Qiushi Pan	29904a967b	Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849 ) Fix typo.	2021-03-22 07:58:59 -04:00
Patrick von Platen	0f226f78ce	push (#10846 )	2021-03-22 10:32:21 +03:00
Suraj Patil	82b8d8c7b0	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-21 22:47:09 +05:30
Patrick von Platen	af6125ffdb	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-21 12:31:33 +03:00
Patrick von Platen	5aaf6e1460	small improvements for wav2vec2 info script (#10829 )	2021-03-21 11:41:44 +03:00
Suraj Patil	68b55885ed	add doc for Local machine (#10828 )	2021-03-21 13:25:34 +05:30
Julien Chaumond	1438c487df	wav2vec doc tweaks (#10808 ) * wording/typos tweaks * Make model upload instructions simpler	2021-03-19 12:48:54 -04:00
Patrick von Platen	b9570a813c	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 19:45:28 +03:00
Patrick von Platen	e8968bd03a	[XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806 ) * finish * fix * fix * fix * fix	2021-03-19 12:52:54 +03:00
Stas Bekman	427ea3fecb	addressing vulnerability report in research project deps (#10802 ) Following up on a security alert: https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pillow/open	2021-03-18 22:02:10 -04:00
Patrick von Platen	2ae678229f	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:29:20 +03:00
Patrick von Platen	68a3215949	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:27:40 +03:00
Patrick von Platen	03df3fbcb4	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:26:49 +03:00
Patrick von Platen	e84adbed40	Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786 ) * upload * upload fine-tuning script * improve * adapt * Apply suggestions from code review * correct * upload * finalize * remove @ * correct typos	2021-03-19 00:22:43 +03:00
Suraj Patil	5f19c07a70	add run_common_voice script (#10767 ) * add initial script * finish script * add shell script example * accept chars_to_ignor as cl arg * align the script with other example scripts * add torchaudio dep	2021-03-18 17:21:16 +05:30
Mohamed El-Geish	af8afdc88d	wav2vec2: support datasets other than LibriSpeech (#10581 ) * wav2vec2: support datasets other than LibriSpeech * Formatting run_asr.py to pass code quality test * bundled orthography options and added verbose logs * fixing a typo in timit fine-tuning script * update comment for clarity * resize_lm_head and load custom vocab from file * adding a max_duration_in_seconds filter * do not assign `duration_filter` lambda, use a def * log untransliterated text as well * fix base model for arabic * fix duration filter when target_sr is not set * drop duration_in_seconds when unneeded * script for wav2vec2-large-lv60-timit-asr * fix for "tha" in arabic corpus (huggingface#10581) * adding more options to work with common_voice * PR feedback (huggingface#10581) * small README change	2021-03-18 10:20:26 +03:00
Joe Davison	966ba081c9	zero-shot pipeline multi_class -> multi_label (#10727 )	2021-03-15 16:02:46 -06:00
Stas Bekman	f284089ec4	[examples tests on multigpu] resolving require_torch_non_multi_gpu_but_fix_me (#10561 ) * batch 1 * this is tpu * deebert attempt * the rest	2021-03-08 11:11:40 -08:00
Patrick von Platen	395ffcd757	fix run seq2seq (#10547 )	2021-03-05 18:17:12 +03:00
Patrick von Platen	0234de8418	Add Fine-Tuning for Wav2Vec2 (#10145 ) * add encode labels function to tokenizer * start adding finetuning * init dropout * upload * correct convert script * apply changes * fix second typo * make first dummy training run * adapt convert script * push confg for comparison * remove conf * finish training * adapt data collator * add research folder * update according to fairseq feedback * some minor corrections * refactor masking indices a bit * some minor changes * clean tokenizer * finish clean-up * remove previous logic * update run script * correct training * finish changes * finish model * correct bug * fix training a bit more * add some tests * finish gradient checkpointing * finish example * correct gradient checkpointing * improve tokenization method * revert changes in tokenizer * revert general change * adapt fine-tuning * update * save intermediate test * Update README.md * finish finetuning * delete conversion script * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py * Update src/transformers/models/wav2vec2/processing_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * finish wav2vec2 script * finish wav2vec2 fine-tuning * finalize test * correct test * adapt tests * finish * remove test file Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-01 12:13:17 +03:00
Joe Davison	cbadb5243c	Zero shot distillation script cuda patch (#10284 )	2021-02-19 14:06:57 -05:00
Joe Davison	c6fe17557e	Script for distilling zero-shot classifier to more efficient student (#10244 ) * add zero-shot distillation script * readme wordsmithing * clean up code * add multi-gpu teacher inference plus tidying up more code * add use_fast_tokenizer arg * update results in readme * more readme wordsmithing * style * Add handle to readme Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix code block * add error+docs about distributed & tpu * add @sgugger format requests * xla -> tpu * support fp16 for teacher preds * no checkpoint by default * add demo colab link * add model sharing prompt + model link * correct resulting acc of example Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-02-18 17:08:45 -05:00
Lysandre Debut	0d8e554d42	Line endings should be LF across repo and not CRLF (#10119 )	2021-02-10 10:50:00 -05:00
Stas Bekman	d55e10beab	[research proj] [lxmert] rm bleach dependency (#9970 ) Looks like a vulnerability and it's not really used anywhere in the code, so just as well remove it completely from deps. https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/bleach/open	2021-02-03 05:24:40 -05:00
wlhgtc	1682804ebd	Fit chinese wwm to new datasets (#9887 ) * MOD: fit chinese wwm to new datasets * MOD: move wwm to new folder * MOD: formate code * Styling * MOD add param and recover trainer Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-02-01 03:37:59 -05:00
Sylvain Gugger	3ec40299c1	Remove nested lxmert (#9440 )	2021-01-07 04:10:41 -05:00
Patrick von Platen	eef66035a2	[PyTorch Bart] Split Bart into different models (#9343 ) * first try * remove old template * finish bart * finish mbart * delete unnecessary line * init pegasus * save intermediate * correct pegasus * finish pegasus * remove cookie cutter leftover * add marian * finish blenderbot * replace in file * correctly split blenderbot * delete "old" folder * correct "add statement" * adapt config for tf comp * correct configs for tf * remove ipdb * fix more stuff * fix mbart * push pegasus fix * fix mbart * more fixes * fix research projects code * finish docs for bart, mbart, and marian * delete unnecessary file * correct attn typo * correct configs * remove pegasus for seq class * correct peg docs * correct peg docs * finish configs * further improve docs * add copied from statements to mbart * fix copied from in mbart * add copy statements to marian * add copied from to marian * add pegasus copied from * finish pegasus * finish copied from * Apply suggestions from code review * make style * backward comp blenderbot * apply lysandres and sylvains suggestions * apply suggestions * push last fixes * fix docs * fix tok tests * fix imports code style * fix doc	2021-01-05 22:00:05 +01:00
dependabot[bot]	5dd389d1c7	Bump notebook from 6.1.4 to 6.1.5 in /examples/research_projects/lxmert (#9402 ) Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5. - [Release notes](https://github.com/jupyter/jupyterhub/releases) - [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md) - [Commits](https://github.com/jupyter/jupyterhub/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-01-04 10:02:07 -05:00
Sylvain Gugger	23a71449c0	Put back LXMert example (#9401 )	2021-01-04 09:59:07 -05:00
Sam Shleifer	8eb7f26d5d	simplify marian distillation script (#9394 )	2021-01-04 11:21:24 +05:30
Yoshitomo Matsubara	d944966b19	Fix typos in README and bugs in RAG example code for end-to-end evaluation and finetuning (#9355 ) * fix a bug in eval_batch_retrieval * should return parser as well as other staticmethod * remove duplicate argument * these kwargs are no longer accepted (cause TypeError in self.generator.generate of modeling_rag.py) * fixed file paths in README * moved an arg to add_ray_specific_args	2021-01-03 16:00:30 +01:00
Teven	4eef5889ac	Adding performer fine-tuning research exampke (#9239 ) * added run_mlm_performer.py research example * make styke * make styke * Added a README !	2020-12-21 21:19:41 +01:00
Amog Kamsetty	a4b21cdd20	[RAG] Add Ray implementation for distributed retrieval (#9197 ) * wip * wip * wip * wip * wip * wip * wip * wip * uncomment * uncomment * wip * updates * add docstring * updates * fix arg * fixes * add unit tests * update readme * update readme * update finetune script * update test * add test * add ray to test dependencies * separate ray and ray tune * formatting * shutdown ray at end of test * fix tests * formatting * formatting * even more formatting * address comments * formatting * add files * Update examples/research_projects/rag/test_distributed_retriever.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address comments * addressing comments Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-208.us-west-2.compute.internal> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-12-21 10:39:30 +01:00
Aleksey Tikhonov	291974c65c	GPT-model attention heads pruning example (#9189 ) * Pruning for GPT attn heads * The code formatted according to the transformers requirements * Update run_prune_gpt.py * Update run_prune_gpt.py	2020-12-18 16:32:10 -05:00
Yoshitomo Matsubara	44c340f45f	fix a bug in eval_batch_retrieval (#9089 )	2020-12-15 14:46:55 +01:00
dependabot[bot]	24f6cdeab6	Bump notebook in /examples/research_projects/movement-pruning/lxmert (#9062 ) Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5. - [Release notes](https://github.com/jupyter/jupyterhub/releases) - [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md) - [Commits](https://github.com/jupyter/jupyterhub/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2020-12-11 10:32:43 -05:00
Sylvain Gugger	783d7d2629	Reorganize examples (#9010 ) * Reorganize example folder * Continue reorganization * Change requirements for tests * Final cleanup * Finish regroup with tests all passing * Copyright * Requirements and readme * Make a full link for the documentation * Address review comments * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add symlink * Reorg again * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Adapt title * Update to new strucutre * Remove test * Update READMEs Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-12-11 10:07:02 -05:00

... 5 6 7 8 9

424 Commits