transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

Author	SHA1	Message	Date
Patrick von Platen	e84adbed40	Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786 ) * upload * upload fine-tuning script * improve * adapt * Apply suggestions from code review * correct * upload * finalize * remove @ * correct typos	2021-03-19 00:22:43 +03:00
Suraj Patil	5f19c07a70	add run_common_voice script (#10767 ) * add initial script * finish script * add shell script example * accept chars_to_ignor as cl arg * align the script with other example scripts * add torchaudio dep	2021-03-18 17:21:16 +05:30
Mohamed El-Geish	af8afdc88d	wav2vec2: support datasets other than LibriSpeech (#10581 ) * wav2vec2: support datasets other than LibriSpeech * Formatting run_asr.py to pass code quality test * bundled orthography options and added verbose logs * fixing a typo in timit fine-tuning script * update comment for clarity * resize_lm_head and load custom vocab from file * adding a max_duration_in_seconds filter * do not assign `duration_filter` lambda, use a def * log untransliterated text as well * fix base model for arabic * fix duration filter when target_sr is not set * drop duration_in_seconds when unneeded * script for wav2vec2-large-lv60-timit-asr * fix for "tha" in arabic corpus (huggingface#10581) * adding more options to work with common_voice * PR feedback (huggingface#10581) * small README change	2021-03-18 10:20:26 +03:00
Joe Davison	966ba081c9	zero-shot pipeline multi_class -> multi_label (#10727 )	2021-03-15 16:02:46 -06:00
Stas Bekman	f284089ec4	[examples tests on multigpu] resolving require_torch_non_multi_gpu_but_fix_me (#10561 ) * batch 1 * this is tpu * deebert attempt * the rest	2021-03-08 11:11:40 -08:00
Patrick von Platen	395ffcd757	fix run seq2seq (#10547 )	2021-03-05 18:17:12 +03:00
Patrick von Platen	0234de8418	Add Fine-Tuning for Wav2Vec2 (#10145 ) * add encode labels function to tokenizer * start adding finetuning * init dropout * upload * correct convert script * apply changes * fix second typo * make first dummy training run * adapt convert script * push confg for comparison * remove conf * finish training * adapt data collator * add research folder * update according to fairseq feedback * some minor corrections * refactor masking indices a bit * some minor changes * clean tokenizer * finish clean-up * remove previous logic * update run script * correct training * finish changes * finish model * correct bug * fix training a bit more * add some tests * finish gradient checkpointing * finish example * correct gradient checkpointing * improve tokenization method * revert changes in tokenizer * revert general change * adapt fine-tuning * update * save intermediate test * Update README.md * finish finetuning * delete conversion script * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py * Update src/transformers/models/wav2vec2/processing_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * finish wav2vec2 script * finish wav2vec2 fine-tuning * finalize test * correct test * adapt tests * finish * remove test file Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-01 12:13:17 +03:00
Joe Davison	cbadb5243c	Zero shot distillation script cuda patch (#10284 )	2021-02-19 14:06:57 -05:00
Joe Davison	c6fe17557e	Script for distilling zero-shot classifier to more efficient student (#10244 ) * add zero-shot distillation script * readme wordsmithing * clean up code * add multi-gpu teacher inference plus tidying up more code * add use_fast_tokenizer arg * update results in readme * more readme wordsmithing * style * Add handle to readme Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * fix code block * add error+docs about distributed & tpu * add @sgugger format requests * xla -> tpu * support fp16 for teacher preds * no checkpoint by default * add demo colab link * add model sharing prompt + model link * correct resulting acc of example Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-02-18 17:08:45 -05:00
Lysandre Debut	0d8e554d42	Line endings should be LF across repo and not CRLF (#10119 )	2021-02-10 10:50:00 -05:00
Stas Bekman	d55e10beab	[research proj] [lxmert] rm bleach dependency (#9970 ) Looks like a vulnerability and it's not really used anywhere in the code, so just as well remove it completely from deps. https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/bleach/open	2021-02-03 05:24:40 -05:00
wlhgtc	1682804ebd	Fit chinese wwm to new datasets (#9887 ) * MOD: fit chinese wwm to new datasets * MOD: move wwm to new folder * MOD: formate code * Styling * MOD add param and recover trainer Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-02-01 03:37:59 -05:00
Sylvain Gugger	3ec40299c1	Remove nested lxmert (#9440 )	2021-01-07 04:10:41 -05:00
Patrick von Platen	eef66035a2	[PyTorch Bart] Split Bart into different models (#9343 ) * first try * remove old template * finish bart * finish mbart * delete unnecessary line * init pegasus * save intermediate * correct pegasus * finish pegasus * remove cookie cutter leftover * add marian * finish blenderbot * replace in file * correctly split blenderbot * delete "old" folder * correct "add statement" * adapt config for tf comp * correct configs for tf * remove ipdb * fix more stuff * fix mbart * push pegasus fix * fix mbart * more fixes * fix research projects code * finish docs for bart, mbart, and marian * delete unnecessary file * correct attn typo * correct configs * remove pegasus for seq class * correct peg docs * correct peg docs * finish configs * further improve docs * add copied from statements to mbart * fix copied from in mbart * add copy statements to marian * add copied from to marian * add pegasus copied from * finish pegasus * finish copied from * Apply suggestions from code review * make style * backward comp blenderbot * apply lysandres and sylvains suggestions * apply suggestions * push last fixes * fix docs * fix tok tests * fix imports code style * fix doc	2021-01-05 22:00:05 +01:00
dependabot[bot]	5dd389d1c7	Bump notebook from 6.1.4 to 6.1.5 in /examples/research_projects/lxmert (#9402 ) Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5. - [Release notes](https://github.com/jupyter/jupyterhub/releases) - [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md) - [Commits](https://github.com/jupyter/jupyterhub/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-01-04 10:02:07 -05:00
Sylvain Gugger	23a71449c0	Put back LXMert example (#9401 )	2021-01-04 09:59:07 -05:00
Sam Shleifer	8eb7f26d5d	simplify marian distillation script (#9394 )	2021-01-04 11:21:24 +05:30
Yoshitomo Matsubara	d944966b19	Fix typos in README and bugs in RAG example code for end-to-end evaluation and finetuning (#9355 ) * fix a bug in eval_batch_retrieval * should return parser as well as other staticmethod * remove duplicate argument * these kwargs are no longer accepted (cause TypeError in self.generator.generate of modeling_rag.py) * fixed file paths in README * moved an arg to add_ray_specific_args	2021-01-03 16:00:30 +01:00
Teven	4eef5889ac	Adding performer fine-tuning research exampke (#9239 ) * added run_mlm_performer.py research example * make styke * make styke * Added a README !	2020-12-21 21:19:41 +01:00
Amog Kamsetty	a4b21cdd20	[RAG] Add Ray implementation for distributed retrieval (#9197 ) * wip * wip * wip * wip * wip * wip * wip * wip * uncomment * uncomment * wip * updates * add docstring * updates * fix arg * fixes * add unit tests * update readme * update readme * update finetune script * update test * add test * add ray to test dependencies * separate ray and ray tune * formatting * shutdown ray at end of test * fix tests * formatting * formatting * even more formatting * address comments * formatting * add files * Update examples/research_projects/rag/test_distributed_retriever.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address comments * addressing comments Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-208.us-west-2.compute.internal> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-12-21 10:39:30 +01:00
Aleksey Tikhonov	291974c65c	GPT-model attention heads pruning example (#9189 ) * Pruning for GPT attn heads * The code formatted according to the transformers requirements * Update run_prune_gpt.py * Update run_prune_gpt.py	2020-12-18 16:32:10 -05:00
Yoshitomo Matsubara	44c340f45f	fix a bug in eval_batch_retrieval (#9089 )	2020-12-15 14:46:55 +01:00
dependabot[bot]	24f6cdeab6	Bump notebook in /examples/research_projects/movement-pruning/lxmert (#9062 ) Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5. - [Release notes](https://github.com/jupyter/jupyterhub/releases) - [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md) - [Commits](https://github.com/jupyter/jupyterhub/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2020-12-11 10:32:43 -05:00
Sylvain Gugger	783d7d2629	Reorganize examples (#9010 ) * Reorganize example folder * Continue reorganization * Change requirements for tests * Final cleanup * Finish regroup with tests all passing * Copyright * Requirements and readme * Make a full link for the documentation * Address review comments * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add symlink * Reorg again * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Adapt title * Update to new strucutre * Remove test * Update READMEs Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-12-11 10:07:02 -05:00

... 5 6 7 8 9

424 Commits