transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

History

Max Baak 08cd694ef0 ENH: added new output_logits option to generate function (#28667 ) output_logits option behaves like output_scores, but returns the raw, unprocessed prediction logit scores, ie. the values before they undergo logit processing and/or warping. The latter happens by default for the regular output scores. It's useful to have the unprocessed logit scores in certain circumstances. For example, unprocessed logit scores are very useful with causallm models when one wants to determine the probability of a certain answer, e.g. when asking a question with a yes/no answer. In that case getting the next-token probabilities of both "yes" and "no" (and/or their relative ratio) is of interest for classification. The reason for getting these _before_ logit processing and/or warping is b/c a) that can change the probabilities or b) reject the tokens of interest / reduce the number of tokens to just 1. For an example use-case see paper TabLLM: Few-shot Classification of Tabular Data with Large Language Models by Stefan Hegselmann, Alejandro Buendia, Hunter Lang, Monica Agrawal, Xiaoyi Jiang, and David Sontag. https://arxiv.org/abs/2210.10723 In addition: - added dedicated unit test: tests/generation/test_utils/test_return_unprocessed_logit_scores which tests return of logics with output_logits=True in generation. - set output_logits=True in all other generation unit tests, that also have output_scores=True. Implemented @gante's and @amyeroberts review feedback Co-authored-by: kx79wq <max.baak@ing.com>		2024-02-19 17:34:17 +00:00
..
benchmark	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
bettertransformer	Fixed malapropism error (#26660 )	2023-10-09 11:04:57 +02:00
deepspeed	fix failing trainer ds tests (#29057 )	2024-02-16 17:18:45 +05:30
extended	Device agnostic trainer testing (#27131 )	2023-10-30 18:16:40 +00:00
fixtures	[WIP] add SpeechT5 model (#18922 )	2023-02-03 12:43:46 -05:00
fsdp	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
generation	ENH: added new output_logits option to generate function (#28667 )	2024-02-19 17:34:17 +00:00
models	Fix the `bert-base-cased` tokenizer configuration test (#29105 )	2024-02-19 13:23:25 +01:00
optimization	Make schedulers picklable by making lr_lambda fns global (#21768 )	2023-03-02 12:08:43 -05:00
peft_integration	[`Peft`] `modules_to_save` support for peft integration (#27466 )	2023-11-14 10:32:57 +01:00
pipelines	Add chat support to text generation pipeline (#28945 )	2024-02-16 16:41:01 +00:00
quantization	FIX [`bnb` / `tests`]: Fix currently failing bnb tests (#29092 )	2024-02-19 10:39:12 +01:00
repo_utils	Allow `# Ignore copy` (#27328 )	2023-12-07 10:00:08 +01:00
sagemaker	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tokenization	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
tools	Add support for for loops in python interpreter (#24429 )	2023-06-26 09:58:14 -04:00
trainer	Fix trainer test wrt DeepSpeed + auto_find_bs (#29061 )	2024-02-16 10:04:24 -05:00
utils	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_cache_utils.py	Fix static generation when compiling! (#28937 )	2024-02-15 06:27:40 +01:00
test_configuration_common.py	[ `PretrainedConfig`] Improve messaging (#27438 )	2023-11-15 14:10:39 +01:00
test_configuration_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
test_image_processing_common.py	Fix a couple of typos and add an illustrative test (#26941 )	2023-12-11 15:51:51 +00:00
test_image_processing_utils.py	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
test_image_transforms.py	Normalize floating point cast (#27249 )	2023-11-10 15:35:27 +00:00
test_modeling_common.py	Add tie_weights() to LM heads and set bias in set_output_embeddings() (#28948 )	2024-02-14 20:39:01 +00:00
test_modeling_flax_common.py	[Flax] Update no init test for Flax v0.7.1 (#28735 )	2024-01-26 18:20:39 +00:00
test_modeling_flax_utils.py	Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599 )	2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py	Add tf_keras imports to prepare for Keras 3 (#28588 )	2024-01-30 17:26:36 +00:00
test_modeling_tf_utils.py	Add tf_keras imports to prepare for Keras 3 (#28588 )	2024-01-30 17:26:36 +00:00
test_modeling_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_pipeline_mixin.py	Image Feature Extraction pipeline (#28216 )	2024-02-05 14:50:07 +00:00
test_processing_common.py	Don't save `processor_config.json` if a processor has no extra attribute (#28584 )	2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
test_tokenization_utils.py	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00