<!--Copyright 2021 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

# Feature Extractor

A feature extractor prepares input features for audio or vision models. This includes feature extraction from
sequences, *e.g.*, converting audio files to log-mel spectrogram features; feature extraction from images, *e.g.*,
cropping image files; as well as padding, normalization, and conversion to NumPy, PyTorch, and TensorFlow tensors.
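
As a rough illustration of the padding and normalization steps mentioned above, the following library-free sketch
normalizes two variable-length sequences, pads them to a common length, and builds an attention mask. The helper names
`normalize` and `pad_batch`, the `eps` value, the dictionary keys, and the toy data are assumptions made for this
example. They are not part of the Transformers API, and the library's feature extractors implement these steps with
their own options.

```python
def normalize(sequence, eps=1e-7):
    """Scale one sequence to zero mean and approximately unit variance."""
    mean = sum(sequence) / len(sequence)
    variance = sum((x - mean) ** 2 for x in sequence) / len(sequence)
    std = (variance + eps) ** 0.5  # eps avoids division by zero for constant input
    return [(x - mean) / std for x in sequence]


def pad_batch(sequences, padding_value=0.0):
    """Right-pad every sequence to the longest length and build an attention mask."""
    max_length = max(len(seq) for seq in sequences)
    input_values, attention_mask = [], []
    for seq in sequences:
        n_pad = max_length - len(seq)
        input_values.append(list(seq) + [padding_value] * n_pad)
        # 1 marks a real value, 0 marks padding
        attention_mask.append([1] * len(seq) + [0] * n_pad)
    return {"input_values": input_values, "attention_mask": attention_mask}


# Two toy "audio clips" of different lengths (hypothetical data).
raw_audio = [[0.1, 0.3, -0.2, 0.4], [0.5, -0.5]]
batch = pad_batch([normalize(clip) for clip in raw_audio])

print(batch["attention_mask"])  # [[1, 1, 1, 1], [1, 1, 0, 0]]
```

In this sketch each sequence is normalized before padding, so the padding values do not skew the mean and variance.
The attention mask lets a downstream model ignore the padded positions.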

## FeatureExtractionMixin

[[autodoc]] feature_extraction_utils.FeatureExtractionMixin
    - from_pretrained
    - save_pretrained

## SequenceFeatureExtractor

[[autodoc]] SequenceFeatureExtractor
    - pad

## BatchFeature

[[autodoc]] BatchFeature

## ImageFeatureExtractionMixin

[[autodoc]] image_utils.ImageFeatureExtractionMixin