I can reproduce the issue. cc @gante for generation and @eustlb for audio models. I noticed a missing-weights warning on load:
Some weights of Speech2TextForConditionalGeneration were not initialized from the model checkpoint at facebook/s2t-small-librispeech-asr and are newly initialized: ['model.decoder.embed_positions.weights', 'model.encoder.embed_positions.weights']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference
However, this warning also appears on 4.50, where generation is correct, so I don't think it's relevant.
System Info
transformers version: 4.51.3
Who can help?
Running the Speech2Text example on transformers 4.51.x gives either nonsense output or no output. The code I'm running is taken verbatim from https://huggingface.co/docs/transformers/en/model_doc/speech_to_text
On transformers 4.50.3 it gives the expected output:
['mister quilter is the apostle of the middle classes and we are glad to welcome his gospel']
On transformers 4.51.x it gives either no output or nonsense output:
With Python 3.12 & transformers 4.51.3:
['that man man man man man man man man man man man man turn turn turn turn turn turn turn turn turn thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin thin']
With Python 3.9 & transformers 4.51.3:
['']
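For anyone bisecting this across transformers versions, a small helper can label each decoded transcription as one of the failure modes shown above. This is only a rough sketch using the standard library: the name `classify_transcription` and the 0.5 repetition threshold are my own assumptions for illustration and are not part of transformers.

```python
from collections import Counter


def classify_transcription(text, max_repeat_ratio=0.5):
    """Label a decoded transcription as 'empty', 'repetitive', or 'ok'.

    Hypothetical triage helper; the threshold is an arbitrary assumption.
    """
    words = text.split()
    if not words:
        # Matches the [''] case seen with Python 3.9 & transformers 4.51.3.
        return "empty"
    # Share of the output taken up by the single most frequent word.
    most_common_count = Counter(words).most_common(1)[0][1]
    if len(words) >= 4 and most_common_count / len(words) > max_repeat_ratio:
        # Matches the looping "thin thin thin ..." case.
        return "repetitive"
    return "ok"
```

Calling it on each element of the list returned by `processor.batch_decode(...)` gives a quick pass/fail signal per installed version without having to read the raw output.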
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
conda create --name temp python=3.12
conda activate temp
pip install torch torchaudio soundfile librosa datasets transformers sentencepiece
Expected behavior
['mister quilter is the apostle of the middle classes and we are glad to welcome his gospel']