Automatic Speech Recognition
Transformers
PyTorch
Portuguese
wav2vec2
audio
speech
portuguese-speech-corpus
PyTorch
Eval Results (legacy)
Instructions to use alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization")# Load model directly from transformers import AutoProcessor, AutoModelForCTC processor = AutoProcessor.from_pretrained("alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization") model = AutoModelForCTC.from_pretrained("alefiury/wav2vec2-large-xlsr-53-coraa-brazilian-portuguese-gain-normalization") - Notebooks
- Google Colab
- Kaggle
metadata
language: pt
datasets:
- CORAA
- common_voice
- mls
- cetuc
- voxforge
metrics:
- wer
tags:
- audio
- speech
- wav2vec2
- pt
- portuguese-speech-corpus
- automatic-speech-recognition
- speech
- PyTorch
license: apache-2.0
model-index:
- name: Alef Iury XLSR Wav2Vec2 Large 53 Portuguese
results:
- task:
name: Speech Recognition
type: automatic-speech-recognition
metrics:
- name: Test CORAA WER
type: wer
value: 24.89%
Wav2vec 2.0 trained with CORAA Portuguese Dataset and Open Portuguese Datasets
This a the demonstration of a fine-tuned Wav2vec model for Portuguese using the following datasets:
Repository
The repository that implements the model to be trained and tested is avaible here.