SEACrowd/kawat
Updated • 16
Evaluation of pretrained Indonesian word embeddings and embeddings trained on Indonesian online news shows they improve performance in downstream tasks.
We introduced KaWAT (Kata Word Analogy Task), a new word analogy task dataset for Indonesian. We evaluated on it several existing pretrained Indonesian word embeddings and embeddings trained on Indonesian online news corpus. We also tested them on two downstream tasks and found that pretrained word embeddings helped either by reducing the training epochs or yielding significant performance gains.
Get this paper in your agent:
hf papers read 1906.09912 curl -LsSf https://hf.co/cli/install.sh | bash No model linking this paper
No Space linking this paper
No Collection including this paper