bbunzeck's Collections

Small Language Models Also Work With Small Vocabularies

updated Jan 27, 2025

Models and evaluation data for our 2025 COLING paper (https://aclanthology.org/2025.coling-main.404/).

  • bbunzeck/grapheme-llama

    Text Generation • 15.1M • Updated Sep 17, 2024 • 3 • 1

  • bbunzeck/grapheme-llama-no-whitespace

    Text Generation • 15.1M • Updated Sep 17, 2024 • 3

  • bbunzeck/phoneme-llama

    Text Generation • 15M • Updated Sep 17, 2024 • 6

  • bbunzeck/phoneme-llama-no-whitespace

    Text Generation • 15M • Updated Sep 17, 2024 • 4

  • bbunzeck/phoneme-babylm-10M

    Viewer • Updated Sep 8, 2024 • 3.92M • 10

  • bbunzeck/phoneme-babylm-100M

    Viewer • Updated Sep 8, 2024 • 15.8M • 39

  • bbunzeck/phoneme-blimp

    Viewer • Updated Sep 8, 2024 • 59.9k • 102

  • bbunzeck/rhyme-sentences

    Viewer • Updated Dec 2, 2024 • 400 • 20

  • bbunzeck/wug-words

    Viewer • Updated Dec 2, 2024 • 1k • 9

  • Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas

    Paper • 2410.01487 • Published Oct 2, 2024