Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Malikeh1375 's Collections
Safety-Aligned Models
AI Safety Benchmarks
Clustered Tulu
LLM-Alignment
LLM Interpretability
Medical Datasets

AI Safety Benchmarks

updated Feb 15
Upvote
1

  • JailbreakBench/JBB-Behaviors

    Viewer • Updated Sep 26, 2024 • 500 • 25k • 103

  • walledai/HarmBench

    Viewer • Updated Jul 31, 2024 • 400 • 8.99k • 44

  • allenai/real-toxicity-prompts

    Viewer • Updated Sep 30, 2022 • 99.4k • 13.4k • 118
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs