ShieldGemma
Google · June 2024
● activeOpen Sourcedecoder onlytext
Description
Google's open-source safety classifier built on Gemma 2, designed to filter harmful content in both LLM inputs and outputs. Supports customizable safety policies and content categories, enabling developers to tune safety behavior to their application's needs.
Key Innovations
safety-classifier