Harassment
Why This Matters
Everyone deserves to communicate without being targeted, belittled, or intimidated. Harassment erodes trust in online spaces and can cause lasting psychological harm. By detecting harassing content, we help create environments where people feel safe to participate and share.
What It Detects
Content that expresses, incites, or promotes harassing language towards any person or group. This includes bullying, intimidation, and targeted attacks designed to demean or silence others.
Examples
- •Repeated insults directed at a specific person
- •Mocking someone's appearance, abilities, or personal circumstances
- •Encouraging others to target or pile on an individual
Subcategory
Harassment with Threats
Harassment that escalates to include violence, serious harm, or credible threats against someone's safety or wellbeing.
Examples
- •Threatening physical violence against someone being harassed
- •Stating intent to harm someone or their family
- •Describing specific plans to cause someone distress or injury