Glossary
Trust and Safety
The organizational function responsible for protecting users and the platform from harm — including abuse, policy violations, and misuse of AI capabilities.
Trust and Safety (T&S) is the team and function within a company that works to keep a platform safe: preventing misuse, responding to policy violations, and protecting both users and the organization from harm. In AI companies, T&S often overlaps with the work of behavior architects. Both teams care about what the model produces and how it can be abused, but T&S tends to focus on enforcement, monitoring, and incident response, while behavior architects focus on proactive design and training. For practitioners entering this space, understanding the T&S function is important context: behavioral design decisions made before launch can become T&S problems after launch, so the two functions work best when closely coordinated.