My Work

The OpenAI Moderation API has always been the essential endpoint you wire into every production surface that accepts user-generated content. The big difference today is that moderation is no longer limited to text: The omni-moderation-latest model is a next-generation multimodal content moderation system built on GPT-4o which can classify text and images with a single request, and it gives you better tools for understanding why something was flagged. This post is a follow-up to my earlier deep dive on the OpenAI moderation classifier: https://blog1.neuralengineer.org/llm-moderation-classifer-openai-moderation-api-fdb124c4536a What moderation is (and is not) Moderation answers one question: “Does this content appear to fall into one of the policy categories I care about?” It is: - A fast, structured classifier for routing (allow, block, review, rate-limit, redact). - A complement to your product policy (not a replacement). It is not: - A substitute for...

Search This Blog

My Work

Posts

OpenAI Moderation API: safer multimodal LLM with omni-moderation-latest (text + image)