In this episode, we introduce GovTech Singapore and its reasons for tackling the content moderation problem. We discuss their innovative approach to building the moderation classifier, which involves using a consensus voting mechanism with existing commercial LLMs to improve labeling in the training dataset, providing a strong foundation for developing the machine learning model.
For more details, you can refer to their published tech blog, linked here for your reference:
https://medium.com/dsaid-govtech/building-lionguard-a-contextualised-moderation-classifier-to-tackle-local-unsafe-content-8f68c8f13179