SinhSafe

Created: 14-01-2026 Forks: 0 Watchers: 0 Stars: 0
Description

SinhSafe benchmarks XLM-R, SinBERT, and SinhLlama to identify the best-performing model for Sinhala/Singlish cyberbullying detection. Our core contribution is a large pseudo-labeled dataset with three fine-grained labels (Normal, Offensive, Bullying), processed through a hybrid transliteration pipeline. The goal is to advance safety tools for low-resource NLP.
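
To illustrate what pseudo-labeling with the three labels above can look like, here is a minimal sketch. It is not the project's actual pipeline: the confidence threshold, the `pseudo_label` helper, and the keyword-based `toy_predict_proba` stand-in for a fine-tuned classifier are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Label set taken from the project description.
LABELS = ("Normal", "Offensive", "Bullying")


def pseudo_label(
    texts: List[str],
    predict_proba: Callable[[str], Dict[str, float]],
    threshold: float = 0.9,  # assumed cutoff, not stated in the source
) -> List[Tuple[str, str]]:
    """Keep only texts whose top predicted label meets the threshold."""
    kept = []
    for text in texts:
        probs = predict_proba(text)
        # Pick the label with the highest predicted probability.
        label = max(LABELS, key=lambda name: probs[name])
        if probs[label] >= threshold:
            kept.append((text, label))
    return kept


def toy_predict_proba(text: str) -> Dict[str, float]:
    """Hypothetical stand-in for a real model; uses fixed keyword rules."""
    if "hate" in text:
        return {"Normal": 0.02, "Offensive": 0.03, "Bullying": 0.95}
    if "rude" in text:
        return {"Normal": 0.30, "Offensive": 0.60, "Bullying": 0.10}
    return {"Normal": 0.92, "Offensive": 0.05, "Bullying": 0.03}


samples = ["hello", "you are rude", "i hate you"]
labeled = pseudo_label(samples, toy_predict_proba)
```

In this sketch the low-confidence sample ("you are rude") is dropped rather than given a noisy label, which is a common design choice when building pseudo-labeled training data.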
