🤬 hongssi/final_abuse_manual_model

hongssi/final_abuse_manual_model์€ ํ•œ๊ตญ์–ด ๋ฌธ์žฅ์—์„œ ์š•์„ค, ํ˜์˜ค ํ‘œํ˜„, ๋ชจ์š•์„ฑ ๋ฐœ์–ธ ๋“ฑ์„ ํƒ์ง€ํ•˜๋Š” ๋‹ค์ค‘ ๋ ˆ์ด๋ธ” ๋ถ„๋ฅ˜ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
Smilegate์˜ UNSMILE ๋ฐ์ดํ„ฐ์…‹ ์„ ๊ธฐ๋ฐ˜์œผ๋กœ, beomi/KcELECTRA-small ๋ชจ๋ธ์„ ํŒŒ์ธํŠœ๋‹ํ•˜์—ฌ ์ œ์ž‘๋˜์—ˆ์Šต๋‹ˆ๋‹ค.


๐Ÿง  ๋ชจ๋ธ ๊ฐœ์š”

  • ✅ Base model: beomi/KcELECTRA-small
  • ✅ Task: multi-label classification (sigmoid-based)
  • ✅ Output: a probability in [0.0, 1.0] for each label
  • ✅ Purpose: detecting and classifying profanity/insults in call centers, online communities, chatbots, etc.

๐Ÿท๏ธ ํด๋ž˜์Šค ๋ผ๋ฒจ (11๊ฐœ)

```
[
  "여성/가족", "남성", "성소수자", "인종/국적", "연령",
  "지역", "종교", "기타 혐오", "악플/욕설", "clean", "개인지칭"
]
```

ํ•œ ๋ฌธ์žฅ์ด ์—ฌ๋Ÿฌ ๋ผ๋ฒจ์— ํ•ด๋‹น๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค (multi-label classification)


🧾 Training Details

Item            Value
Dataset         UNSMILE
Samples         95,000+ sentences
Architecture    ELECTRA-small with an 11-node classification head
Tokenizer       KcELECTRA tokenizer (uncased, 128 tokens max)
Input length    max_length=128
Loss            Binary cross-entropy (BCEWithLogitsLoss)
Optimizer       AdamW
Learning rate   5e-5
Batch size      32
Epochs          5
Metrics         Macro F1, binary accuracy
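The loss choice above is what makes the multi-label setup work: BCEWithLogitsLoss applies a sigmoid followed by binary cross-entropy independently to each of the 11 outputs, rather than a softmax over them. A minimal pure-Python rendition of the single-label term (an illustration of the math, not the actual training code):

```python
import math

def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))

def bce_with_logits(logit: float, target: float) -> float:
    """Binary cross-entropy on one label's raw logit; per-element
    equivalent of torch.nn.BCEWithLogitsLoss."""
    p = sigmoid(logit)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))

# A confident correct prediction costs little; a confident wrong one costs a lot.
print(round(bce_with_logits(3.0, 1.0), 4))  # → 0.0486
print(round(bce_with_logits(3.0, 0.0), 4))  # → 3.0486
```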

๐Ÿ“Š ๋ชจ๋ธ ์„ฑ๋Šฅ

ํด๋ž˜์Šค F1 ์ ์ˆ˜
์•…ํ”Œ/์š•์„ค 0.87
์—ฌ์„ฑ/๊ฐ€์กฑ 0.84
์„ฑ์†Œ์ˆ˜์ž 0.78
clean 0.91
๊ธฐํƒ€ ํ‰๊ท  Macro F1: 0.83

ํ‰๊ฐ€ ๊ธฐ์ค€์€ UNSMILE validation set ๊ธฐ๋ฐ˜์ด๋ฉฐ, ์‹ค์‚ฌ์šฉ ํ™˜๊ฒฝ์—์„œ ์ „์ฒ˜๋ฆฌ ๋ฐ ์‚ฌ์ „ ํƒ์ง€ ์‹œ์Šคํ…œ๊ณผ ํ•จ๊ป˜ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.


📥 Usage Example

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

labels = [
    "여성/가족", "남성", "성소수자", "인종/국적", "연령",
    "지역", "종교", "기타 혐오", "악플/욕설", "clean", "개인지칭"
]

model_id = "hongssi/final_abuse_manual_model"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)
model.eval()

text = "야 너는 사람도 아니다"
inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True, max_length=128)

with torch.no_grad():
    outputs = model(**inputs)
    probs = torch.sigmoid(outputs.logits)[0]  # independent sigmoid per label

results = {label: float(prob) for label, prob in zip(labels, probs)}
print(results)
```

🧠 Combined Use: with a Profanity Dictionary

๋ณธ ๋ชจ๋ธ์€ Aho-Corasick ๊ธฐ๋ฐ˜์˜ ์š•์„ค ์‚ฌ์ „ ํƒ์ง€์™€ ํ•จ๊ป˜ ์‚ฌ์šฉํ•  ๊ฒฝ์šฐ, ๋ชจ๋ธ์ด ํƒ์ง€ํ•˜์ง€ ๋ชปํ•œ ๋ช…์‹œ์  ๋น„์†์–ด๋„ ๋ณด์™„ํ•  ์ˆ˜ ์žˆ์–ด ์‹ค์‚ฌ์šฉ์—์„œ ๋”์šฑ ์•ˆ์ •์ ์ธ ์šด์˜์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค.


โœ… ๋ผ์ด์„ ์Šค

  • ๋ณธ ๋ชจ๋ธ์€ MIT ๋ผ์ด์„ ์Šค๋ฅผ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.
  • ํ•™์Šต ๋ฐ์ดํ„ฐ์ธ UNSMILE์€ Smilegate์—์„œ ๊ณต๊ฐœํ•œ ์ €์ž‘๋ฌผ๋กœ, ํ•ด๋‹น ๋ผ์ด์„ ์Šค๋ฅผ ๋ฐ˜๋“œ์‹œ ํ™•์ธํ•˜์„ธ์š”.

๐Ÿ™‹โ€โ™‚๏ธ ์ž‘์„ฑ์ž

  • ๐Ÿ‘ค hongssi (ํ™ํƒœํœ˜)
  • โœ‰๏ธ [email protected]
  • ๐Ÿ”— ๊ด€๋ จ ํ”„๋กœ์ ํŠธ: FastAPI ๊ธฐ๋ฐ˜ ์š•์„ค ํƒ์ง€ API ์„œ๋ฒ„
