jaeyong2/diarize-speaker-segmentation-merged

Automatic Speech Recognition


jaeyong2 committed 6 days ago

Commit 28c855d · verified · 1 Parent(s): aa9b421

Upload fixed README.md

Files changed (1)

README.md +39 -0

README.md ADDED

	@@ -0,0 +1,39 @@

+# Fixed Speaker Segmentation Model
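+This model is a version of `jaeyong2/speaker-segmentation-merge` with the key-mapping problem fixed.
+## What Was Fixed
+- Original model: keys have no `model.` prefix
+- Current model: keys have the `model.` prefix
+- Fix: mapping the prefix gives a 100% key match
+## Usage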
+```python
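+from diarizers import SegmentationModel
+import torch
+# Load the model and the fixed weights
+model = SegmentationModel()
+state_dict = torch.load('pytorch_model.bin', map_location='cpu')
+model.load_state_dict(state_dict)
+# Inference
+model.eval()
+with torch.no_grad():
+    # Audio input: (batch_size, audio_length)
+    audio = torch.randn(1, 16000)  # example: 1 second of audio (16,000 samples)
+    output = model(audio)
+    # The forward pass may return a dict with a "logits" entry rather than a bare tensor
+    logits = output["logits"] if isinstance(output, dict) else output
+    print(f"Output shape: {logits.shape}")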
+```
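+## Model Details
+- Total parameters: 54 layers
+- Architecture: SincNet + LSTM + Linear + Classifier
+- Input: raw audio waveform
+- Output: speaker segmentation result
+## Original Model
+- Repository: jaeyong2/speaker-segmentation-merge
+- Key mapping 100% complete
+- All pretrained weights preserved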
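
The README describes the fix as a prefix mapping but does not show it. The following is a minimal sketch of what such a remapping might look like, not the author's actual script. The helper names `add_prefix` and `key_match_ratio` and the example key names are hypothetical; only the `model.` prefix comes from the README. Plain dictionaries stand in for a real state dict so the sketch runs without PyTorch.

```python
from collections import OrderedDict

PREFIX = "model."  # the prefix named in the README


def add_prefix(state_dict, prefix=PREFIX):
    """Return a copy of state_dict in which every key starts with prefix."""
    remapped = OrderedDict()
    for key, value in state_dict.items():
        # Leave keys that already carry the prefix untouched
        new_key = key if key.startswith(prefix) else prefix + key
        remapped[new_key] = value
    return remapped


def key_match_ratio(source_keys, target_keys):
    """Fraction of target keys that also appear among the source keys."""
    target = set(target_keys)
    if not target:
        return 1.0
    return len(target & set(source_keys)) / len(target)


# Hypothetical key names, for illustration only
original = OrderedDict([
    ("sincnet.conv.weight", 0.1),
    ("lstm.weight_ih_l0", 0.2),
    ("classifier.bias", 0.3),
])
expected = [
    "model.sincnet.conv.weight",
    "model.lstm.weight_ih_l0",
    "model.classifier.bias",
]

fixed = add_prefix(original)
print(key_match_ratio(original.keys(), expected))  # 0.0
print(key_match_ratio(fixed.keys(), expected))     # 1.0
```

With a real checkpoint, the dictionary returned by `torch.load` could be passed through the same helper before calling `load_state_dict`, which by default (`strict=True`) raises an error if any key is missing or unexpected.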