---
tags:
- spacy
- token-classification
language:
- vi
license: cc-by-sa-4.0
model-index:
- name: vi_udv25_vietnamesevtb_trf
  results:
  - task:
      name: TAG
      type: token-classification
    metrics:
    - name: TAG (XPOS) Accuracy
      type: accuracy
      value: 0.8805048216
  - task:
      name: POS
      type: token-classification
    metrics:
    - name: POS (UPOS) Accuracy
      type: accuracy
      value: 0.9018631331
  - task:
      name: MORPH
      type: token-classification
    metrics:
    - name: Morph (UFeats) Accuracy
      type: accuracy
      value: 0.9695345305
  - task:
      name: LEMMA
      type: token-classification
    metrics:
    - name: Lemma Accuracy
      type: accuracy
      value: 0.8934519139
  - task:
      name: UNLABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Unlabeled Attachment Score (UAS)
      type: f_score
      value: 0.6807696182
  - task:
      name: LABELED_DEPENDENCIES
      type: token-classification
    metrics:
    - name: Labeled Attachment Score (LAS)
      type: f_score
      value: 0.6063552526
  - task:
      name: SENTS
      type: token-classification
    metrics:
    - name: Sentences F-Score
      type: f_score
      value: 0.943275972
---
UD v2.5 benchmarking pipeline for UD_Vietnamese-VTB
| Feature | Description |
|---|---|
| Name | vi_udv25_vietnamesevtb_trf |
| Version | 0.0.1 |
| spaCy | >=3.2.1,<3.3.0 |
| Default Pipeline | experimental_char_ner_tokenizer, transformer, tagger, morphologizer, parser, experimental_edit_tree_lemmatizer |
| Components | experimental_char_ner_tokenizer, transformer, senter, tagger, morphologizer, parser, experimental_edit_tree_lemmatizer |
| Vectors | 0 keys, 0 unique vectors (0 dimensions) |
| Sources | Universal Dependencies v2.5 (Zeman, Daniel; et al.) |
| License | CC BY-SA 4.0 |
| Author | Explosion |
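
The `spaCy` row above pins the pipeline to a narrow version range. Below is a minimal sketch of checking an installed version against that range before loading. The helper names (`parse_version`, `satisfies`, `load_pipeline`) are hypothetical, the comparison ignores pre-release suffixes, and `load_pipeline` assumes the `vi_udv25_vietnamesevtb_trf` package and any packages its experimental components depend on are already installed.

```python
import operator
import re

# Version range copied from the "spaCy" row of the table above.
SPACY_REQUIREMENT = ">=3.2.1,<3.3.0"

_OPS = {
    ">=": operator.ge,
    "<=": operator.le,
    "==": operator.eq,
    ">": operator.gt,
    "<": operator.lt,
}


def parse_version(text):
    """Turn '3.2.4' into (3, 2, 4). Pre-release suffixes are ignored."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unsupported version string: {text!r}")
    return tuple(int(part) for part in match.groups())


def satisfies(version, requirement=SPACY_REQUIREMENT):
    """Return True if `version` meets every comma-separated clause."""
    installed = parse_version(version)
    for clause in requirement.split(","):
        match = re.match(r"\s*(>=|<=|==|>|<)\s*(.+)", clause)
        if match is None:
            raise ValueError(f"unsupported requirement clause: {clause!r}")
        op, bound = match.groups()
        if not _OPS[op](installed, parse_version(bound)):
            return False
    return True


def load_pipeline(name="vi_udv25_vietnamesevtb_trf"):
    """Hypothetical wrapper: check the spaCy version, then load the pipeline."""
    import spacy  # imported lazily so the helpers above work without spaCy

    if not satisfies(spacy.__version__):
        raise RuntimeError(
            f"spaCy {spacy.__version__} is outside {SPACY_REQUIREMENT}"
        )
    return spacy.load(name)


# Example use (requires spaCy and the model package):
#   nlp = load_pipeline()
#   doc = nlp("Tôi là sinh viên.")
#   for token in doc:
#       print(token.text, token.tag_, token.pos_, token.dep_)
```

In the table, `senter` appears under Components but not under Default Pipeline, which suggests it is disabled by default. If sentence boundaries from `senter` are wanted, spaCy's `nlp.enable_pipe("senter")` is the usual way to switch on a disabled component.
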
Label Scheme
View label scheme (81 labels for 6 components)
| Component | Labels |
|---|---|
| `experimental_char_ner_tokenizer` | `TOKEN` |
| `senter` | `I`, `S` |
| `tagger` | `!`, `"`, `,`, `-`, `.`, `...`, `:`, `;`, `?`, `@`, `A`, `C`, `CC`, `E`, `I`, `L`, `LBKT`, `M`, `N`, `NP`, `Nb`, `Nc`, `Np`, `Nu`, `Ny`, `P`, `R`, `RBKT`, `T`, `V`, `VP`, `X`, `Y`, `Z` |
| `morphologizer` | `POS=NOUN`, `POS=ADP`, `POS=X\|Polarity=Neg`, `POS=VERB`, `POS=ADJ`, `POS=PUNCT`, `POS=X`, `POS=SCONJ`, `NumType=Card\|POS=NUM`, `POS=DET`, `POS=CCONJ`, `POS=PROPN`, `POS=AUX`, `POS=PART`, `POS=INTJ` |
| `parser` | `ROOT`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `cop`, `csubj`, `dep`, `det`, `discourse`, `iobj`, `list`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `parataxis`, `punct`, `xcomp` |
| `experimental_edit_tree_lemmatizer` | `0` |
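
Each morphologizer label packs the coarse part of speech and any morphological features into one string, with `|` separating `Key=Value` pairs (for example `NumType=Card|POS=NUM`). The following sketch splits such a label into its parts. The function name `parse_morph_label` is hypothetical and the code works on the label strings only, without spaCy.

```python
def parse_morph_label(label):
    """Split a label such as 'NumType=Card|POS=NUM' into (upos, features)."""
    pairs = dict(item.split("=", 1) for item in label.split("|"))
    upos = pairs.pop("POS")  # every label in the scheme above carries POS
    return upos, pairs


# Three labels taken from the morphologizer row above.
for label in ["POS=NOUN", "POS=X|Polarity=Neg", "NumType=Card|POS=NUM"]:
    upos, features = parse_morph_label(label)
    print(f"{label!r}: upos={upos}, features={features}")
```

When the pipeline runs, spaCy exposes the same information per token as `token.pos_` and `token.morph`, so this kind of manual parsing is mainly useful when inspecting the label scheme itself.
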
Accuracy
| Type | Score |
|---|---|
| `TOKEN_F` | 87.90 |
| `TOKEN_P` | 86.84 |
| `TOKEN_R` | 89.00 |
| `TOKEN_ACC` | 98.42 |
| `SENTS_F` | 94.33 |
| `SENTS_P` | 96.23 |
| `SENTS_R` | 92.50 |
| `TAG_ACC` | 88.05 |
| `POS_ACC` | 90.19 |
| `MORPH_ACC` | 96.95 |
| `DEP_UAS` | 68.08 |
| `DEP_LAS` | 60.64 |
| `LEMMA_ACC` | 89.35 |
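
As a sanity check on the table, the `_F` rows can be recomputed from the `_P` and `_R` rows. The sketch below assumes the F-scores are the usual harmonic mean of precision and recall (F1). Because the inputs are already rounded to two decimals, the recomputed values may differ from the reported ones in the last digit.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall (F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F) copied from the Accuracy table.
reported = {
    "TOKEN": (86.84, 89.00, 87.90),
    "SENTS": (96.23, 92.50, 94.33),
}

for name, (p, r, f) in reported.items():
    recomputed = f_score(p, r)
    print(f"{name}: reported {f:.2f}, recomputed {recomputed:.2f}")
```
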