claudios/VulBERTa-MLP-VulDeePecker

@@ -2,40 +2,40 @@
 license: mit
 arxiv: 2205.12424
 datasets:
-- code_x_glue_cc_defect_detection
 metrics:
 - accuracy
 - precision
 - recall
 - f1
 - roc_auc
-model-index:
-  - name: VulBERTa MLP
-    results:
-      - task:
-          type: defect-detection
-        dataset:
-          name: codexglue-devign
-          type: codexglue-devign
-        metrics:
-          - name: Accuracy
-            type: Accuracy
-            value: 64.71
-          - name: Precision
-            type: Precision
-            value: 64.80
-          - name: Recall
-            type: Recall
-            value: 50.76
-          - name: F1
-            type: F1
-            value: 56.93
-          - name: ROC-AUC
-            type: ROC-AUC
-            value: 71.02
 pipeline_tag: text-classification
 tags:
-- devign
 - defect detection
 - code
 ---
@@ -46,7 +46,7 @@ tags:
 ![VulBERTa architecture](https://raw.githubusercontent.com/ICL-ml4csec/VulBERTa/main/VB.png)
 ## Overview
-This model is the unofficial HuggingFace version of "[VulBERTa](https://github.com/ICL-ml4csec/VulBERTa/tree/main)" with an MLP classification head, trained on CodeXGlue Devign (C code), by Hazim Hanif & Sergio Maffeis (Imperial College London). I simplified the tokenization process by adding the cleaning (comment removal) step to the tokenizer and added the simplified tokenizer to this model repo as an AutoClass.
 > This paper presents VulBERTa, a deep learning approach to detect security vulnerabilities in source code. Our approach pre-trains a RoBERTa model with a custom tokenisation pipeline on real-world code from open-source C/C++ projects. The model learns a deep knowledge representation of the code syntax and semantics, which we leverage to train vulnerability detection classifiers. We evaluate our approach on binary and multi-class vulnerability detection tasks across several datasets (Vuldeepecker, Draper, REVEAL and muVuldeepecker) and benchmarks (CodeXGLUE and D2A). The evaluation results show that VulBERTa achieves state-of-the-art performance and outperforms existing approaches across different datasets, despite its conceptual simplicity, and limited cost in terms of size of training data and number of model parameters.
@@ -61,7 +61,7 @@ Note that due to the custom tokenizer, you must pass `trust_remote_code=True` wh
 Example:
 ```
 from transformers import pipeline
-pipe = pipeline("text-classification", model="claudios/VulBERTa-MLP-Devign", trust_remote_code=True, return_all_scores=True)
 pipe("static void filter_mirror_setup(NetFilterState *nf, Error **errp)\n{\n    MirrorState *s = FILTER_MIRROR(nf);\n    Chardev *chr;\n    chr = qemu_chr_find(s->outdev);\n    if (chr == NULL) {\n        error_set(errp, ERROR_CLASS_DEVICE_NOT_FOUND,\n                  \"Device '%s' not found\", s->outdev);\n    qemu_chr_fe_init(&s->chr_out, chr, errp);")
 >> [[{'label': 'LABEL_0', 'score': 0.014685827307403088},
   {'label': 'LABEL_1', 'score': 0.985314130783081}]]

 license: mit
 arxiv: 2205.12424
 datasets:
+- VulDeePecker
 metrics:
 - accuracy
 - precision
 - recall
 - f1
 - roc_auc
+# model-index:
+#   - name: VulBERTa MLP
+#     results:
+#       - task:
+#           type: defect-detection
+#         dataset:
+#           name: vuldeepecker
+#           type: vuldeepecker
+#         metrics:
+#           - name: Accuracy
+#             type: Accuracy
+#             value: 64.71
+#           - name: Precision
+#             type: Precision
+#             value: 64.80
+#           - name: Recall
+#             type: Recall
+#             value: 50.76
+#           - name: F1
+#             type: F1
+#             value: 56.93
+#           - name: ROC-AUC
+#             type: ROC-AUC
+#             value: 71.02
 pipeline_tag: text-classification
 tags:
+- vuldeepecker
 - defect detection
 - code
 ---
 ![VulBERTa architecture](https://raw.githubusercontent.com/ICL-ml4csec/VulBERTa/main/VB.png)
 ## Overview
+This model is the unofficial HuggingFace version of "[VulBERTa](https://github.com/ICL-ml4csec/VulBERTa/tree/main)" with an MLP classification head, trained on [VulDeePecker](https://arxiv.org/abs/1801.01681) ([dataset](https://huggingface.co/datasets/claudios/VulDeePecker)), by Hazim Hanif & Sergio Maffeis (Imperial College London). I simplified the tokenization process by adding the cleaning (comment removal) step to the tokenizer and added the simplified tokenizer to this model repo as an AutoClass.
 > This paper presents VulBERTa, a deep learning approach to detect security vulnerabilities in source code. Our approach pre-trains a RoBERTa model with a custom tokenisation pipeline on real-world code from open-source C/C++ projects. The model learns a deep knowledge representation of the code syntax and semantics, which we leverage to train vulnerability detection classifiers. We evaluate our approach on binary and multi-class vulnerability detection tasks across several datasets (Vuldeepecker, Draper, REVEAL and muVuldeepecker) and benchmarks (CodeXGLUE and D2A). The evaluation results show that VulBERTa achieves state-of-the-art performance and outperforms existing approaches across different datasets, despite its conceptual simplicity, and limited cost in terms of size of training data and number of model parameters.
 Example:
 ```
 from transformers import pipeline
+pipe = pipeline("text-classification", model="claudios/VulBERTa-MLP-VulDeePecker", trust_remote_code=True, return_all_scores=True)
 pipe("static void filter_mirror_setup(NetFilterState *nf, Error **errp)\n{\n    MirrorState *s = FILTER_MIRROR(nf);\n    Chardev *chr;\n    chr = qemu_chr_find(s->outdev);\n    if (chr == NULL) {\n        error_set(errp, ERROR_CLASS_DEVICE_NOT_FOUND,\n                  \"Device '%s' not found\", s->outdev);\n    qemu_chr_fe_init(&s->chr_out, chr, errp);")
 >> [[{'label': 'LABEL_0', 'score': 0.014685827307403088},
   {'label': 'LABEL_1', 'score': 0.985314130783081}]]
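
The pipeline returns one inner list per input string, with one score per label. A minimal sketch of reducing that output to a single verdict follows. It assumes that `LABEL_1` is the vulnerable class, which the model card does not state, and the helper name `summarize` and the `ASSUMED_NAMES` mapping are hypothetical and not part of the model repo.

```python
# Output copied from the example above: one inner list per input string.
output = [[{'label': 'LABEL_0', 'score': 0.014685827307403088},
           {'label': 'LABEL_1', 'score': 0.985314130783081}]]

# ASSUMPTION: LABEL_1 is the "vulnerable" class. The model card does not
# state the label mapping, so check the model config before relying on it.
ASSUMED_NAMES = {"LABEL_0": "non-vulnerable", "LABEL_1": "vulnerable"}


def summarize(scores):
    """Return (readable name, score) for the highest-scoring label of one input."""
    best = max(scores, key=lambda entry: entry["score"])
    # Fall back to the raw label if it is not in the assumed mapping.
    return ASSUMED_NAMES.get(best["label"], best["label"]), best["score"]


verdicts = [summarize(scores) for scores in output]
print(verdicts)
```

Keeping the mapping in one dictionary makes it easy to correct if the actual label order turns out to differ.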