--- base_model: BAAI/bge-large-en-v1.5 datasets: - nazhan/qa-lookup-dataset-iter-1 library_name: setfit metrics: - accuracy pipeline_tag: text-classification tags: - setfit - sentence-transformers - text-classification - generated_from_setfit_trainer widget: - text: Get me the first names of employees working in the 'Legal' department. - text: Provide the value of the export tariff paid on shipments to 'Country Z' in 2024. - text: Show me the value of the freight charges for the shipment made on October 10, 2023. - text: Show me the value of the refund issued to 'Customer K' for a defective product. - text: Provide the value of the environmental compliance cost for 2023. inference: true model-index: - name: SetFit with BAAI/bge-large-en-v1.5 results: - task: type: text-classification name: Text Classification dataset: name: nazhan/qa-lookup-dataset-iter-1 type: nazhan/qa-lookup-dataset-iter-1 split: test metrics: - type: accuracy value: 1.0 name: Accuracy --- # SetFit with BAAI/bge-large-en-v1.5 This is a [SetFit](https://github.com/huggingface/setfit) model trained on the [nazhan/qa-lookup-dataset-iter-1](https://huggingface.co/datasets/nazhan/qa-lookup-dataset-iter-1) dataset that can be used for Text Classification. This SetFit model uses [BAAI/bge-large-en-v1.5](https://huggingface.co/BAAI/bge-large-en-v1.5) as the Sentence Transformer embedding model. A [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance is used for classification. The model has been trained using an efficient few-shot learning technique that involves: 1. Fine-tuning a [Sentence Transformer](https://www.sbert.net) with contrastive learning. 2. Training a classification head with features from the fine-tuned Sentence Transformer. ## Model Details ### Model Description - **Model Type:** SetFit - **Sentence Transformer body:** [BAAI/bge-large-en-v1.5](https://huggingface.co/BAAI/bge-large-en-v1.5) - **Classification head:** a [LogisticRegression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html) instance - **Maximum Sequence Length:** 512 tokens - **Number of Classes:** 2 classes - **Training Dataset:** [nazhan/qa-lookup-dataset-iter-1](https://huggingface.co/datasets/nazhan/qa-lookup-dataset-iter-1) ### Model Sources - **Repository:** [SetFit on GitHub](https://github.com/huggingface/setfit) - **Paper:** [Efficient Few-Shot Learning Without Prompts](https://arxiv.org/abs/2209.11055) - **Blogpost:** [SetFit: Efficient Few-Shot Learning Without Prompts](https://huggingface.co/blog/setfit) ### Model Labels | Label | Examples | |:-------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Lookup |