tomaarsen's picture
tomaarsen HF Staff
Add new SentenceTransformer model
b855015 verified
|
raw
history blame
90.2 kB
metadata
language:
  - en
license: apache-2.0
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - dense
  - generated_from_trainer
  - dataset_size:10000
  - loss:SparseMarginMSEPPPLoss
base_model: microsoft/mpnet-base
widget:
  - source_sentence: what commands can i use in netflix?
    sentences:
      - >-
        Netflix is a streaming service that allows our customers to watch a wide
        variety of award-winning TV shows, movies, documentaries and more on
        thousands of Internet-connected devices. With Netflix, you can enjoy
        unlimited viewing of our content without having to watch a single
        commercial.
      - >-
        Best Answer: You can check out http://instantwatcher.com to see
        everything they have available. Currently there are 9,381 movies.
        Netflix's API says they have 4,505 TV shows, but their system counts
        individual seasons as separate titles. The true number is probably
        between 1,500-2000. As far as I know, there was a time when people
        thought that Netflix is the best service for watching TV shows and
        movies.
      - >-
        The Cortana help section includes the recommended commands you should
        use for each app. Netflix, for example, uses Netflix find [show or
        movie].. Cortana commands work by either speaking or typing (if you
        don't have a microphone or can't bring yourself to talk to your
        computer). If you do use the voice feature, don't forget to start with
        Hey Cortana..
      - >-
        InstantWatcher is a better way to search for Amazon Prime and Netflix
        videos. The site helps you find what you want to watch. Find Ratings.
        Amazon Prime Instant Video and Netflix both provide a wide variety of
        movies and television shows on their video streaming services. But
        finding the show you want to watch can be frustrating.
      - >-
        Courageous fullmovie-youtube-Watch movies online instantly with netflix
        movies. see a list of all available instant netflix movies and start
        your streaming at movies.com...
      - "AllFlicks is a Netflix search engine that, according to Lifehacker, â\x80\x9Cprovides much more powerful search criteriaâ\x80\x9D than Netflix. In addition to searching, you can browse Netflixâ\x80\x99s catalog and sort the list of movies and TV shows by Title, Release Year, Genre, Rating, and Netflix Availability Date. By default, the list is sorted by Netflix Availability Date, so that you can quickly see whatâ\x80\x99s new on Netflix."
      - >-
        What special equipment do I need to watch Netflix. All you need is a
        computer and a connection to the internet and you can watch it directly
        on Netflix.com. In addition you can download the free Netflix app onto
        your smart TV, game console, streaming player, phone or tablet.
      - "To get Netflix on your player you need to select the Netflix App icon, or download it from your connected Blu-ray playerâ\x80\x99s store if itâ\x80\x99s not displayed. Enter your account information as normal and add the activation code to the Netflix site to activate the player on your account."
      - >-
        tv The Best Streaming Netflix TV Shows. List Rules Serial TV shows that
        are currently streaming on Netflix. No movies. The best streaming
        Netflix TV shows include some great television dramas, sci-fi/adventure
        shows, sitcoms, and reality TV programs.
  - source_sentence: >-
      can the new zealand citizen be a dependant child when applying for a 461
      visa
    sentences:
      - "The subclass 461 visa is valid for five years and it may be renewed, subject to certain conditions being met. The visa extension may be granted even if the relationship with the New Zealand citizen on the 444 visa has broken down, as long as certain conditions are met.enerally, the applicant should not be married, but a widow or divorced person is able to apply. One non-blood relationship which is acceptable is if the applicant for the 461 visa is a dependent child of the New Zealand citizenâ\x80\x99s partner or spouse."
      - >-
        You can claim a boyfriend or girlfriend and their children as dependents
        if they are your qualifying relatives. they are not a qualifying child
        of another taxpayer. they meet all of the requirements above to be a
        qualifying relative.
      - >-
        This generally would be your parent or guardian. Here are the criteria
        for being claimed as a dependent as a qualifying child 1  : You are the
        child, stepchild, foster child, sibling, stepsibling or descendant of
        another taxpayer. 2  You lived with the taxpayer for more than half a
        year (there are some exceptions).
      - >-
        1 An Australian citizen travelling on an Australian passport. 2  British
        citizen and or British passport holder who can produce evidence of the
        right to reside permanently in the UK (you can stay up to six months)  A
        citizen of a country which has a visa waiver agreement with New Zealand
        (you can stay up to three months).
      - >-
        Posted. Normally, parents are not considered dependents unless the
        situations in this document apply:
        http://www.cra-arc.g...5/lgbl-eng.html. For non-resident dependents
        (such as the spouse or children of a newcomer to Canada who is still
        residing in their home country), you are able to claim them as
        dependents.osted. Normally, parents are not considered dependents unless
        the situations in this document apply:
        http://www.cra-arc.g...5/lgbl-eng.html. For non-resident dependents
        (such as the spouse or children of a newcomer to Canada who is still
        residing in their home country), you are able to claim them as
        dependents.
      - >-
        Answer: Each child can only be claimed as a dependent by one parent
        (except when married couples file taxes jointly). Typically, in the case
        of divorced, separated, or never married parents, it is the custodial
        parent who is legally able to claim a child as a dependent for tax
        purposes.
      - "Effective August 1, 2014, Citizenship and Immigration Canada has changed its definition of a dependent child for its immigration programs. A dependent child must be under 19 years of age, instead of the previous limit of under 22 years of age. Find out more about the change in the definition of a dependent child. son or daughter is considered a dependant of their parent when the child is: 1  under 19 years old, and does not have a spouse or partner, or. 2  19 years old and over, and has depended largely on the parentâ\x80\x99s financial support since before the age of 19 because of a physical or mental condition."
      - "Children who qualify as dependents. If your son or daughter is your biological child, stepchild, foster child, sibling, step-sibling, or a descendant of any of these individuals, you can claim him/her as your dependent, but the child canâ\x80\x99t turn 19 at any time during the tax year (age 24 if a full-time student)."
      - >-
        There is no age limit on claiming your child as a dependent if the child
        meets the qualifying relative test. As long as all of the following
        tests are met, you may claim a dependency exemption for your child:1 
        Qualifying child or qualifying relative test, 2  Dependent taxpayer
        test, 3  Citizen or resident test, and.o meet the qualifying child test,
        your child must be younger than you and, as of the end of the calendar
        year, either be younger than 19 years old or be a student and younger
        than 24 years old.
  - source_sentence: how soon can i start the nuvaring
    sentences:
      - >-
        Nuva ring effective in 7 days? How does this work? I just started Nuva
        ring, actually it's my first birth control, and my doctor said that it
        only takes 7 days to become effective. I've always heard it takes a FULL
        Cycle...It's making me think about how many hormones are actually in the
        ring... does anyone know why it only takes 7 days opposed to a full
        cyle? thanks.
      - >-
        Menorrhagia is not something you have to put up. with, and NovaSure may
        be the solution. *The average treatment time is 90 seconds, and the
        entire NovaSure procedure. typically takes less than 5 minutes to
        complete.
      - >-
        If NuvaRing® is not started within five days after a first trimester
        abortion or miscarriage, begin NuvaRing® at the time of your next
        menstrual period. Counting the first day of your menstrual period as Day
        1, insert NuvaRing® on or before Day 5 of the cycle, even if you have
        not finished bleeding. During this first cycle, use an extra method of
        birth control, such as male condoms or spermicide, for the first seven
        days of ring use.
      - >-
        #1 trinidadman Junior Member. Hello, I have a female friend of mine
        doing 15mg (5 morning 10mg at noon) of anavar , she has been on for
        about 5 days, when should she start to see any results? Bye the way
        before everyone starts asking why the morning afternoon stuff.
      - >-
        Alternatively, if the NuvaRing is outside the vagina for more than three
        hours during the third week, you can start using a new ring right away.
        This may cause breakthrough spotting or bleeding. Use a backup method of
        contraception until you have used the new ring continuously for seven
        days. Or, if you used NuvaRing continuously for at least the previous
        seven days prior to explusion, you can discard the ring and wait up to
        seven days from the time the ring was removed or expelled to insert a
        new ring.
      - >-
        RE: do you have to store nuvaring in the refrigerator for it to be
        active? i got my nuvaring about 2 weeks ago and they told me to keep it
        stored in the fridge, i've had it in the side of my daughters diaper
        bag the whole time cause i forgot all about it.
      - >-
        No!!! You need to keep Nuvaring in for for 3 weeks, take it out for one
        week, (7 days) then replace with a new one for another 21 days! If you
        take it out , you before 21 days you can get pregnant. And on the web
        site they have a little desktop app that keeps track of the in and out
        times.
      - >-
        Just yesterday I went to the doctor to start on the NuvaRing. The doctor
        had me insert the ring at the appointment even though I'm at day 15 of
        my cycle. When I got home, I read that women who are starting NuvaRing
        who haven't been on another form of oral contraceptive should start the
        ring between days 1 and 5 of their cycle.
      - >-
        Take one tablet as soon as possible after unprotected sex. The
        manufacturer recommends to take within 72 hours (3 days) after
        unprotected sex, but studies have shown moderate efficacy still exists
        if taken up to 120 hours (5 days) after unprotected sex. Levonorgestrel
        0.75 mg, two tablet regimen.
  - source_sentence: who wrote the bell curve
    sentences:
      - >-
        First, you have to know a little about statistics. I assume that the
        grades are drawn from a normal (bell-curve) distribution. These
        distributions have an average (also called a mean) and a standard
        deviation (stdev), which measures how wide the distribution spreads out.
      - >-
        So if you want a bell curve chart, you first have to enter the right
        data. In this tutorial, we select a range of numbers corresponding to a
        Mean average and a Standard Deviation. After applying Excel's Normal
        Distribution function to those numbers, Excel gives you a
        perfectly-formed bell curve.
      - "With standardized IQ tests, IQ tests are designed so that their scores have a â\x80\x98bell curveâ\x80\x99 distribution in the general population with an average of 100. This curve has a peak in the middle where most people score and tapering ends where only a few people score. In statistics this is called a normal distribution."
      - "Definition: The term bell curve is used to describe the mathematical concept called normal distribution, sometimes referred to as Gaussian distribution. â\x80\x98Bell curveâ\x80\x99 refers to the shape that is created when a line is plotted using the data points for an item that meets the criteria of â\x80\x98normal distributionâ\x80\x99.he important things to note about a normal distribution is the curve is concentrated in the center and decreases on either side. This is significant in that the data has less of a tendency to produce unusually extreme values, called outliers, as compared to other distributions."
      - >-
        See normal distribution for the bell curve in statistics. The Bell Curve
        is a controversial, best-selling 1994 book by Richard J. Herrnstein and
        Charles Murray exploring the role of intelligence in American life. The
        authors became notorious for the book's discussion of race and
        intelligence in Chapters 13 and 14.
      - >-
        Noun. 1. Gaussian curve-a symmetrical curve representing the normal
        distribution. bell-shaped curve, Gaussian shape, normal curve.
        statistics-a branch of applied mathematics concerned with the collection
        and interpretation of quantitative data and the use of probability
        theory to estimate population parameters.aussian curve-a symmetrical
        curve representing the normal distribution. bell-shaped curve, Gaussian
        shape, normal curve. statistics-a branch of applied mathematics
        concerned with the collection and interpretation of quantitative data
        and the use of probability theory to estimate population parameters.
      - >-
        Chebyshev's Theorem. The Russian mathematician P. L. Chebyshev (1821-
        1894) discovered that the fraction of observations falling between two
        distinct values, whose differences from the mean have the same absolute
        value, is related to the variance of the population.
      - >-
        A visual glossary of hat types for men, hats, mens hats, advertisement,
        ad, social media ideas, infrographics, fashion, chef hat, derby hat,
        beret, cowboy hat. Fashion in Infographics has a chart for every fashion
        thing you've ever wondered about. So says Buzzfeed, and our followers
        agree.
      - "Definition: The term bell curve is used to describe the mathematical concept called normal distribution, sometimes referred to as Gaussian distribution. â\x80\x98Bell curveâ\x80\x99 refers to the shape that is created when a line is plotted using the data points for an item that meets the criteria of â\x80\x98normal distributionâ\x80\x99."
  - source_sentence: wht is tartaric acid
    sentences:
      - >-
        Tartaric Acid. Tartaric acid is a naturally occurring organic acid which
        appears as a white crystalline solid at room temperature. Foods such as
        grapes, apricots, avocados, apples and sunflower seeds have all been
        known to have high concentrations of the acid.Alternatively, it has also
        been found in tamarinds which are a type of tree indigenous to tropical
        Africa and other warms parts of the world.ses. As mentioned above,
        Mexican cuisine relies heavily on the use of tartaric acid which, when
        combined with baking soda, acts as a leavening agent. The acid also
        plays a major role in wine-making where it is used during the
        fermentation process for acidity adjustments to make for a more palate
        pleasing taste.
      - >-
        Send Enquiry. Tartaric Acid. Tartaric acid is a white crystalline
        diprotic organic acid. It occurs naturally in many plants, particularly
        grapes, bananas, and tamarinds, and is one of the main acids found in
        wine.It is added to other foods to give a sour taste, and is used as
        anantioxidant. Salts of tartaric acid are known as tartrates.end
        Enquiry. Tartaric Acid. Tartaric acid is a white crystalline diprotic
        organic acid. It occurs naturally in many plants, particularly grapes,
        bananas, and tamarinds, and is one of the main acids found in wine.
      - >-
        Acids are used as additives to drinks and foods, as they alter their
        taste and serve as preservatives. Phosphoric acid, for example, is a
        component of cola drinks. Acetic acid is used in day-to-day life as
        vinegar. Carbonic acid is an important part of some cola drinks and
        soda.Citric acid is used as a preservative in sauces and pickles.
        Tartaric acid is an important component of some commonly used foods like
        unripened mangoes and tamarind.cids are used as additives to drinks and
        foods, as they alter their taste and serve as preservatives. Phosphoric
        acid, for example, is a component of cola drinks. Acetic acid is used in
        day-to-day life as vinegar. Carbonic acid is an important part of some
        cola drinks and soda.
      - >-
        Sour salt is actually citric acid, a white crystalline compound with an
        acidic, tart taste; it is often used to furnish the sour taste element
        in sweet/sour dishes such as cabbage soup.
      - >-
        Cream of tartar, which is actually a byproduct of the winemaking
        process, is an acidic powder that is typically used in baking to
        stabilize eggs and creams, as well as adding volume and thickness to the
        final product.
      - >-
        Fumaric acid or trans-butenedioic acid is the chemical compound with the
        formula HO 2 CCH=CHCO 2 H. This white crystalline compound is one of two
        isomeric unsaturated dicarboxylic acids, the other being maleic
        acid.umaric acid has been used a food acidulant since 1946. It is
        approved for use as a food additive in the EU, USA and Australia and New
        Zealand. As a food additive, it is used as an acidity regulator and can
        be denoted by the E number E297.
      - >-
        July 1, 2008 in Baking, How-To's and Tips, Ingredients 72 Comments.
        Cream of tartar, more technically known as potassium hydrogen tartrate,
        is a fine white powder with many culinary applications. It is a
        byproduct of the winemaking process as the powder forms inside wine
        barrels during fermentation.
      - >-
        Tartaric acid is a white crystalline organic acid that occurs naturally
        in many plants, most notably in grapes.he acid itself is added to foods
        as an antioxidant and to impart its distinctive sour taste. Tartaric is
        an alpha-hydroxy-carboxylic acid, is diprotic and aldaric in acid
        characteristics, and is a dihydroxyl derivative of succinic acid.
      - >-
        Citric acid is found naturally in several fruits and berries, and is
        commonly added to many foods and beverages as a preservative or for
        flavor. It has a tart taste, and is also known as lemon salt or sour
        salt. Many of our customers use our citric acid to clean dishwashers and
        dishes, or to make fizzy bath bombs (fun!). Citric acid is completely
        natural, produced by cells during the Krebs cycle.
datasets:
  - tomaarsen/msmarco-Qwen3-Reranker-0.6B
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@3
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@3
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@3
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@100
co2_eq_emissions:
  emissions: 24.122517285028877
  energy_consumed: 0.0620591540178051
  source: codecarbon
  training_type: fine-tuning
  on_cloud: false
  cpu_model: 13th Gen Intel(R) Core(TM) i7-13700K
  ram_total_size: 31.777088165283203
  hours_used: 0.182
  hardware_used: 1 x NVIDIA GeForce RTX 3090
model-index:
  - name: mpnet-base finetuned on MSMARCO via distillation
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: msmarco eval 1kq 1kd
          type: msmarco-eval-1kq-1kd
        metrics:
          - type: cosine_accuracy@1
            value: 0.922
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.958
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.962
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.973
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.922
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.31933333333333325
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.19240000000000004
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.09730000000000001
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.922
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.958
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.962
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.973
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.9492468037327995
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.9415369047619048
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.9424045640484957
            name: Cosine Map@100

mpnet-base finetuned on MSMARCO via distillation

This is a sentence-transformers model finetuned from microsoft/mpnet-base on the msmarco-qwen3-reranker-0.6_b dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: microsoft/mpnet-base
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity
  • Training Dataset:
  • Language: en
  • License: apache-2.0

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'MPNetModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("tomaarsen/mpnet-base-msmarco-margin-mse-pp")
# Run inference
queries = [
    "wht is tartaric acid",
]
documents = [
    'Tartaric Acid. Tartaric acid is a naturally occurring organic acid which appears as a white crystalline solid at room temperature. Foods such as grapes, apricots, avocados, apples and sunflower seeds have all been known to have high concentrations of the acid.Alternatively, it has also been found in tamarinds which are a type of tree indigenous to tropical Africa and other warms parts of the world.ses. As mentioned above, Mexican cuisine relies heavily on the use of tartaric acid which, when combined with baking soda, acts as a leavening agent. The acid also plays a major role in wine-making where it is used during the fermentation process for acidity adjustments to make for a more palate pleasing taste.',
    'Tartaric acid is a white crystalline organic acid that occurs naturally in many plants, most notably in grapes.he acid itself is added to foods as an antioxidant and to impart its distinctive sour taste. Tartaric is an alpha-hydroxy-carboxylic acid, is diprotic and aldaric in acid characteristics, and is a dihydroxyl derivative of succinic acid.',
    'Send Enquiry. Tartaric Acid. Tartaric acid is a white crystalline diprotic organic acid. It occurs naturally in many plants, particularly grapes, bananas, and tamarinds, and is one of the main acids found in wine.It is added to other foods to give a sour taste, and is used as anantioxidant. Salts of tartaric acid are known as tartrates.end Enquiry. Tartaric Acid. Tartaric acid is a white crystalline diprotic organic acid. It occurs naturally in many plants, particularly grapes, bananas, and tamarinds, and is one of the main acids found in wine.',
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)
# [1, 768] [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[0.7672, 0.8267, 0.7948]])

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.922
cosine_accuracy@3 0.958
cosine_accuracy@5 0.962
cosine_accuracy@10 0.973
cosine_precision@1 0.922
cosine_precision@3 0.3193
cosine_precision@5 0.1924
cosine_precision@10 0.0973
cosine_recall@1 0.922
cosine_recall@3 0.958
cosine_recall@5 0.962
cosine_recall@10 0.973
cosine_ndcg@10 0.9492
cosine_mrr@10 0.9415
cosine_map@100 0.9424

Training Details

Training Dataset

msmarco-qwen3-reranker-0.6_b

  • Dataset: msmarco-qwen3-reranker-0.6_b at 20c25c8
  • Size: 10,000 training samples
  • Columns: query, positive, negative_1, negative_2, negative_3, negative_4, negative_5, negative_6, negative_7, negative_8, and score
  • Approximate statistics based on the first 1000 samples:
    query positive negative_1 negative_2 negative_3 negative_4 negative_5 negative_6 negative_7 negative_8 score
    type string string string string string string string string string string list
    details
    • min: 4 tokens
    • mean: 9.18 tokens
    • max: 45 tokens
    • min: 17 tokens
    • mean: 80.35 tokens
    • max: 298 tokens
    • min: 14 tokens
    • mean: 68.3 tokens
    • max: 197 tokens
    • min: 12 tokens
    • mean: 70.27 tokens
    • max: 209 tokens
    • min: 14 tokens
    • mean: 70.16 tokens
    • max: 241 tokens
    • min: 16 tokens
    • mean: 71.17 tokens
    • max: 211 tokens
    • min: 14 tokens
    • mean: 71.8 tokens
    • max: 190 tokens
    • min: 11 tokens
    • mean: 72.04 tokens
    • max: 194 tokens
    • min: 14 tokens
    • mean: 73.0 tokens
    • max: 203 tokens
    • min: 15 tokens
    • mean: 71.02 tokens
    • max: 206 tokens
    • size: 9 elements
  • Samples:
    query positive negative_1 negative_2 negative_3 negative_4 negative_5 negative_6 negative_7 negative_8 score
    what is clomiphene Uses of This Medicine. Clomiphene is used as a fertility medicine in some women who are unable to become pregnant. Clomiphene probably works by changing the hormone balance of the body. In women, this causes ovulation to occur and prepares the body for pregnancy.ses of This Medicine. Clomiphene is used as a fertility medicine in some women who are unable to become pregnant. Clomiphene probably works by changing the hormone balance of the body. In women, this causes ovulation to occur and prepares the body for pregnancy. Clomiphene citrate, a synthetic hormone commonly used to induce or regulate ovulation, is the most often prescribed fertility pill. Brand names for clomiphene citrate include Clomid and Serophene. Clomiphene works indirectly to stimulate ovulation. Occasionally, clomiphene can stimulate the ovaries too much, causing multiple eggs to be released, which can result in multiple births, such as twins or triplets (see Clomid and Twins) . Clomiphene is one of the least expensive and easiest-to-use fertility drugs. However, it will not work for all types of infertility. Your healthcare provider needs to try to find your cause of infertility before you try clomiphene. Clomiphene Citrate offers two benefits to the performance enhancing athlete with one being primary. Most commonly, this SERM is used for post cycle recovery purposes; specifically to stimulate natural testosterone production that has been suppressed due to the use of anabolic steroids. PCOS and ovulation problems and Clomid treatment. Clomid (clomiphene citrate or Serophene) is an oral medication that is commonly used for the treatment of infertility. It is often given to try to induce ovulation in women that do not develop and release an egg (ovulate) on their own. Indication: Clomid (clomiphene citrate) is often the first choice for treating infertility, because it's effective and been used for more than 40 years. Clomid Description. 1 Clomid (clomiphene citrate tablets USP) is an orally administered, nonsteroidal, ovulatory stimulant designated chemically as 2-[p-(2-chloro-1,2-diphenylvinyl)phenoxy] triethylamine citrate (1:1). It has the molecular formula of C26H28ClNO • C6H8O7 and a molecular weight of 598.09. PCOS and ovulation problems and Clomid treatment. Clomid (clomiphene citrate or Serophene) is an oral medication that is commonly used for the treatment of infertility. 1 It is often given to try to induce ovulation in women that do not develop and release an egg (ovulate) on their own. Clomid is started early in the menstrual cycle and is taken for five days either from cycle days 3 through 7, or from day 5 through 9. 2 Clomid is usually started at a dose of one tablet (50mg) daily-taken any time of day. Clomid is taken as a pill. This is unlike the stronger fertility drugs, which require injection. Clomid is also very effective, stimulating ovulation 80 percent of the time. Clomid may also be marketed under the name Serophene, or you may see it sold under its generic name, clomiphene citrate. Note: Clomid can also be used as a treatment for male infertility. This article focuses on Clomid treatment in women. [4.75390625, 6.9375, 3.92578125, 1.0400390625, 5.61328125, ...]
    typical accountant cost for it contractor In the current market, we’ve seen rates as low as £50 +VAT, and as high as £180 +VAT for dedicated contractor accountants. Interestingly, the average cost of contractor accounting has not risen in line with inflation over the past decade. So, how much does a contractor cost, anywhere from 5% to 25% of the total project cost, with the average ranging 10-15%.ypically the contractor' s crew will be general carpentry trades people, some who may have more specialized skills. Exactly how a general contractor charges for a project depends on the type of contract you agree to. There are three common types of cost contracts, fixed price, time & materials and cost plus a fee. 1 Accountants charge $150-$400 or more an hour, depending on the type of work, the size of the firm and its location. 2 You'll pay lower rates for routine work done by a less-experienced associate or lesser-trained employee, such as $30-$50 for bookkeeping services. 3 An accountant's total fee depends on the project. For a simple start-up, expect a minimum of 0.5-1.5 hours of consultation ($75-$600) to go over your business structure and basic tax issues. So, how much does a contractor cost, anywhere from 5% to 25% of the total project cost, with the average ranging 10-15%.xactly how a general contractor charges for a project depends on the type of contract you agree to. There are three common types of cost contracts, fixed price, time & materials and cost plus a fee. Each contract type has pros and cons for both the consumer and for the contractor. 1 Accountants charge $150-$400 or more an hour, depending on the type of work, the size of the firm and its location. 2 You'll pay lower rates for routine work done by a less-experienced associate or lesser-trained employee, such as $30-$50 for bookkeeping services. 3 An accountant's total fee depends on the project. average data entry keystrokes per hour salaries the average salary for data entry keystrokes per hour jobs is $ 20000 Accounting services are typically $250 to $400 per month, or $350 to $500 per quarter. Sales tax and bank recs included. We do all the processing, filing and tax deposits. 5 employees, bi-weekly payroll, direct deposit, $135 per month. The less that is outsourced, the cheaper it will be for you. A bookkeeper should be paid between $15 and $18 per hour. An accountant with a undergraduate degree (4-years) should be paid somewhere around $20/hour but that still depends on what you're having them do. An accountant with a graduate degree (masters) should be paid between $25 and $30 per hour. Pay by Experience Level for Intelligence Analyst. Median of all compensation (including tips, bonus, and overtime) by years of experience. Intelligence Analysts with a lot of experience tend to enjoy higher earnings. [7.44921875, 3.271484375, 5.859375, 3.234375, 5.421875, ...]
    what is mch on a blood test What High Levels Mean. MCH levels in blood tests are considered high if they are 35 or higher. A normal hemoglobin level is considered to be in the range between 26 and 33 picograms per red blood cell. High MCH levels can indicate macrocytic anemia, which can be caused by insufficient vitamin B12.acrocytic RBCs are large so tend to have a higher MCH, while microcytic red cells would have a lower value.”. MCH is one of three red blood cell indices (MCHC and MCV are the other two). The measurements are done by machine and can help with diagnosis of medical problems. MCH stands for mean corpuscular hemoglobin. It estimates the average amount of hemoglobin in each red blood cell, measured in picograms (a trillionth of a gram). Automated cell counters calculate the MCH, which is reported as part of a complete blood count (CBC) test. MCH may be low in iron-deficiency anemia, and may be high in anemia due to vitamin B12 or folate deficiency. Other forms of anemia can also cause MCH to be abnormal. Doctors only use the MCH as supporting information, not to make a diagnosis. A. MCH stands for mean corpuscular hemoglobin. It estimates the average amount of hemoglobin in each red blood cell, measured in picograms (a trillionth of a gram). Automated cell counters calculate the MCH, which is reported as part of a complete blood count (CBC) test. MCH may be low in iron-deficiency anemia, and may be high in anemia due to vitamin B12 or folate deficiency. Other forms of anemia can also cause MCH to be abnormal. The test used to determine the quantity of hemoglobin in the blood is known as the MCH blood test. The full form of MCH is Mean Corpuscular Hemoglobin. This test is therefore used to determine the average amount of hemoglobin per red blood cell in the body. The results of the MCH blood test are therefore reported in picograms, a tiny measure of weight. MCH blood test high indicates that there is a poor supply of oxygen to the blood where as MCH blood test low mean that hemoglobin is too little in the cells indicating a lack of iron. It is important that iron is maintained at a certain level as too much or too little iron can be dangerous to your body. slide 1 of 7. What Is MCH? MCH is the initialism for Mean Corpuscular Hemoglobin. Taken from Latin, the term refers to the average amount of hemoglobin found in red blood cells. A CBC (complete blood count) blood test can be used to monitor MCH levels in the blood. Lab Tests Online explains that the MCH aspect of a CBC test “is a measurement of the average amount of oxygen-carrying hemoglobin inside a red blood cell. Macrocytic RBCs are large so tend to have a higher MCH, while microcytic red cells would have a lower value.. The test used to determine the quantity of hemoglobin in the blood is known as the MCH blood test. The full form of MCH is Mean Corpuscular Hemoglobin. This test is therefore used to determine the average amount of hemoglobin per red blood cell in the body. The results of the MCH blood test are therefore reported in picograms, a tiny measure of weight. The normal range of the MCH blood test is between 26 and 33 pg per cell. A MCHC test is a test that is carried out to test a person for anemia. The MCHC in a MCHC test stands for Mean Corpuscular Hemoglobin Concentration. MCHC is the calculation of the average hemoglobin inside a red blood cell. A MCHC test can be performed along with a MCV test (Mean Corpuscular Volume).Both levels are used to test people for anemia.The MCHC test is also known as the MCH blood test which tests the levels of hemoglobin in the blood. The MCHC test can be ordered as part of a complete blood count (CBC) test.CHC is measured in grams per deciliter. Normal readings for MCHC are 31 grams per deciliter to 35 grams per deciliter. A MCHC blood test may be ordered when a person is showing signs of fatigue or weakness, when there is an infection, is bleeding or bruising easily or when there is an inflammation. The test looks at the average amount of hemoglobin per red cell. So MCHC = the amount of hemoglobin present in each red blood cell. A MCHC blood test could be ordered for someone who has signs of fatigue or weakness, when there is an infection, is bleeding or bruising easily or when there is noticeable inflammation. [6.44921875, 7.05078125, 7.2109375, 8.40625, 6.53515625, ...]
  • Loss: main.SparseMarginMSEPPPLoss

Evaluation Dataset

msmarco-qwen3-reranker-0.6_b

  • Dataset: msmarco-qwen3-reranker-0.6_b at 20c25c8
  • Size: 1,000 evaluation samples
  • Columns: query, positive, negative_1, negative_2, negative_3, negative_4, negative_5, negative_6, negative_7, negative_8, and score
  • Approximate statistics based on the first 1000 samples:
    query positive negative_1 negative_2 negative_3 negative_4 negative_5 negative_6 negative_7 negative_8 score
    type string string string string string string string string string string list
    details
    • min: 4 tokens
    • mean: 9.05 tokens
    • max: 23 tokens
    • min: 20 tokens
    • mean: 81.61 tokens
    • max: 244 tokens
    • min: 11 tokens
    • mean: 69.2 tokens
    • max: 231 tokens
    • min: 11 tokens
    • mean: 68.76 tokens
    • max: 198 tokens
    • min: 15 tokens
    • mean: 70.99 tokens
    • max: 225 tokens
    • min: 14 tokens
    • mean: 70.7 tokens
    • max: 236 tokens
    • min: 14 tokens
    • mean: 72.57 tokens
    • max: 315 tokens
    • min: 16 tokens
    • mean: 68.95 tokens
    • max: 203 tokens
    • min: 13 tokens
    • mean: 71.68 tokens
    • max: 220 tokens
    • min: 11 tokens
    • mean: 70.18 tokens
    • max: 213 tokens
    • size: 9 elements
  • Samples:
    query positive negative_1 negative_2 negative_3 negative_4 negative_5 negative_6 negative_7 negative_8 score
    how many people employed by shell Shell worldwide. Royal Dutch Shell was formed in 1907, although our history dates back to the early 19th century, to a small shop in London where the Samuel family sold sea shells. Today, Shell is one of the world’s major energy companies, employing an average of 93,000 people and operating in more than 70 countries. Our headquarters are in The Hague, the Netherlands, and our Chief Executive Officer is Ben van Beurden. Show sources information. This statistic shows the number of employees at SeaWorld Entertainment, Inc. in the United States, by type. As of December 2016, SeaWorld employed 5,000 full-time employees and counted approximately 13,000 seasonal employees during their peak operating season. Jobs, companies, people, and articles for LinkedIn’s Payroll Specialist - Addus Homecare, Inc. members. Insights about Payroll Specialist - Addus Homecare, Inc. members on LinkedIn. Median salary $31,300. As of July 2014, there are 139 million people employed in the United States. This number is up by 209,000 employees from June and by 1.47 million from the beginning of 2014. average data entry keystrokes per hour salaries the average salary for data entry keystrokes per hour jobs is $ 20000 Research and review Plano Synergy jobs. Learn more about a career with Plano Synergy including all recent jobs, hiring trends, salaries, work environment and more. Find Jobs Company Reviews Find Salaries Find Resumes Employers / Post Job Upload your resume Sign in From millions of real job salary data. 13 Customer Support Specialist salary data. Average Customer Support Specialist salary is $59,032 Detailed Customer Support Specialist starting salary, median salary, pay scale, bonus data report Register & Know how much $ you can earn | Sign In From millions of real job salary data. 1 Ceo Ally salary data. Average Ceo Ally salary is $55,000 Detailed Ceo Ally starting salary, median salary, pay scale, bonus data report HelpSystems benefits and perks, including insurance benefits, retirement benefits, and vacation policy. Reported anonymously by HelpSystems employees. Glassdoor uses cookies to improve your site experience. [6.265625, -1.3671875, -6.91796875, 1.111328125, -7.96875, ...]
    what is a lcsw LCSW is an acronym for licensed clinical social worker, and people with this title are skilled professionals who meet certain requirements and work in a variety of fields. The term social worker is not always synonymous with licensed clinical social worker. LISW means the person is a Licensed Independent Social Worker. LCSW means the person is a Licensed Clinical Social Worker. Source(s): Introduction to Social Work 101 at University of Nevada, Las Vega (UNLV) Dorothy K. · 1 decade ago. An LCSW is a licensed clinical social worker. A LMHC is the newest addition to the field of mental health. They are highly similar and can do most of the same things with few exceptions. One thing to keep in mind is that because the LMHC lincense is so new, there are fewer in number in the field.n LCSW is a licensed clinical social worker. A LMHC is the newest addition to the field of mental health. They are highly similar and can do most of the same things with few exceptions. One thing to keep in mind is that because the LMHC lincense is so new, there are fewer in number in the field. The Licensed Clinical Social Worker or LCSW, is a sub-sector within the field of Social Work. They work with clients in order to help them deal with issues involving their mental and emotional health. This could be related to substance abuse, past trauma or mental illness. Licensed Clinical Social Worker | LCSW. The Licensed Clinical Social Worker or LCSW, is a sub-sector within the field of Social Work. LCSW's work with clients in order to help deal with issues involving mental and emotional health. There are a wide variety of specializations the Licensed Clinical Social Worker can focus on. The LMSW exam is a computer-based test containing 170 multiple-choice questions designed to measure minimum competencies in four categories of social work practice: Human development, diversity, and behavior in the environment. Assessment and intervention planning. The Licensed Clinical Social Worker, also known as the LCSW, is a branch of social work that specializes in mental health therapy in a counseling format. Becoming an LCSW requires a significant degree of training, including having earned a Master of Social Work (MSW) degree from a Council on Social Work Education (CSWE) accredited program. a. The examination requirements for licensure as an LCSW include passing the Clinical Examination of the ASWB or the Clinical Social Workers Examination of the State of California. Scope of practice-Limitations. a.To the extent they are prepared through education and training, an LCSW can engage in all acts and practices defined as the practice of clinical social work. Certified Social Work (CSW): CSW means a licensed certified social worker. A CSW must have a master s degree. The LTCM Client is a way for companies to stay in touch with you, their customers, in a way that is unobtrusive and completely under the users' control. It's an application that runs quietly on the computer. Users can and should customize the client to match their desired preferences. [7.34375, 6.046875, 7.09765625, 6.46484375, 7.28515625, ...]
    does oolong tea have much caffeine? At a given weight, tea contains more caffeine than coffee, but this doesn’t mean that a usual portion of tea contains more caffeine than coffee because tea is usually brewed in a weak way. Some kinds of tea, such as oolong and black tea, contain higher level of caffeine than most other teas. Among six basic teas (green, black, yellow, white, oolong, dark), green tea contains less caffeine than black tea and white tea contains less than green tea. But many studies found that the caffeine content varies more among individual teas than it does among broad categories. Actually, oolong tea has less caffeine than coffee and black tea. A cup of oolong tea only has about 1/3 of caffeine of a cup of coffee. According to a research conducted by HICKS M.B, the caffeine decreases whenever the tea leaves go through the process of brewing. Oolong tea contains caffeine. Caffeine works by stimulating the central nervous system (CNS), heart, and muscles. Oolong tea also contains theophylline and theobromine, which are chemicals similar to caffeine. Too much oolong tea, more than five cups per day, can cause side effects because of the caffeine. Oolong tea, made from more mature leaves, usually have less caffeine than green tea. On the flip side, mature leaves contain less theanine, a sweet, natural relaxant that makes a tea much less caffeinated than it actually is. That is the theory, anyway. Oolong tea is a product made from the leaves, buds, and stems of the Camellia sinensis plant. This is the same plant that is also used to make black tea and green tea. The difference is in the processing.Oolong tea is partially fermented, black tea is fully fermented, and green tea is unfermented. Oolong tea is used to sharpen thinking skills and improve mental alertness. It is also used to prevent cancer, tooth decay, osteoporosis, and heart disease.owever, do not drink more than 2 cups a day of oolong tea. That amount of tea contains about 200 mg of caffeine. Too much caffeine during pregnancy might cause premature delivery, low birth weight, and harm to the baby. A Department of Nutritional Services report provides the following ranges of caffeine content for a cup of tea made with loose leaves: 1 Black Tea: 23 - 110 mg. 2 Oolong Tea: 12 - 55 mg. Green Tea: 8 - 36 mg. Oolong tea is a product made from the leaves, buds, and stems of the Camellia sinensis plant. This is the same plant that is also used to make black tea and green tea. The difference is in the processing. Oolong tea is partially fermented, black tea is fully fermented, and green tea is unfermented. Oolong tea is used to sharpen thinking skills and improve mental alertness. It is also used to prevent cancer, tooth decay, osteoporosis, and heart disease. Health Effects of Tea – Caffeine. In dry form, a kilogram of black tea has twice the caffeine as a kilogram of coffee…. But one kilogram of black tea makes about 450 cups of tea and one kilogram of coffee makes about 100 cups of coffee, so…. There is less caffeine in a cup of tea than in a cup of coffee. Green teas have less caffeine than black teas, and white teas have even less caffeine than green teas. Oolong teas fall between black and green teas. Herbal tea, because it is not made from the same tea plant, is caffeine-free, naturally. Here is a graphical representation of their respective caffeine content. The average 8-ounce serving of brewed black tea contains 14 to 70 mg of caffeine. This compares to 24 to 45 mg of caffeine found in green tea. An 8-ounce glass of instant iced tea prepared with water contains 11 to 47 mg of caffeine. Most ready-to-drink bottled teas contain 5 to 40 mg of caffeine. Just as with coffee, decaffeinated tea still contains 5 to 10 mg of caffeine per cup. [7.60546875, 8.78125, 9.109375, 8.609375, 7.984375, ...]
  • Loss: main.SparseMarginMSEPPPLoss

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 16
  • per_device_eval_batch_size: 16
  • learning_rate: 4e-05
  • num_train_epochs: 1
  • warmup_ratio: 0.1
  • bf16: True

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 16
  • per_device_eval_batch_size: 16
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 4e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 1
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: True
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • tp_size: 0
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • eval_use_gather_object: False
  • average_tokens_across_devices: False
  • prompts: None
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional
  • router_mapping: {}
  • learning_rate_mapping: {}

Training Logs

Epoch Step Training Loss Validation Loss msmarco-eval-1kq-1kd_cosine_ndcg@10
0.032 20 31.516 - -
0.064 40 26.2358 - -
0.096 60 20.1202 - -
0.128 80 16.6985 - -
0.16 100 16.8513 15.2968 0.9020
0.192 120 14.78 - -
0.224 140 13.8785 - -
0.256 160 13.0089 - -
0.288 180 14.0187 - -
0.32 200 14.3072 12.6029 0.9271
0.352 220 14.0647 - -
0.384 240 13.1793 - -
0.416 260 12.8744 - -
0.448 280 12.9611 - -
0.48 300 12.9061 11.5690 0.9358
0.512 320 12.6941 - -
0.544 340 13.4955 - -
0.576 360 13.1799 - -
0.608 380 11.6603 - -
0.64 400 13.0961 10.9323 0.9464
0.672 420 12.3644 - -
0.704 440 11.9309 - -
0.736 460 11.3084 - -
0.768 480 11.0592 - -
0.8 500 10.8269 11.1977 0.9452
0.832 520 11.1987 - -
0.864 540 12.0763 - -
0.896 560 11.9127 - -
0.928 580 10.9865 - -
0.96 600 12.4522 10.5896 0.9479
0.992 620 10.5014 - -
-1 -1 - - 0.9492

Environmental Impact

Carbon emissions were measured using CodeCarbon.

  • Energy Consumed: 0.062 kWh
  • Carbon Emitted: 0.024 kg of CO2
  • Hours Used: 0.182 hours

Training Hardware

  • On Cloud: No
  • GPU Model: 1 x NVIDIA GeForce RTX 3090
  • CPU Model: 13th Gen Intel(R) Core(TM) i7-13700K
  • RAM Size: 31.78 GB

Framework Versions

  • Python: 3.11.6
  • Sentence Transformers: 5.1.0.dev0
  • Transformers: 4.51.3
  • PyTorch: 2.7.1+cu126
  • Accelerate: 1.5.1
  • Datasets: 2.21.0
  • Tokenizers: 0.21.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}