SentenceTransformer based on allenai/specter2_aug2023refresh_base
This is a sentence-transformers model finetuned from allenai/specter2_aug2023refresh_base. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: allenai/specter2_aug2023refresh_base
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
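The Pooling and Normalize stages listed above can be illustrated with a small pure-Python sketch. It assumes mean pooling over non-padding tokens followed by L2 normalization, as the configuration states; the helper names `mean_pool` and `l2_normalize` and the toy 2-dimensional vectors are hypothetical and stand in for the real 768-dimensional token embeddings.

```python
import math

def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask value is 1 (padding is skipped)."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    count = max(count, 1)  # guard against an all-padding input
    return [value / count for value in total]

def l2_normalize(vector):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(v * v for v in vector))
    if norm == 0.0:
        return list(vector)
    return [v / norm for v in vector]

# Toy input: three tokens, the last one is padding and must be ignored.
tokens = [[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)
embedding = l2_normalize(pooled)
print(pooled)
print(embedding)
```

Because the final stage normalizes every embedding to unit length, cosine similarity between two outputs reduces to a plain dot product.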
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("m7n/discipline-tuned_specter_2_015")
# Run inference
sentences = [
'The recursive circulant network G(N,d) can be widely used in the design and implementation of parallel processing architectures. It consists of N identical nodes, each node is connected through bidirectional, point-to-point communication channels to different neighbors by jumping d^i , where {\\leq}i{\\leq}{\\lceil}{\\log}_dN{ ceil} - . In this paper, we investigate the routing of a message on G( ^m,0) , a special kind of RCN, that is key to the performance of this network. On G( ^m,0) we would like to transmit k packets from a source node to k destination nodes simultaneously along paths on this network, the i^{th} packet will be transmitted along the i^{th} path, where {\\leq}k{\\leq}m- , {{\\leq}}i{{\\leq}}m- . In order for all packets to arrive at a destination node quickly and securely, we present an O(m^ ) routing algorithm on G( ^m,0) for generating a set of one-to-many node-disjoint and nearly shortest paths, where each path is either shortest or nearly shortest and the total length of these paths is nearly minimum since the path is mainly determined by employing the Hungarian method.',
'Wireless Mesh Networks aim to attain large connectivity with minimum performance degradation, as network size is increase. As such, scalability is one of the main characteristics of Wireless Mesh Networks that differentiates it from other wireless networks. This characteristic creates the need for bandwidth efficiency strategies to ensure that network performance does not degrade as the size of the network increase. Several researches have been done to realize mesh networks. However, the researches conducted were mostly focused on a per TCP/IP layer basis. Also, the studies on bandwidth efficiency and bandwidth improvement are usually dealt with as separate issues. This paper aims to simultaneously study bandwidth efficiency and improvement. Aside from optimizing the bandwidth given a fixed capacity, the capacity is also increased using results of physical layer studies. In this paper, the capacity is improved by using the concept of non-overlapping channels for wireless communication. A channel allocation scheme is conceptualized to choose the transmission channel that would optimize the network performance parameters with consideration of chosen Quality of Service (QoS) parameters. Network utility maximization is used to optimize the bandwidth after channel selection. Furthermore, a routing scheme is proposed using the results of the network utilization method and the channel allocation scheme to find the optimal path that would maximize the network gain.',
'The separation and recovery of NaF from fluorine containing solution by the common ion effect of Na+ was studied. The solubility of NaF in the solutions of NaCl, NaNO0, Na0CO0, Na0SO0 and NaOH at C was determined. It was found that when the compound containing sodium, such as Na0CO0 or Na0SO0 was added into NaF saturated solution to product the common ion effect of Na+, most of the NaF can be crystallized without evaporating concentration, and the added Na0CO0 or Na0SO0 can be recovered by cooling crystallization. Combining cooling crystallization with the common ion effect of Na+, different processes can be designed to recover NaF from different fluorine containing solutions. This will have a significant impact on the treatment of fluorine containing wastewater and the recycling of fluorine resources.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 768)
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# torch.Size([3, 3])
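For readers who want to see what the similarity step computes, here is a minimal sketch of a pairwise cosine-similarity matrix in pure Python. It is an illustration only, not the library's implementation; the function names and the toy 2-dimensional vectors are assumptions.

```python
import math

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def cosine_similarity(u, v):
    """Cosine of the angle between u and v."""
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))

def similarity_matrix(vectors):
    """Pairwise cosine similarities; entry [i][j] compares vectors i and j."""
    return [[cosine_similarity(u, v) for v in vectors] for u in vectors]

# Toy unit-length vectors standing in for the 768-dimensional embeddings.
toy = [[0.6, 0.8], [1.0, 0.0], [0.0, 1.0]]
matrix = similarity_matrix(toy)
for row in matrix:
    print(row)
```

For three inputs the result is a 3 x 3 matrix with ones on the diagonal, matching the shape printed in the usage example.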
Evaluation
Metrics
Triplet
- Dataset: specter_2_
- Evaluated with TripletEvaluator
Metric | Value |
---|---|
cosine_accuracy | 0.9706 |
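As a rough illustration of the cosine_accuracy metric, the sketch below counts a triplet as correct when the anchor is more similar to the positive than to the negative, and reports the fraction of correct triplets. This is the usual definition of triplet accuracy and is assumed here rather than taken from the evaluator's source; the function names and toy vectors are hypothetical.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between u and v."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def triplet_cosine_accuracy(triplets):
    """Fraction of (anchor, positive, negative) triplets ranked correctly."""
    correct = sum(
        1
        for anchor, positive, negative in triplets
        if cosine_similarity(anchor, positive) > cosine_similarity(anchor, negative)
    )
    return correct / len(triplets)

# Two toy triplets: the first is ranked correctly, the second is not.
toy_triplets = [
    ([1.0, 0.0], [0.9, 0.1], [0.0, 1.0]),
    ([0.0, 1.0], [1.0, 0.0], [0.1, 0.9]),
]
accuracy = triplet_cosine_accuracy(toy_triplets)
print(accuracy)
```

Under this definition, the reported value of 0.9706 would mean that roughly 97% of evaluation triplets place the positive closer to the anchor than the negative.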
Training Details
Training Dataset
Unnamed Dataset
- Size: 43,494 training samples
- Columns: anchor, positive, and negative
- Approximate statistics based on the first 1000 samples:
Statistic | anchor | positive | negative |
---|---|---|---|
type | string | string | string |
min | 80 tokens | 82 tokens | 81 tokens |
mean | 231.5 tokens | 228.95 tokens | 229.72 tokens |
max | 512 tokens | 512 tokens | 512 tokens |
- Samples (three triplets; within each, the texts appear one per line in the order anchor, positive, negative):
The deficiencies of traditional models for the provision of clinical pharmacy services are discussed, and a patient-specific model that integrates drug distribution and clinical pharmacy functions is proposed. Traditional models have either designated specific individuals as providers of clinical pharmacy services or have combined distributive and supportive services with clinical services. In both cases, clinical services have been of secondary importance. Such models have resulted in inconsistent clinical services for which the patient is not necessarily the primary focus and have made it difficult for pharmacists to understand their mission. The lack of a well-defined primary clinical role for pharmacists has confused health-care providers and created problems for managers attempting to evaluate pharmacists and justify clinical services. The integrated patient-specific model is based on the ethical imperative that the patient must be central to any health-care endeavor. Under this m...
Pharmacy workflow efficiencies achieved through the use of an electronic medication-tracking system are described. Medication dispensing turnaround times at the inpatient pharmacy of a large hospital were evaluated before and after transition from manual medication tracking to a Web-based tracking process involving sequential bar-code scanning and real-time monitoring of medication status. The transition was carried out in three phases: ( ) a workflow analysis, including the identification of optimal points for medication scanning with hand-held wireless devices, ( ) the phased implementation of an automated solution and associated hardware at a central dispensing pharmacy and three satellite locations, and ( ) postimplementation data collection to evaluate the impact of the new tracking system and areas for improvement. Relative to the manual tracking method, electronic medication tracking allowed the capture of far more data points, enabling the pharmacy team to delineate the time re...
While the long-term perspective in the organizational analysis has advanced our understanding of field-level dynamics, it has not fully clarified the micro foundation of such dynamics. As a remedy, this article aims to embrace the development of evaluation criteria in the field, where qualitative differences come to be quantitatively evaluated under a criterion associated with one of the qualities. It empirically examines the long-term field dynamics concerning the portable electronic dictionary. Chains of intended and unintended consequences constituted the process of commensuration in the field, which witnessed silent persuasion and belated opposition.
In Escherichia coli , the SeqA protein binds specifically to GATC sequences which are methylated on the A of the old strand but not on the new strand. Such hemimethylated DNA is produced by progression of the replication forks and lasts until Dam methyltransferase methylates the new strand. It is therefore believed that a region of hemimethylated DNA covered by SeqA follows the replication fork. We show that this is, indeed, the case by using global ChIP on Chip analysis of SeqA in cells synchronized regarding DNA replication. To assess hemimethylation, we developed the first genome-wide method for methylation analysis in bacteria. Since loss of the SeqA protein affects growth rate only during rapid growth when cells contain multiple replication forks, a comparison of rapid and slow growth was performed. In cells with six replication forks per chromosome, the two old forks were found to bind surprisingly little SeqA protein. Cell cycle analysis showed that loss of SeqA from the old for...
TRAP is an subunit RNA binding protein that regulates expression of genes involved in tryptophan biosynthesis and transport in Bacillus subtilis . TRAP is activated to bind RNA by binding up to molecules of l -tryptophan in pockets formed by adjacent subunits. The precise mechanism by which tryptophan binding activates TRAP is not known. Thr00 is in the tryptophan binding pocket. A TRAP mutant in which Thr00 is substituted with Val (T00V) does not bind tryptophan but binds RNA constitutively, suggesting that Thr00 plays a key role in the activation mechanism. We have examined the effects of other substitutions of Thr00. TRAP proteins with small -branched aliphatic side chains at residue bind RNA constitutively, whereas those with a small polar side chain show tryptophan-dependent RNA binding. Several mutant proteins exhibited constitutive RNA binding that was enhanced by tryptophan. Although the tryptophan and RNA binding sites on TRAP are distinct and are separated by A, several subst...
Eight rats responded on concurrent Variable-Ratio Extinction schedules for food reinforcement. The assignment of variable-ratio reinforcement to a left or right lever varied randomly following each reinforcer, and was cued by illumination of a stimulus light above that lever. Postreinforcement preference levels decreased substantially and reliably over time when the lever that just delivered reinforcement was now in extinction; however, if that lever was once again associated with variable ratio, this decrease in same-lever preference tended to be small, and for some subjects, not in evidence. The changes in preference level to the extinction lever were well described by a modified version of Killeen, Hanson, and Osborne's ( ) induction model. Consistent with this model's attribution of preference change to induction, we attribute preference change in this report to a brief period of reinforcer-induced arousal that energizes responding to the lever that delivered the last reinforcer. A...
This investigation used case studies to identify barriers to swimming and water safety education for African Americans.The focus was on urban areas and examines the physical and social settings offering recreational learn-to-swim programs through the experiences of African Americans.The findings include statements by parents of participants, swimming instructors, and nonswimmers.There was agreement that a lack of access and exposure to swimming exists for people who are African American.Knowledge or learning to swim can be viewed as cultural capital; for those not learning to swim, it is a cultural liability.This is a cycle in which the lack of access results in institutional decisions that maintain the lack of access to knowledge on water safety.
Maori (the indigenous peoples of Aotearoa, New Zealand) are intimately connected to wai (i.e., water) yet are overrepresented in New Zealand's drowning statistics each year. On average Maori account for - % of all preventable and non-preventable drowning fatalities, despite comprising only percent of New Zealand's population. Drowning remains a significant issue posing a threat to whanau (i.e., families) through premature death being imminent and whakapapa (i.e., genealogy) being interrupted. There is limited research that has examined Maori and indigenous understandings of water safety within the literature and limited studies that have investigated the issue of Maori drowning from a distinctly Maori or indigenous approach. This paper proposes a theory of Maori water safety depicted as the Wai Puna model and draws on three core concepts pertinent to a Maori worldview: whakapapa, matauranga (i.e., Maori knowledge and ways of knowing) and tikanga (i.e., customs, practices). Wai Puna pro...
The aroma of fresh and aged lemon-flavored hard tea was investigated by aroma extract dilution analysis (AEDA), quantitative comparison, and two-dimensional chirality analysis. Aroma extract dilution analysis of fresh hard tea samples showed -methylbutanal, isoamyl alcohol, -damascenone, -ionone, -phenylethanol, -hydroxy- -dimethyl- (0H)-furanone, and vanillin could be the most important aroma contributors to the hard tea due to their high FD values. The analysis of the aged hard tea samples did not reveal new compound formation during storage; however, compared with fresh samples, the flavor dilution value changed substantially in the aged samples. Both AEDA and quantitative analysis demonstrated that -damascenone increased substantially in aged samples, whereas terpene aldehydes decreased substantially after storage. In addition, the FD value of linalool decreased dramatically in aged samples. Two-dimensional GC-MS chirality analysis revealed the FD value decrease of linalool in aged...
- Loss: TripletLoss with these parameters: { "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.6 }
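To make the loss parameters concrete, the following sketch computes a triplet loss with cosine distance and a margin of 0.6 for a single triplet. It assumes the standard formulation, max(0, d(anchor, positive) - d(anchor, negative) + margin) with d = 1 - cosine similarity; the function names and toy vectors are hypothetical, and the real training code operates on batches of tensors.

```python
import math

TRIPLET_MARGIN = 0.6  # value listed in the loss parameters

def cosine_distance(u, v):
    """1 minus cosine similarity, so identical directions give 0."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def triplet_loss(anchor, positive, negative, margin=TRIPLET_MARGIN):
    """Zero once the negative is at least `margin` farther away than the positive."""
    gap = cosine_distance(anchor, positive) - cosine_distance(anchor, negative)
    return max(0.0, gap + margin)

anchor = [1.0, 0.0]
easy = triplet_loss(anchor, [1.0, 0.0], [0.0, 1.0])  # negative is orthogonal
hard = triplet_loss(anchor, [1.0, 0.0], [1.0, 0.0])  # negative equals positive
print(easy, hard)
```

In the easy case the negative is already more than the margin farther away, so the triplet contributes no loss; in the hard case the loss equals the full margin.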
Evaluation Dataset
Unnamed Dataset
- Size: 2,174 evaluation samples
- Columns: anchor, positive, and negative
- Approximate statistics based on the first 1000 samples:
Statistic | anchor | positive | negative |
---|---|---|---|
type | string | string | string |
min | 78 tokens | 83 tokens | 86 tokens |
mean | 234.55 tokens | 235.43 tokens | 228.9 tokens |
max | 512 tokens | 512 tokens | 512 tokens |
- Samples (three triplets; within each, the texts appear one per line in the order anchor, positive, negative):
The strong focus on global warming the recent years has contributed to a change towards a more climate friendly and energy efficient energy system. New energy efficient electrical appliances have clearly shown to be a challenge in the Norwegian distribution system and in the low voltage network in particular. These types of electrical loads have shown to increasingly often cause voltage disturbances exceeding the quality limits in both the EN00000 [ ] and the Norwegian voltage quality regulations [ ]. This have shown to cause everything from only irritation among customers based on poor lighting quality to malfunction and trip of electrical equipment. Estimates made by Norwegian network operators indicate that the necessary network reinforcement investments in Norway are in the range to billion Euros if all customers are being allowed to install and use the most challenging electrical appliances. These challenges will probably be similar in other countries if not necessarily as large a...
The aim of this paper is to provide the power industry with a better understanding of consumers' attitudes and actions at a time when major grid investments are due to be launched. In order to reach the EUs ambitious goals for renewable energy, about major grid projects are being planned throughout Europe. Projects of this kind are often met by strong protests from local environmentalists. This generates negative publicity for the power industry, prolonged official treatment and delays in completing the projects. This results in major socio-economic consequences and should be avoided. Both the industry and the authorities rely upon public acceptance of the measures that are needed to uphold the progress of the projects. How can the power industry handle these challenges? ( pages)
The drawing submitted to the examination of the Society, and engraved Plate XVI. represents a mosaic pavement before the altar of the chapel in the prior's lodgings at ELY, built of stone by John Crawden, or Crouden, prior from to , now a dwelling house, making part of the Deanery, and lately in the occupation of the Reverend Mr. Lewis Jones, son of the late prebendary of that name. The pavement is feet inches long, and feet inch wide and represents the fall of man; Adam and Eve at the forbidden tree, whose fruit the serpent with a human face, which some persons believed he assumed, seems to be recommending to the latter.
The objective of this experiment was to evaluate a new commercial source of monensin (MON) on performance of mid-lactation dairy cows. In Experiment , Holstein cows ( multiparous and primiparous; DIM; kg/d milk yield; kg BW; mean SD) were used in a randomized block design experiment with a -d covariate and -wk treatment period. The first wk of the treatment period were considered adaptation and the last wk were used for data collection and analysis. Treatments were: Control (CTR; no MON added), Rumensin®️ (RUM; mg/d MON from Elanco Animal Health Inc.), and Monovet®️ (MVet; mg/d MON from Huvepharma®️ US Inc.). All cows were fed the same base diet throughout the experiment and treatments were top-dressed during the treatment period. Orthogonal contrasts were used to evaluate CTR vs. MON (RUM + MVet) and RUM vs. MVet. Compared with CTR, MON tended to increase milk yield ( vs. kg/d) but did not affect DMI or feed efficiency. The MVet treatment improved feed efficiency compared with RUM ( v...
The Cornell Net Carbohydrate Protein Model (Chalupa et al., ;Sniffen et al., ) has developed the need for uniform procedures to partition feed nitrogen into A, B, and C fractions (Pichard and Van Soest, ).While carbohydrate fractions are relatively standardized (based on NDF, ADF with corrections for ash, protein, and lignin), the fractionation of plant nitrogen has been open to considerable variation in procedures.This has led to non-uniformity among reported values for nitrogen fractions.This paper recommends reliable procedures for nonprotein nitrogen (NPN) and buffer-soluble protein.These procedures have been examined for reproducibility and relevance to biological expectations.Procedures for acid-detergent insoluble nitrogen (ADIN), and neutral-detergent insoluble nitrogen (NDIN) am also included as they are required for the model.Some alternatives in certain procedures are offered.
This article takes the theme of the fight of the soul with the body and presents selected items of anthropology of St. John Chrysostom. John Chrysostom examines the human situation after original sin in the eschatological aspect and indicates that the body is not the cause of evil, because sin is the consequence of free choice man. Then presents the relationship between the body and the soul, and stresses that the body is subordinate to the soul, to whom falls the responsibility for the deeds of the body. The soul is immortal by the will of God and his dignity transcends the body. The Preacher explains that the worldly biological life doesn't mean real life. John Chrysostom in teaching on man understands the word "spirit" not as a living soul, that is to say, the spiritual element of the man, but as the "Holy Spirit", of course, without the recognition of the role of anything of the soul. Consequently, the struggle between body and spirit means the fight between earthy concern resultin...
The purpose of this study is to determine the effect of Leadership style, Organizational Culture towards the Employee Performance, by partially and simultaneously at commmanditaire vennootschaap (c.v) Kaka Bersaudari, Pangkalpinang. Based on the results of the study shows that: ( ) there is a significant influence between Leadership style towards Employee Performance, which is approved by the value of t -count much greater thant t -table ( > ). ( ) The results also shows that there is a significant influence between Organizational Culture towards Employee Performance, which is proven by the value of t-count much greater than t -table ( > ). ( ) The results show that there is a significant influence between Leadership style and Organizational Culture simultantenously towards Employee Performance by the means of empirically finding by the value of F -count much greater than F -table ( > ). In conclusion, according to the result of this study we suggested to the commmanditaire vennootscha...
The purpose of this study was to examine the influence of leadership on interpersonal communication, and work motivation on work productivity of employees at PT. Pos Indonesia (Persero) Branch Pangkalpinang. The research method used probability sampling method. Respondents of this research are employees at PT. Pos Indonesia (Persero) Branch Pangkalpinang number of people. The variables used are leadership as independent variable and work productivity as dependent variable and variable of interpersonal communication and work motivation as intervening variable developed by itself according to its indicators. This study uses qualitative analysis of direct primary data field, as a tool in the processing of statistical data used SPSS program.The results showed that: After the calculation through the application of SPSS version program obtained the conclusion that all variables affect each other directly or indirectly that has been proven by hypothesis testing on each variable. Based on the ...
Epigenetic modifications influence gene expression and provide a unique mechanism for fine-tuning cellular differentiation and development in multicellular organisms. Here we report on the biological functions of UTX- , the Caenorhabditis elegans homologue of mammalian UTX, a histone demethylase specific for H0K00me0/ . We demonstrate that utx- is an essential gene that is required for correct embryonic and postembryonic development. Consistent with its homology to UTX, UTX- regulates global levels of H0K00me0/ in C. elegans. Surprisingly, we found that the catalytic activity is not required for the developmental function of this protein. Biochemical analysis identified UTX- as a component of a complex that includes SET- (MLL), and genetic analysis indicates that the defects associated with loss of UTX- are likely mediated by compromised SET- /UTX- complex activity. Taken together, these results demonstrate that UTX- is required for many aspects of nematode development; but, unexpected...
- Loss: TripletLoss with these parameters: { "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.6 }
Training Hyperparameters
Non-Default Hyperparameters
- eval_strategy: steps
- per_device_train_batch_size: 12
- per_device_eval_batch_size: 12
- learning_rate: 2e-05
- weight_decay: 0.01
- num_train_epochs: 1
- warmup_ratio: 0.2
- batch_sampler: no_duplicates
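The interaction between learning_rate, warmup_ratio, and the linear scheduler can be sketched as follows. This is an illustration under stated assumptions, not the trainer's code: it assumes a single device, no gradient accumulation, and that the no_duplicates sampler does not change the number of batches, which gives about 3,625 optimizer steps for 43,494 samples at batch size 12 (consistent with the logged epoch of 0.0138 at step 50). The function name `learning_rate` is hypothetical.

```python
import math

TRAIN_SAMPLES = 43_494   # training set size from this card
BATCH_SIZE = 12          # per_device_train_batch_size (single device assumed)
EPOCHS = 1               # num_train_epochs
BASE_LR = 2e-5           # learning_rate
WARMUP_RATIO = 0.2       # warmup_ratio

total_steps = math.ceil(TRAIN_SAMPLES / BATCH_SIZE) * EPOCHS
warmup_steps = math.ceil(total_steps * WARMUP_RATIO)

def learning_rate(step):
    """Linear warmup from 0 to BASE_LR, then linear decay back to 0."""
    if step < warmup_steps:
        return BASE_LR * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return BASE_LR * remaining / max(1, total_steps - warmup_steps)

print(total_steps, warmup_steps)
for step in (0, 50, warmup_steps, total_steps):
    print(step, learning_rate(step))
```

Under these assumptions the peak learning rate is reached only after the first fifth of the epoch, so the earliest logged steps ran at a small fraction of 2e-05.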
All Hyperparameters
- overwrite_output_dir: False
- do_predict: False
- eval_strategy: steps
- prediction_loss_only: True
- per_device_train_batch_size: 12
- per_device_eval_batch_size: 12
- per_gpu_train_batch_size: None
- per_gpu_eval_batch_size: None
- gradient_accumulation_steps: 1
- eval_accumulation_steps: None
- torch_empty_cache_steps: None
- learning_rate: 2e-05
- weight_decay: 0.01
- adam_beta1: 0.9
- adam_beta2: 0.999
- adam_epsilon: 1e-08
- max_grad_norm: 1.0
- num_train_epochs: 1
- max_steps: -1
- lr_scheduler_type: linear
- lr_scheduler_kwargs: {}
- warmup_ratio: 0.2
- warmup_steps: 0
- log_level: passive
- log_level_replica: warning
- log_on_each_node: True
- logging_nan_inf_filter: True
- save_safetensors: True
- save_on_each_node: False
- save_only_model: False
- restore_callback_states_from_checkpoint: False
- no_cuda: False
- use_cpu: False
- use_mps_device: False
- seed: 42
- data_seed: None
- jit_mode_eval: False
- use_ipex: False
- bf16: False
- fp16: False
- fp16_opt_level: O1
- half_precision_backend: auto
- bf16_full_eval: False
- fp16_full_eval: False
- tf32: None
- local_rank: 0
- ddp_backend: None
- tpu_num_cores: None
- tpu_metrics_debug: False
- debug: []
- dataloader_drop_last: False
- dataloader_num_workers: 0
- dataloader_prefetch_factor: None
- past_index: -1
- disable_tqdm: False
- remove_unused_columns: True
- label_names: None
- load_best_model_at_end: False
- ignore_data_skip: False
- fsdp: []
- fsdp_min_num_params: 0
- fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- fsdp_transformer_layer_cls_to_wrap: None
- accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- deepspeed: None
- label_smoothing_factor: 0.0
- optim: adamw_torch
- optim_args: None
- adafactor: False
- group_by_length: False
- length_column_name: length
- ddp_find_unused_parameters: None
- ddp_bucket_cap_mb: None
- ddp_broadcast_buffers: False
- dataloader_pin_memory: True
- dataloader_persistent_workers: False
- skip_memory_metrics: True
- use_legacy_prediction_loop: False
- push_to_hub: False
- resume_from_checkpoint: None
- hub_model_id: None
- hub_strategy: every_save
- hub_private_repo: None
- hub_always_push: False
- gradient_checkpointing: False
- gradient_checkpointing_kwargs: None
- include_inputs_for_metrics: False
- include_for_metrics: []
- eval_do_concat_batches: True
- fp16_backend: auto
- push_to_hub_model_id: None
- push_to_hub_organization: None
- mp_parameters:
- auto_find_batch_size: False
- full_determinism: False
- torchdynamo: None
- ray_scope: last
- ddp_timeout: 1800
- torch_compile: False
- torch_compile_backend: None
- torch_compile_mode: None
- dispatch_batches: None
- split_batches: None
- include_tokens_per_second: False
- include_num_input_tokens_seen: False
- neftune_noise_alpha: None
- optim_target_modules: None
- batch_eval_metrics: False
- eval_on_start: False
- use_liger_kernel: False
- eval_use_gather_object: False
- average_tokens_across_devices: False
- prompts: None
- batch_sampler: no_duplicates
- multi_dataset_batch_sampler: proportional
Training Logs
Epoch | Step | Training Loss | Validation Loss | specter_2__cosine_accuracy |
---|---|---|---|---|
0 | 0 | - | - | 0.9493 |
0.0138 | 50 | 0.488 | 0.4523 | 0.9539 |
0.0276 | 100 | 0.3873 | 0.3068 | 0.9592 |
0.0414 | 150 | 0.2534 | 0.1969 | 0.96 |
0.0552 | 200 | 0.1714 | 0.1464 | 0.9686 |
0.0690 | 250 | 0.1376 | 0.1196 | 0.9684 |
0.0828 | 300 | 0.1069 | 0.1032 | 0.9697 |
0.0966 | 350 | 0.1195 | 0.0961 | 0.9695 |
0.1103 | 400 | 0.1085 | 0.0952 | 0.9707 |
0.1241 | 450 | 0.0867 | 0.0895 | 0.9706 |
0.1379 | 500 | 0.094 | 0.0867 | 0.9707 |
0.1517 | 550 | 0.0979 | 0.0906 | 0.9694 |
0.1655 | 600 | 0.1003 | 0.0849 | 0.9707 |
0.1793 | 650 | 0.0877 | 0.0842 | 0.9716 |
0.1931 | 700 | 0.0967 | 0.0851 | 0.9683 |
0.2069 | 750 | 0.0953 | 0.0888 | 0.9679 |
0.2207 | 800 | 0.0761 | 0.0848 | 0.9683 |
0.2345 | 850 | 0.0966 | 0.0809 | 0.9699 |
0.2483 | 900 | 0.1048 | 0.0875 | 0.9677 |
0.2621 | 950 | 0.0929 | 0.0838 | 0.9691 |
0.2759 | 1000 | 0.0851 | 0.0817 | 0.9697 |
0.2897 | 1050 | 0.0765 | 0.0860 | 0.9676 |
0.3034 | 1100 | 0.0836 | 0.0835 | 0.9706 |
0.3172 | 1150 | 0.0811 | - | - |
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.1
- Transformers: 4.49.0.dev0
- PyTorch: 2.5.1+cu121
- Accelerate: 1.2.1
- Datasets: 3.2.0
- Tokenizers: 0.21.0
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
TripletLoss
@misc{hermans2017defense,
title={In Defense of the Triplet Loss for Person Re-Identification},
author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
year={2017},
eprint={1703.07737},
archivePrefix={arXiv},
primaryClass={cs.CV}
}