m7n commited on
Commit
d41cd9a
·
verified ·
1 Parent(s): 015be56

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,672 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:40000
8
+ - loss:TripletLoss
9
+ base_model: allenai/specter2_aug2023refresh_base
10
+ widget:
11
+ - source_sentence: Abstract Simple and rapid voltammetric method for simultaneous
12
+ determination of all trans retinyl acetate (RAc) or all trans retinyl palmitate
13
+ (RPa) and tocopheryl acetate (TOAc) has been proposed. The respective method was
14
+ based on the anodic oxidation of the compounds of interest by squarewave voltammetry
15
+ in acetone with mol L LiClO at the glassy carbon electrode. The procedure was
16
+ also beneficial with respect to simple dissolution of sample directly in the supporting
17
+ electrolyte. The all trans retinyl acetate could be quantified in two linear ranges
18
+ ( mol L and mol L ) and tocopheryl acetate in linear range mol L with detection
19
+ limits of mol L RAc (or mol L RPa) and of mol L TOAc. Selected commercial cosmetic
20
+ products were analysed achieving satisfactory recoveries.
21
+ sentences:
22
+ - 'The nitrification inhibitors (NIs) -dimethylpyrazole (DMPP) and dicyandiamide
23
+ (DCD) can effectively reduce N0 O emissions; however, which species are targeted
24
+ and the effect of these NIs on the microbial nitrifier community is still unclear.
25
+ Here, we identified the ammonia oxidizing bacteria (AOB) species linked to N0
26
+ O emissions and evaluated the effects of urea and urea with DCD and DMPP on the
27
+ nitrifying community in a day field experiment under sugarcane. Using an amoA
28
+ AOB amplicon sequencing approach and mining a previous dataset of 00S rRNA sequences,
29
+ we characterized the most likely N0 O-producing AOB as a Nitrosospira spp. and
30
+ identified Nitrosospira (AOB), Nitrososphaera (archaeal ammonia oxidizer) and
31
+ Nitrospira (nitrite-oxidizer) as the most abundant, present nitrifiers. The fertilizer
32
+ treatments had no effect on the alpha and beta diversities of the AOB communities.
33
+ Interestingly, we found three clusters of co-varying variables with nitrifier
34
+ operational taxonomic units (OTUs): the N0 O-producing AOB Nitrosospira with N0
35
+ O, NO0- , NH0+ , water-filled pore space (WFPS) and pH; AOA Nitrososphaera with
36
+ NO0- , NH0+ and pH; and AOA Nitrososphaera and NOB Nitrospira with NH0+ , which
37
+ suggests different drivers. These results support the co-occurrence of non-N0
38
+ O-producing Nitrososphaera and Nitrospira in the unfertilized soils and the promotion
39
+ of N0 O-producing Nitrosospira under urea fertilization. Further, we suggest that
40
+ DMPP is a more effective NI than DCD in tropical soil under sugarcane.'
41
+ - In order to achieve cost efficiency, customer satisfaction and also to concentrate
42
+ on core business operations, many manufacturing firms are outsourcing their logistics
43
+ activities to third party logistics (0PLs) provider. Reverse logistics is one
44
+ type of logistics in which used products or end-of-life products are collected
45
+ from the customers/retailers and send for reuse, refurbishing, recycling and/or
46
+ remanufacturing. The third party reverse logistics provider (0PRLP) who is performing
47
+ the reverse logistics operations is under a pressure of reducing the transportation
48
+ cost between the customers and the collecting centre. Decreasing transport costs
49
+ can be achieved through better utilization of resources such as vehicles (i.e.
50
+ through proper vehicle routing). This study aims to find the optimal routes which
51
+ will minimize the total distance traveled and corresponding transportation costs
52
+ for a 0PRLP who transports the used tires from various customers to the centralized
53
+ depot for the purpose of tire remanufacturing/retreading. A hybrid approach of
54
+ combining Sweep and Clarke-Wright savings algorithm with Simulated Annealing (SA)
55
+ algorithm is proposed in this study and also the results of SA are compared with
56
+ Sweep and Clarke-Wright savings algorithm results.
57
+ - Abstract Orientin, eriodictyol and robinin are polyphenolic compounds, and their
58
+ oxidation mechanism is pHdependent, in two steps, involving a different number
59
+ of electrons and protons. Orientin and eriodictyol first oxidation occurs at a
60
+ lower potential, corresponding to the reversible oxidation of the catechol group,
61
+ and is followed by an irreversible oxidation on the ringA at more positive potential.
62
+ Robenin oxidation is irreversible, with the formation of electroactive products,
63
+ and occurs at ringA and ringB. The electrochemical characterization of their redox
64
+ behaviour brought useful data about their chemical stability, antioxidant and
65
+ prooxidant activity, enabling a comprehensive understanding of their redox mechanism.
66
+ - source_sentence: This work studied the degradation of polyethylene terephthalate
67
+ by ethanol with and without catalysts. The degradation without catalyst, PET was
68
+ introduced into an autoclave with ethanol and heated at the temperature of 000o
69
+ C for , and hours. After heating it was cooled down to room temperature, amd the
70
+ product was taken to check percentage yield by the Nuclear Magnetic Resonance
71
+ Spectrometer. In case of using the catalysts, cobalt acetate, zinc acetate and
72
+ stannous chloride were used. The results showed that the degradation with the
73
+ catalysts obtained percentage yield of product, diethylene terephthalate (DET),
74
+ higher than without catalyst for this purpose than zinc acetate and stannous chloride,
75
+ respectively. The DET yield increased with an increase in the reaction time.
76
+ sentences:
77
+ - 'Poplars and willows planted on farms for soil conservation and shelter are also
78
+ potential sources of supplementary forage. The objective of this paper is to provide
79
+ information that assists in the estimation of the value of poplar and willow forage.
80
+ The quantity of forage in trees and branches was measured and non-destructive
81
+ methods for estimating forage yield were evaluated. The edible forage dry matter
82
+ (DM) of - -year-old trees ranged from - kg DM/tree. The edible forage yield of
83
+ poplar and willow branches with a basal diameter (BD) up to mm was shown to be
84
+ estimated from kg DM = BD - . The nutritive values of poplars and willows were
85
+ found to be similar, but the concentration of condensed tannins was usually higher
86
+ in willows. Tree bark was found to have sufficient nutritive value to be stripped
87
+ from trees for its feed value by livestock. Cattle were observed to be able to
88
+ browse willows to a height of 0m and to eat stems with a diameter from to mm.
89
+ Keywords: browse estimation, condensed tannins, nutritive value, poplar, supplements,
90
+ willow'
91
+ - In Lake Rogoznica, a small saline and eutrophic lake on the coast of the Adriatic
92
+ Sea, the copepod Acartia (Acanthacartia) italica Steuer, is common, occasionally
93
+ as an extremely dense population. This phenomenon provided an opportunity for
94
+ a redescription of the adults and for description of the developmental stages.
95
+ The segmentation and setation patterns of the antennules, antennae and mandibles
96
+ of A. italica are analysed in detail through the naupliar and copepodid phases
97
+ and the other limbs are analysed through the copepodid phase. In addition, wider
98
+ comparisons are made with available data for other species of the subgenus Acanthacartia
99
+ Steuer, .
100
+ - This research studied the effect of other plastics blending on the degradation
101
+ of polypropylene by mixing polyethylene and polystyrene as impurities with polypropylene
102
+ in concentrations of %, %, % and % by weight and pyrolysing under nitrogen atmosphere.
103
+ From the thermal analysis by Thermo gravimetric analyzer (TGA), it is found that
104
+ the virgin polypropylene was degraded at oC and that for polyethylene blending
105
+ on polypropylene, the temperature of degradation was increased to the range of
106
+ oC and for polrstyrene blending on polypropylene, temperature was decreased to
107
+ the range of oC. The pyrolysis of plastics mixtures in various ratios at oC gave
108
+ oil, gas and residue as product. The oil and gas are mixture of micro molecular
109
+ hydrocarbon and their derivatives which could be served as feedstock for light
110
+ olifins manufacture in the same way as crude petroleum
111
+ - source_sentence: Abstract Full-length A0- and A0- , N-truncated pyroglutamate A0-
112
+ and A0- are major variants in the Alzheimer brain. A0- has not been considered
113
+ as a therapeutic target yet. We demonstrate that the antibody NT0X and its Fab
114
+ fragment reacting with both the free N-terminus of A0-x and pyroglutamate A0-X
115
+ mitigated neuron loss in Tg0- mice expressing A0- and completely rescued spatial
116
+ reference memory deficits after passive immunization. NT0X and its Fab fragment
117
+ also rescued working memory deficits in wild type mice induced by intraventricular
118
+ injection of A0- . NT0X reduced pyroglutamate A0-x, Ax- and Thioflavin-S positive
119
+ plaque load after passive immunization of 0XFAD mice. A0-x and Ax- plaque deposits
120
+ were unchanged. Importantly, for the first time, we demonstrate that passive immunization
121
+ using the antibody NT0X is therapeutically beneficial in Alzheimer mouse models
122
+ showing that N-truncated A starting with position four in addition to pyroglutamate
123
+ A0-x is a relevant target to fight Alzheimer's disease.
124
+ sentences:
125
+ - Abstract Maternal hypoglycaemia throughout gestation until gestation day (GD)
126
+ delays foetal growth and skeletal development. While partially prevented by return
127
+ to normoglycaemia after completed organogenesis (GD00), underlying mechanisms
128
+ are not fully understood. Here, we investigated the pathogenesis of these changes
129
+ and significance of maternal hypoglycaemia extending beyond organogenesis in non-diabetic
130
+ rats. Pregnant rats received insulin-infusion until GD00 or GD00, with sacrifice
131
+ on GD00. Hypoglycaemia throughout gestation increased maternal corticosterone
132
+ levels, which correlated with foetal levels. Growth plates displayed central histopathologic
133
+ changes comprising disrupted cellular organisation, hypertrophic chondrocytes,
134
+ and decreased cellular density; expression of pro-angiogenic factors, HIF- and
135
+ VEGF-A increased in surrounding areas. Disproportionately decreased growth plate
136
+ zone volumes and lower expression of the structural protein MATN- were seen, while
137
+ bone ossification parameters were normal. Ending maternal/foetal hypoglycaemia
138
+ on GD00 reduced incidence and severity of histopathologic changes and with normal
139
+ growth plate volume. Compromised foetal skeletal development following maternal
140
+ hypoglycaemia throughout gestation is hypothesised to result from corticosterone-induced
141
+ hypoxia in growth plates, where hypoxia disrupts chondrocyte maturation and growth
142
+ plate structure and volume, decreasing long bone growth. Maternal/foetal hypoglycaemia
143
+ lasting only until GD00 attenuated these changes, suggesting a pivotal role of
144
+ glucose in growth plate development.
145
+ - The observation of significant neutron yield from gas loaded titanium samples
146
+ at Frascati in April opened up an alternate pathway to the investigation of anomalous
147
+ nuclear phenomena in deuterium/solid systems, complimenting the electrolytic approach.
148
+ Since then at least six different groups have successfully measured burst neutron
149
+ emission from deuterated titanium shavings following the Frascati methodology,
150
+ the special feature of which was the use of liquid nitrogen to create repeated
151
+ thermal cycles resulting in the production of nonequilibrium conditions in the
152
+ deuterated samples. At Trombay several variations of the gas loading procedure
153
+ have been investigated including induction heating of single machined titanium
154
+ targets in a glass chamber as well as use of a plasma focus device for deuteriding
155
+ its central titanium electrode. Stemming from earlier observations both at BARC
156
+ and elsewhere that tritium yield is times higher than neutron output in cold fusion
157
+ experiments, we have channelised our efforts to the search for tritium rather
158
+ than neutrons. The presence of tritium in a variety gas/plasma loaded titanium
159
+ samples has been established successfully through a direct measurement of the
160
+ radiations emitted as a result of tritium decay, in contradistinction to other
161
+ groups who have looked for tritium in the extracted gases. In some samples we
162
+ have thus observed tritium levels of over MBq with a corresponding (t/d) ratio
163
+ of .
164
+ - Two small areas of middle Paleozoic limestone were discovered near Gertrude Creek,
165
+ km north of Becharof Lake on the Alaska Peninsula, during reconnaissance flying
166
+ as part of the Alaska Mineral Resource Assessment Program (AMRAP) for the Alaska
167
+ Peninsula. Previously, the only known occurrence of Paleozoic rocks on the Alaska
168
+ Peninsula was a small exposure of middle Permian limestone on an island at the
169
+ entrance to Puale Bay (Hanson, ). This is the first reported occurrence of middle
170
+ Paleozoic rocks in what is considered to be a Mesozoic and Tertiary province.
171
+ - source_sentence: Nature Reserve now has become one of the foci of tourism.There
172
+ are a number of arguments and treaties on tourism exploitation in this special
173
+ area.Unfortunately,in the process of dealing with the conflicts between reservation
174
+ and exploition,we emphasizes the latter,and neglects its prerequisite-reservation;as
175
+ a result,inappropriate tourism development has destroyed the local ecosystem to
176
+ some extent.This article makes an inquiry into the advantages and factual condition
177
+ of tourism development in Nature Reserve,analyses emphatically the ecological
178
+ risks caused by blind tourism exploitation,points out that the Nature Reserve
179
+ should be exploited appropriately under protecting conditions and finally puts
180
+ forward the countermeasures against the problem.
181
+ sentences:
182
+ - This study involved studying fatigue crack propagation in elastic-plastic and
183
+ linear elastic fracture mechanics LEFM fracture mechanics EPFM for each bovine
184
+ and cadaveric human cortical bone. The results of the fatigue crack propagation
185
+ showed that the fatigue crack propagation in elastic-plastic fracture mechanics
186
+ is better than fatigue crack propagation in linear elastic fracture mechanics
187
+ for comparison of the bone at small frequencies. Therefore, fatigue crack growth
188
+ rate in cadaveric human bone is larger than bovine cortical bone. In addition,
189
+ the cutting of the bone by hand saw is the better method than any an electric
190
+ cutting machine.
191
+ - Bacteriolyses of bacterial cell walls by zinc () ions on the basis of the results
192
+ of halo antibacterial susceptibility tests were investigated for the nitrate and
193
+ the sulfate solutions.From the results obtained by halo antibacterial tests of
194
+ sulfate solutions against Staphylococcus epidermidis, the antibacterial order
195
+ is Zn + >Cu + >Ag + >Al + , in which Zn + ions indicate the highest antibacterial
196
+ effect.Bacteriolysis of S.aureus PGN cell wall by zinc ion is due to the inhibition
197
+ of PGN elongation by the activation of PGN autolysins of amidases and side-chain
198
+ endopeptidase.On the other hand, bacteriolysis of E.coli cell wall by zinc ions
199
+ is attributed to the destruction of outer membrane structure due to degradative
200
+ enzymes of lipoproteins at N-and C-terminals, and also is dependent on the activities
201
+ of PGN hydrolases and autolysins of amidases and carboxypeptidase-transpeptidase.Zinc
202
+ ions induced ROS such as O0 -, H0O0, OH, OH -producing in bacterial cell wall
203
+ occur oxidative stress.
204
+ - There are some different tendencies in Hu Feng and he Qifang's new-poetry-creation
205
+ (One is about the struggling at the bottom of society. The other is about the
206
+ reciting poetry with a cadence in the ivory tower. ) After engaged in the theoreti-cal
207
+ research, Hu has independent and individual theoretical character and he still
208
+ combines his theory with his creative experience from beginning to end. However,
209
+ He catches obvious dogmatism and often neglects the creative experience. While
210
+ some inde-pendent thoughts of latter is inwardly interlinked with the criticized
211
+ former. But each of them believes himself right. There is pro-found and deep cultural
212
+ connotation under social environment.
213
+ - source_sentence: The aim of the study is to describe our experience with ultrasound
214
+ guided drainage of tubo-ovarian abscess with concomitant use of antibiotics in
215
+ a second level center. Seven women diagnosed with a tubo-ovarian abscess and treated
216
+ with transvaginal ultrasound guided drainage with concomitant use of antibiotics,
217
+ between January and January , were reviewed. Intravenous antibiotics were administered
218
+ as soon as the diagnosis was reached and transvaginal ultrasound guided aspiration
219
+ of the abscess material was performed within hours with no need of anaesthesia.
220
+ Transvaginal route was used since it provides a better visualization and access
221
+ to the region of interest than other ultrasound routes. All cases but one ( %)
222
+ improved clinically within hours of aspiration and only one required surgery due
223
+ to refilling of a bilateral tubo-ovarian abscess hours after drainage. Mean hospital
224
+ stay was days (range - ). No procedure related complications were diagnosed. A
225
+ follow up ultrasound six months after the drainage showed in cases sonographic
226
+ markers of chronic tubal inflammatory disease but in all cases the patients remained
227
+ asymptomatic. Transvaginal ultrasound-guided drainage with concomitant antibiotics
228
+ appears to be a safe, efficacious and well tolerated procedure in the treatment
229
+ approach of tubo-ovarian abscess as reported in the literature. We consider this
230
+ approach as a feasible alternative to surgical drainage whenever indicated.
231
+ sentences:
232
+ - To compare the usefulness and accuracy of sonographically guided endometrial biopsies.
233
+ After obtaining informed consents endometrial biopsies were performed using ultrasound
234
+ guidance in patients followed by operative hysteroscopy. Diagnostic accuracy and
235
+ treatment efficiency for sono guidance were established. The hysteroscopic procedure
236
+ was in all cases started by using a fore-oblique mm hysteroscope (Karl Storz®️
237
+ Endoscopy-America, Inc. Corporate Pointe, Culver City, CA - ) with a diagnostic
238
+ sleeve through the cervical os (Karl Storz®️ Endoscopy-America, Inc. Corporate
239
+ Pointe, Culver City, CA - ), without prior dilatation of the cervix. The catheter
240
+ used for the polypectomy was an "Intrauterine Access Balloon Catheter" (Cook OB/GYN®️
241
+ West Morgan Street, P.O. Box , Spencer, Indiana ). Successful sonographic management
242
+ of the endometrial pathology had been achieved in patients ( %). Endometrial polyps
243
+ had been completely removed under sonographic guidance in patients, partially
244
+ in as confirmed by hysteroscopy. All incompletely removed polyps were of large
245
+ size (> cm), the remnants were taken out hysteroscopically. Targeted endometrial
246
+ biopsy was performed under sono guidance in patients. The completion of the procedure
247
+ was confirmed by hysteroscopy. Targeted endometrial biopsies and polyp removal
248
+ can be successfully performed under sonographic guidance. Large size endometrial
249
+ polyps may require hysteroscopy.
250
+ - Aqueous colloidal suspensions of clay platelets display a sol/gel transition that
251
+ is not yet understood. Depending on the nature of the clay, liquid-crystalline
252
+ behavior may also be observed. For example, the suspensions of beidellite display
253
+ a nematic phase whereas those of montmorillonite do not. Both beidellite and montmorillonite
254
+ have a "TOT" structure but the structural electric charge is located in the tetrahedral
255
+ layer for the former and in the octahedral layer for the latter. We built a setup
256
+ to perform SAXS experiments on complex fluids submitted to an electric field in
257
+ situ. We found that the fluid nematic phase of beidellite suspensions readily
258
+ aligns in the field. However, the field had no influence on the gels, showing
259
+ that the orientational degrees of freedom of the platelets are effectively frozen.
260
+ Moreover, strong platelet alignment was induced by the field in the isotropic
261
+ phase of both clays, in a similar way, regardless of their ability to form a nematic
262
+ phase. This surprising result would suggest that the orientational degrees of
263
+ freedom are not directly involved in the sol/gel transition. The ability to induce
264
+ orientational order in the isotropic phase of clay suspensions can be exploited
265
+ to prepare materials of controlled anisotropy.
266
+ - 'The article is devoted to the peculiarities of the paid domestic labor market
267
+ in the Russian economy. It is shown that this market is characterized by the following
268
+ features: weak state regulation; a high proportion of internal and external migrants;
269
+ a wide spread of the shadow economy and informal labor relations; gender differences;
270
+ the presence in the market of an "elite" segment of workers providing higher-quality
271
+ and highly paid services, and a segment of workers performing temporary, episodic
272
+ work. It is proved on the basis of market analysis that there is a predominant
273
+ demand for skilled labor, and wages are at or above the national average. It is
274
+ concluded that further efforts are needed to legalize the work of domestic workers
275
+ within the framework of the state employment policy.'
276
+ pipeline_tag: sentence-similarity
277
+ library_name: sentence-transformers
278
+ metrics:
279
+ - cosine_accuracy
280
+ model-index:
281
+ - name: SentenceTransformer based on allenai/specter2_aug2023refresh_base
282
+ results:
283
+ - task:
284
+ type: triplet
285
+ name: Triplet
286
+ dataset:
287
+ name: 'specter 2 '
288
+ type: specter_2_
289
+ metrics:
290
+ - type: cosine_accuracy
291
+ value: 0.934125
292
+ name: Cosine Accuracy
293
+ - task:
294
+ type: triplet
295
+ name: Triplet
296
+ dataset:
297
+ name: discipline tuned specter 2 010
298
+ type: discipline-tuned_specter_2_010
299
+ metrics:
300
+ - type: cosine_accuracy
301
+ value: 0.93575
302
+ name: Cosine Accuracy
303
+ ---
304
+
305
+ # SentenceTransformer based on allenai/specter2_aug2023refresh_base
306
+
307
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
308
+
309
+ ## Model Details
310
+
311
+ ### Model Description
312
+ - **Model Type:** Sentence Transformer
313
+ - **Base model:** [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base) <!-- at revision 084e9624d354a1cbc464ef6cc1e3646d236b95d9 -->
314
+ - **Maximum Sequence Length:** 512 tokens
315
+ - **Output Dimensionality:** 768 dimensions
316
+ - **Similarity Function:** Cosine Similarity
317
+ <!-- - **Training Dataset:** Unknown -->
318
+ <!-- - **Language:** Unknown -->
319
+ <!-- - **License:** Unknown -->
320
+
321
+ ### Model Sources
322
+
323
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
324
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
325
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
326
+
327
+ ### Full Model Architecture
328
+
329
+ ```
330
+ SentenceTransformer(
331
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
332
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
333
+ (2): Normalize()
334
+ )
335
+ ```
336
+
337
+ ## Usage
338
+
339
+ ### Direct Usage (Sentence Transformers)
340
+
341
+ First install the Sentence Transformers library:
342
+
343
+ ```bash
344
+ pip install -U sentence-transformers
345
+ ```
346
+
347
+ Then you can load this model and run inference.
348
+ ```python
349
+ from sentence_transformers import SentenceTransformer
350
+
351
+ # Download from the 🤗 Hub
352
+ model = SentenceTransformer("m7n/discipline-tuned_specter_2_010")
353
+ # Run inference
354
+ sentences = [
355
+ 'The aim of the study is to describe our experience with ultrasound guided drainage of tubo-ovarian abscess with concomitant use of antibiotics in a second level center. Seven women diagnosed with a tubo-ovarian abscess and treated with transvaginal ultrasound guided drainage with concomitant use of antibiotics, between January and January , were reviewed. Intravenous antibiotics were administered as soon as the diagnosis was reached and transvaginal ultrasound guided aspiration of the abscess material was performed within hours with no need of anaesthesia. Transvaginal route was used since it provides a better visualization and access to the region of interest than other ultrasound routes. All cases but one ( %) improved clinically within hours of aspiration and only one required surgery due to refilling of a bilateral tubo-ovarian abscess hours after drainage. Mean hospital stay was days (range - ). No procedure related complications were diagnosed. A follow up ultrasound six months after the drainage showed in cases sonographic markers of chronic tubal inflammatory disease but in all cases the patients remained asymptomatic. Transvaginal ultrasound-guided drainage with concomitant antibiotics appears to be a safe, efficacious and well tolerated procedure in the treatment approach of tubo-ovarian abscess as reported in the literature. We consider this approach as a feasible alternative to surgical drainage whenever indicated.',
356
+ 'To compare the usefulness and accuracy of sonographically guided endometrial biopsies. After obtaining informed consents endometrial biopsies were performed using ultrasound guidance in patients followed by operative hysteroscopy. Diagnostic accuracy and treatment efficiency for sono guidance were established. The hysteroscopic procedure was in all cases started by using a fore-oblique mm hysteroscope (Karl Storz®️ Endoscopy-America, Inc. Corporate Pointe, Culver City, CA - ) with a diagnostic sleeve through the cervical os (Karl Storz®️ Endoscopy-America, Inc. Corporate Pointe, Culver City, CA - ), without prior dilatation of the cervix. The catheter used for the polypectomy was an "Intrauterine Access Balloon Catheter" (Cook OB/GYN®️ West Morgan Street, P.O. Box , Spencer, Indiana ). Successful sonographic management of the endometrial pathology had been achieved in patients ( %). Endometrial polyps had been completely removed under sonographic guidance in patients, partially in as confirmed by hysteroscopy. All incompletely removed polyps were of large size (> cm), the remnants were taken out hysteroscopically. Targeted endometrial biopsy was performed under sono guidance in patients. The completion of the procedure was confirmed by hysteroscopy. Targeted endometrial biopsies and polyp removal can be successfully performed under sonographic guidance. Large size endometrial polyps may require hysteroscopy.',
357
+ 'The article is devoted to the peculiarities of the paid domestic labor market in the Russian economy. It is shown that this market is characterized by the following features: weak state regulation; a high proportion of internal and external migrants; a wide spread of the shadow economy and informal labor relations; gender differences; the presence in the market of an "elite" segment of workers providing higher-quality and highly paid services, and a segment of workers performing temporary, episodic work. It is proved on the basis of market analysis that there is a predominant demand for skilled labor, and wages are at or above the national average. It is concluded that further efforts are needed to legalize the work of domestic workers within the framework of the state employment policy.',
358
+ ]
359
+ embeddings = model.encode(sentences)
360
+ print(embeddings.shape)
361
+ # [3, 768]
362
+
363
+ # Get the similarity scores for the embeddings
364
+ similarities = model.similarity(embeddings, embeddings)
365
+ print(similarities.shape)
366
+ # [3, 3]
367
+ ```
368
+
369
+ <!--
370
+ ### Direct Usage (Transformers)
371
+
372
+ <details><summary>Click to see the direct usage in Transformers</summary>
373
+
374
+ </details>
375
+ -->
376
+
377
+ <!--
378
+ ### Downstream Usage (Sentence Transformers)
379
+
380
+ You can finetune this model on your own dataset.
381
+
382
+ <details><summary>Click to expand</summary>
383
+
384
+ </details>
385
+ -->
386
+
387
+ <!--
388
+ ### Out-of-Scope Use
389
+
390
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
391
+ -->
392
+
393
+ ## Evaluation
394
+
395
+ ### Metrics
396
+
397
+ #### Triplet
398
+
399
+ * Datasets: `specter_2_` and `discipline-tuned_specter_2_010`
400
+ * Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
401
+
402
+ | Metric | specter_2_ | discipline-tuned_specter_2_010 |
403
+ |:--------------------|:-----------|:-------------------------------|
404
+ | **cosine_accuracy** | **0.9341** | **0.9357** |
405
+
406
+ <!--
407
+ ## Bias, Risks and Limitations
408
+
409
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
410
+ -->
411
+
412
+ <!--
413
+ ### Recommendations
414
+
415
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
416
+ -->
417
+
418
+ ## Training Details
419
+
420
+ ### Training Dataset
421
+
422
+ #### Unnamed Dataset
423
+
424
+
425
+ * Size: 40,000 training samples
426
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
427
+ * Approximate statistics based on the first 1000 samples:
428
+ | | anchor | positive | negative |
429
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
430
+ | type | string | string | string |
431
+ | details | <ul><li>min: 75 tokens</li><li>mean: 231.88 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 228.45 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 83 tokens</li><li>mean: 238.29 tokens</li><li>max: 512 tokens</li></ul> |
432
+ * Samples:
433
+ | anchor | positive | negative |
434
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
435
+ | <code>Self-report checklists are used to assess computer workstation set up, typically by workers not trained in ergonomic assessment or checklist interpretation.Though many checklists exist, few have been evaluated for reliability and validity.This study examined reliability and validity of the Computer Workstation Checklist (CWC) to identify mismatches between workers' self-reported workstation problems.The CWC was completed at baseline and at month to establish reliability. Validity was determined with CWC baseline data compared to an onsite workstation evaluation conducted by an expert in computer workstation assessment.Reliability ranged from fair to near perfect (prevalence-adjusted bias-adjusted kappa, - ); items with the strongest agreement were related to the input device, monitor, computer table, and document holder. The CWC had greater specificity ( of items) than sensitivity ( of items). The positive predictive value was greater than the negative predictive value for all question...</code> | <code>The support of good management is fundamental to the success of any safety and health program. Residential construction is a high-risk industry requiring significant commitment by management to impact day-to-day safety and health challenges. Investigators have evaluated management practices and spending trends in a cohort of residential homebuilders in the Denver metro area of Colorado. Findings suggest that companies significantly increased dollars allocated to support safety and health practices between and . In addition, the HomeSafe Pilot Program has positively impacted financial commitments of partner companies. Resource allocations were significantly greater for specific expense categories when comparing pre to post HomeSafe intervention. This paper presents data on the use of written safety and health programs, safety committees, and workers compensation premium cost containment certification, as well as allocations to safety incentive programs (SIP), personal protective equipme...</code> | <code>Abstract Background Traumatic brain injury (TBI) occurs in as many as million people worldwide each year and often results in one or more post-traumatic syndromes, including depression, cognitive, emotional, and behavioral deficits. TBI can also increase seizure susceptibility, as well as increase the incidence of epilepsy, a phenomenon known as post-traumatic epilepsy (PTE). Injury type and severity appear to partially predict PTE susceptibility. However, a complete mechanistic understanding of risk factors for PTE is incomplete. Main body From the earliest days of modern neuroscience, to the present day, accumulating evidence supports a significant role for neuroinflammation in the post-traumatic epileptogenic progression. Notably, substantial evidence indicates a role for astrocytes, microglia, chemokines, and cytokines in PTE progression. Although each of these mechanistic components is discussed in separate sections, it is highly likely that it is the totality of cellular and neur...</code> |
436
+ | <code>Using a rabbit in vivo joint injury model, the primary objective of the study was to determine if a relationship exists between earlier time to initiation of ketotifen fumarate (KF) treatment and posttraumatic joint contracture (PTJC) reduction. The secondary objective was to determine if a coagulation response could be detected with serial thrombelastography (TEG) analysis following acute trauma in this model.PTJC of the knee were created in skeletally mature, New Zealand White rabbits. Five groups of animals were studied: a control group that received twice daily subcutaneous injections of normal saline and treatment groups that received twice daily subcutaneous injections of KF ( mg/kg) starting immediately, -, -, and -weeks post-injury. After weeks of immobilization, flexion contractures were measured biomechanically. Serial TEG analysis was performed on the control group animals pre-injury and weekly post-injury.The average joint contracture in the Control Group ( ) was higher tha...</code> | <code>To compare inpatient compliance with venous thromboembolism prophylaxis regimens.A secondary analysis of patients enrolled in the ADAPT (A Different Approach to Preventing Thrombosis) randomized controlled trial.Level I trauma center.Patients with operative extremity or any pelvic or acetabular fracture requiring venous thromboembolism prophylaxis.We compared patients randomized to receive either low molecular weight heparin (LMWH) mg or aspirin mg BID during their inpatient admission.The primary outcome measure was the number of doses missed compared with prescribed number of doses.A total of patients were randomized to receive either LMWH mg BID ( patients) or aspirin mg BID ( patients). No differences observed in percentage of patients who missed a dose (aspirin: % vs LMWH: %, P = ) or mean number of missed doses ( vs doses, P = ). The majority of patients ( %, n = ) did not miss any doses. Missed doses were often associated with an operation.These data should reassure clinicians th...</code> | <code>In treatment of dementia, further to the use of medicine, methodological approaches have shown positive results as to the improvement of the people's condition, by employing cognitive, relational, behavioral stimulation techniques, or intervention on the surroundings. The aim of this research file is to verify the efficacy of BAPNE method as a cognitive and relational stimulation tool, on elderly patients diagnosed with Alzheimer's disease or with other kind of mild to moderate dementia. Scientific research has already given evidence of positive results of the BAPNE method on people with mild impairment, in particular concerning the executive functions. In this experiment, a sample group of elderly patients will undergo a cycle of sessions; the estimation of the quantitative results will be determined by comparing the data of the experimental sample group ( elderly patients), with those of the control group ( elderly patients). The cognitive functions and the executive functions will b...</code> |
437
+ | <code>Objective To examine the validity and usefulness of pandemic simulations aimed at informing practical decision-making in public health.Methods We recruited a multidisciplinary group of nine experts to assess a case-study simulation of influenza transmission in a Swedish county.We used a non-statistical nominal group technique to generate evaluations of the plausibility, formal validity (verification) and predictive validity of the simulation.A health-effect assessment structure was used as a framework for data collection.Findings The unpredictability of social order during disasters was not adequately addressed by simulation methods; even minor disruptions of the social order may invalidate key infrastructural assumptions underpinning current pandemic simulation models.Further, a direct relationship between model flexibility and computation time was noted.Consequently, simulation methods cannot, in practice, support integrated modifications of microbiological, epidemiological and spati...</code> | <code>With the onset of the coronavirus disease (COVID- ) pandemic, public health measures such as physical distancing were recommended to reduce transmission of the virus causing the disease. However, the same approach in all areas, regardless of context, may lead to measures being of limited effectiveness and having unforeseen negative consequences, such as loss of livelihoods and food insecurity. A prerequisite to planning and implementing effective, context-appropriate measures to slow community transmission is an understanding of any constraints, such as the locations where physical distancing would not be possible. Focusing on sub-Saharan Africa, we outline and discuss challenges that are faced by residents of urban informal settlements in the ongoing COVID- pandemic. We describe how new geospatial data sets can be integrated to provide more detailed information about local constraints on physical distancing and can inform planning of alternative ways to reduce transmission of COVID- b...</code> | <code>Since , the Australian Aboriginal and Torres Strait Islander Health Performance Framework (HPF) reports have provided information about Indigenous Australians' health outcomes. The HPF was designed, in consultation with Indigenous stakeholder groups, to promote accountability and inform policy and research. This paper explores bridging the HPF as a theoretical construct and the publicly available data provided against its measures. A whole-of-framework, whole-of-system monitoring perspective was taken to summarise eligible indicators at the state/territory level, organised by the HPF's tier and group hierarchy. Data accompanying the and reports were used to compute improvement over time. Unit change and confidence indicators were developed to create an abstract but interpretable improvement score suitable for aggregation and visualisation at scale. The result is an exploratory methodology that summarises changes over time. An example dashboard visualisation is presented. The use of sec...</code> |
438
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
439
+ ```json
440
+ {
441
+ "distance_metric": "TripletDistanceMetric.COSINE",
442
+ "triplet_margin": 0.3
443
+ }
444
+ ```
445
+
446
+ ### Evaluation Dataset
447
+
448
+ #### Unnamed Dataset
449
+
450
+
451
+ * Size: 2,000 evaluation samples
452
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
453
+ * Approximate statistics based on the first 1000 samples:
454
+ | | anchor | positive | negative |
455
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
456
+ | type | string | string | string |
457
+ | details | <ul><li>min: 80 tokens</li><li>mean: 231.73 tokens</li><li>max: 509 tokens</li></ul> | <ul><li>min: 84 tokens</li><li>mean: 236.04 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 233.46 tokens</li><li>max: 512 tokens</li></ul> |
458
+ * Samples:
459
+ | anchor | positive | negative |
460
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
461
+ | <code>Abstract Objective This prospective 0year longitudinal study examined the use of coping styles of fathers and mothers of pediatric cancer patients over time and the prospective effects of coping on distress. Methods Psychological distress (General Health Questionnaire) and the use of seven coping styles (Utrecht Coping List: active problem focussing, palliative and passive reaction patterns, avoidance, social support seeking, expression of emotions, and comforting cognition) were assessed in parents shortly after diagnosis, and months, and years later. Results At diagnosis, parents' use of coping styles did not differ from the norm population except more frequent use of support seeking. No significant change over time was found in a palliative reaction pattern. Support seeking declined and emotional expression increased linearly, whereas use of the remaining coping styles decreased, followed by an increase. At years, parents' use differed from the norm population only in less use of ex...</code> | <code>Abstract Objective Event centrality, the degree to which a traumatic event is perceived as central to one's identity, has been associated with posttraumatic stress (PTS) symptoms and posttraumatic growth (PTG) outcomes in various trauma samples. Trauma frameworks are widely used to understand the psychological impact of pediatric cancer; however, event centrality has not been studied in this population. We investigated event centrality in pediatric cancer survivors and healthy comparisons, and its relation with PTS and PTG outcomes. Method Cancer survivors, age ( N = ) and healthy comparisons ( N = ) completed the Centrality of Events Scale and PTS and PTG measures in reference to their most traumatic life event. Cancer survivors who first identified a noncancerrelated event repeated all measures in reference to cancer. Results Centrality scores were significantly higher when referencing cancer compared to noncancer events, even in survivors for whom cancer was not rated as most stress...</code> | <code>Abstract Introduction To assess the reliability of short versions of the Australian National University Alzheimer's Disease Risk Index (ANUADRI). Methods A short form of the ANUADRI (ANUADRISF) was developed by assessing risk and protective factors with single questions where possible and with short forms of subquestionnaires where available. The tick box form of the ANUADRI (ANUADRITB) was developed with unique questions for each risk and protective factor for Alzheimer's disease. The short versions were evaluated in an independent community sample of participants with a mean age of (SD = , range = ). Results The short versions demonstrated high reliabilities when compared with the ANUADRI. However, the proportion of misclassification was high for some risk factors and particularly for the ANUADRITB. Discussion The ANUADRISF may be considered if less reliable questions from the ANUADRISF can be replaced with more reliable questions from the ANUADRI for risk/protective factors with hig...</code> |
462
+ | <code>The effects of glucocorticoids on estrogen-induced changes in LH secretion in the ovariectomized rat and on the estrous cycle and gonadotropin levels in the intact female rat were studied. Preliminary experiments showed that multiple injections of dexamethasone or triamcinolone acetonide (TA) inhibited the estradiol benzoate (EB)-induced elevation of LH in the ovariectomized rat. In subsequent experiments, a single injection of TA was found to inhibit the EB-induced elevation in LH in a dose-dependent manner (minimal effective dose, g) when given h after EB but not at times before EB. Single injections of dexamethasone, cortisol, or progesterone given at this time did not alter LH release. TA given h after EB also blocked the estrogen-dependent increase in pituitary responsiveness to LHRH and the priming effect of multiple injections of LHRH. The pituitary response in oil controls given TA was not altered. Cortisol implants which maintained continuously elevated levels of plasma cortis...</code> | <code>Abstract Hindbrain adrenergic/noradrenergic nuclei facilitate endocrine and autonomic responses to physical and psychological challenges. Neurons that synthesize adrenaline and noradrenaline target hypothalamic structures to modulate endocrine responses while descending spinal projections regulate sympathetic function. Furthermore, these neurons respond to diverse stress-related metabolic, autonomic, and psychosocial challenges. Accordingly, adrenergic and noradrenergic nuclei are integrative hubs that promote physiological adaptation to maintain homeostasis. However, the precise mechanisms through which adrenaline- and noradrenaline-synthesizing neurons sense interoceptive and exteroceptive cues to coordinate physiological responses have yet to be fully elucidated. Additionally, the regulatory role of these cells in the context of chronic stress has received limited attention. This mini-review consolidates reports from preclinical rodent studies on the organization and function of bra...</code> | <code>Abstract This paper will describe the scope of the Drilling, Completion, and Subsea construction activities and the approach taken by the BP Atlantis Wells Delivery Team in planning and execution. The BP Atlantis Wells Delivery Team recognized early that in order to efficiently execute all of the drilling, completion, subsea construction, and tie back operations to the producing facility, a very disciplined Project Planning and Scheduling approach would be required. A group of dedicated, competent scheduling professionals were assigned to the Drilling and Completion (D&C) Team and proved instrumental to the successful outcome. The D&C scheduling professionals complemented the other professional schedulers strategically selected for each of the project's necessary functional teams and key construction sites. The D&C Team started gaining competency in true project management through development and recruitment as early as three years ( ) prior to the start of development operations. Atla...</code> |
463
+ | <code>A discharging ear is the most common presenting symptom for ENT conditions. However, some degree of hearing loss is always present. In order to compare the degree of hearing impairment with the size and location of the perforation, we made an effort to conduct this study. The purpose of the study is to ascertain whether, and if so, what, a relationship exists between the location and extent of the tympanic membrane perforation and the severity of hearing loss. In a systematic scoping review of randomized controlled trials, each database was subjected to a unique systematic search approach. Utilizing the methodological approaches specified in the Cochrane Handbook for Systematic Reviewers, a systematic scoping review is conducted after selection criteria, with results reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA). Tympanic membrane anomalies are the root cause of various degrees of conducive deafness. The size of the perforat...</code> | <code>Most head and neck cancers are derived from the mucosal epithelium in the oral cavity, pharynx andlarynx and are known collectively as head and neck squamous cell carcinoma (HNSCC). Oral cavity cancers are generally associated with tobacco consumption, alcohol abuse,exposure to environmental pollutants and infection with viral agents, namely HPV and EBV or both, whereaspharynx cancers are increasingly attributed to infection with humanpapillomavirus (HPV), primarilyHPV- . Despiteevidence of histological progression from cellular atypia through various degrees of dysplasia,ultimately leading to invasive HNSCC, most patients are diagnosed with late-stage HNSCC without a clinically evident pre malignant lesion.</code> | <code>This article reflects on the capacity of Dante's Comedy, through its words and images, to permeate cultures of different eras. It may be viewed as more than a central element of culture, and as an open work characterised by fluidity and change. This essay, after examining cinematographic and literature examples, attempts to show the Comedy as an important piece of evolving semantic structure, able to resettle in many generations' imagery, perhaps even to mark the genealogy of western representation. If Dante can be understood as a classic suitable to be examined in several worlds and times, his Purgatory may be viewed as a cantica that gives voice and body to typical features of modernity in its current phase. Keywords: Sociologia della letteratura, comunicazione, Purgatorio, modernita, industria culturale</code> |
464
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
465
+ ```json
466
+ {
467
+ "distance_metric": "TripletDistanceMetric.COSINE",
468
+ "triplet_margin": 0.3
469
+ }
470
+ ```
471
+
472
+ ### Training Hyperparameters
473
+ #### Non-Default Hyperparameters
474
+
475
+ - `eval_strategy`: steps
476
+ - `learning_rate`: 1e-05
477
+ - `weight_decay`: 0.01
478
+ - `num_train_epochs`: 1
479
+ - `warmup_ratio`: 0.1
480
+ - `batch_sampler`: no_duplicates
481
+
482
+ #### All Hyperparameters
483
+ <details><summary>Click to expand</summary>
484
+
485
+ - `overwrite_output_dir`: False
486
+ - `do_predict`: False
487
+ - `eval_strategy`: steps
488
+ - `prediction_loss_only`: True
489
+ - `per_device_train_batch_size`: 8
490
+ - `per_device_eval_batch_size`: 8
491
+ - `per_gpu_train_batch_size`: None
492
+ - `per_gpu_eval_batch_size`: None
493
+ - `gradient_accumulation_steps`: 1
494
+ - `eval_accumulation_steps`: None
495
+ - `torch_empty_cache_steps`: None
496
+ - `learning_rate`: 1e-05
497
+ - `weight_decay`: 0.01
498
+ - `adam_beta1`: 0.9
499
+ - `adam_beta2`: 0.999
500
+ - `adam_epsilon`: 1e-08
501
+ - `max_grad_norm`: 1.0
502
+ - `num_train_epochs`: 1
503
+ - `max_steps`: -1
504
+ - `lr_scheduler_type`: linear
505
+ - `lr_scheduler_kwargs`: {}
506
+ - `warmup_ratio`: 0.1
507
+ - `warmup_steps`: 0
508
+ - `log_level`: passive
509
+ - `log_level_replica`: warning
510
+ - `log_on_each_node`: True
511
+ - `logging_nan_inf_filter`: True
512
+ - `save_safetensors`: True
513
+ - `save_on_each_node`: False
514
+ - `save_only_model`: False
515
+ - `restore_callback_states_from_checkpoint`: False
516
+ - `no_cuda`: False
517
+ - `use_cpu`: False
518
+ - `use_mps_device`: False
519
+ - `seed`: 42
520
+ - `data_seed`: None
521
+ - `jit_mode_eval`: False
522
+ - `use_ipex`: False
523
+ - `bf16`: False
524
+ - `fp16`: False
525
+ - `fp16_opt_level`: O1
526
+ - `half_precision_backend`: auto
527
+ - `bf16_full_eval`: False
528
+ - `fp16_full_eval`: False
529
+ - `tf32`: None
530
+ - `local_rank`: 0
531
+ - `ddp_backend`: None
532
+ - `tpu_num_cores`: None
533
+ - `tpu_metrics_debug`: False
534
+ - `debug`: []
535
+ - `dataloader_drop_last`: False
536
+ - `dataloader_num_workers`: 0
537
+ - `dataloader_prefetch_factor`: None
538
+ - `past_index`: -1
539
+ - `disable_tqdm`: False
540
+ - `remove_unused_columns`: True
541
+ - `label_names`: None
542
+ - `load_best_model_at_end`: False
543
+ - `ignore_data_skip`: False
544
+ - `fsdp`: []
545
+ - `fsdp_min_num_params`: 0
546
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
547
+ - `fsdp_transformer_layer_cls_to_wrap`: None
548
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
549
+ - `deepspeed`: None
550
+ - `label_smoothing_factor`: 0.0
551
+ - `optim`: adamw_torch
552
+ - `optim_args`: None
553
+ - `adafactor`: False
554
+ - `group_by_length`: False
555
+ - `length_column_name`: length
556
+ - `ddp_find_unused_parameters`: None
557
+ - `ddp_bucket_cap_mb`: None
558
+ - `ddp_broadcast_buffers`: False
559
+ - `dataloader_pin_memory`: True
560
+ - `dataloader_persistent_workers`: False
561
+ - `skip_memory_metrics`: True
562
+ - `use_legacy_prediction_loop`: False
563
+ - `push_to_hub`: False
564
+ - `resume_from_checkpoint`: None
565
+ - `hub_model_id`: None
566
+ - `hub_strategy`: every_save
567
+ - `hub_private_repo`: None
568
+ - `hub_always_push`: False
569
+ - `gradient_checkpointing`: False
570
+ - `gradient_checkpointing_kwargs`: None
571
+ - `include_inputs_for_metrics`: False
572
+ - `include_for_metrics`: []
573
+ - `eval_do_concat_batches`: True
574
+ - `fp16_backend`: auto
575
+ - `push_to_hub_model_id`: None
576
+ - `push_to_hub_organization`: None
577
+ - `mp_parameters`:
578
+ - `auto_find_batch_size`: False
579
+ - `full_determinism`: False
580
+ - `torchdynamo`: None
581
+ - `ray_scope`: last
582
+ - `ddp_timeout`: 1800
583
+ - `torch_compile`: False
584
+ - `torch_compile_backend`: None
585
+ - `torch_compile_mode`: None
586
+ - `dispatch_batches`: None
587
+ - `split_batches`: None
588
+ - `include_tokens_per_second`: False
589
+ - `include_num_input_tokens_seen`: False
590
+ - `neftune_noise_alpha`: None
591
+ - `optim_target_modules`: None
592
+ - `batch_eval_metrics`: False
593
+ - `eval_on_start`: False
594
+ - `use_liger_kernel`: False
595
+ - `eval_use_gather_object`: False
596
+ - `average_tokens_across_devices`: False
597
+ - `prompts`: None
598
+ - `batch_sampler`: no_duplicates
599
+ - `multi_dataset_batch_sampler`: proportional
600
+
601
+ </details>
602
+
603
+ ### Training Logs
604
+ | Epoch | Step | Training Loss | Validation Loss | specter_2__cosine_accuracy | discipline-tuned_specter_2_010_cosine_accuracy |
605
+ |:-----:|:----:|:-------------:|:---------------:|:--------------------------:|:----------------------------------------------:|
606
+ | 0 | 0 | - | - | 0.8939 | - |
607
+ | 0.02 | 100 | 0.1822 | 0.1227 | 0.9083 | - |
608
+ | 0.04 | 200 | 0.0858 | 0.0739 | 0.9191 | - |
609
+ | 0.06 | 300 | 0.0697 | 0.0634 | 0.9251 | - |
610
+ | 0.08 | 400 | 0.0553 | 0.0584 | 0.9284 | - |
611
+ | 0.1 | 500 | 0.0539 | 0.0552 | 0.9316 | - |
612
+ | 0.12 | 600 | 0.0599 | 0.0542 | 0.9329 | - |
613
+ | 0.14 | 700 | 0.0492 | 0.0494 | 0.934 | - |
614
+ | 0.16 | 800 | 0.0552 | 0.0495 | 0.9341 | - |
615
+ | 0.18 | 900 | 0.051 | - | - | 0.9357 |
616
+
617
+
618
+ ### Framework Versions
619
+ - Python: 3.10.12
620
+ - Sentence Transformers: 3.3.1
621
+ - Transformers: 4.49.0.dev0
622
+ - PyTorch: 2.5.1+cu121
623
+ - Accelerate: 1.2.1
624
+ - Datasets: 3.2.0
625
+ - Tokenizers: 0.21.0
626
+
627
+ ## Citation
628
+
629
+ ### BibTeX
630
+
631
+ #### Sentence Transformers
632
+ ```bibtex
633
+ @inproceedings{reimers-2019-sentence-bert,
634
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
635
+ author = "Reimers, Nils and Gurevych, Iryna",
636
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
637
+ month = "11",
638
+ year = "2019",
639
+ publisher = "Association for Computational Linguistics",
640
+ url = "https://arxiv.org/abs/1908.10084",
641
+ }
642
+ ```
643
+
644
+ #### TripletLoss
645
+ ```bibtex
646
+ @misc{hermans2017defense,
647
+ title={In Defense of the Triplet Loss for Person Re-Identification},
648
+ author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
649
+ year={2017},
650
+ eprint={1703.07737},
651
+ archivePrefix={arXiv},
652
+ primaryClass={cs.CV}
653
+ }
654
+ ```
655
+
656
+ <!--
657
+ ## Glossary
658
+
659
+ *Clearly define terms in order to be accessible across audiences.*
660
+ -->
661
+
662
+ <!--
663
+ ## Model Card Authors
664
+
665
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
666
+ -->
667
+
668
+ <!--
669
+ ## Model Card Contact
670
+
671
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
672
+ -->
config.json ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "allenai/specter2_aug2023refresh_base",
3
+ "adapters": {
4
+ "adapters": {},
5
+ "config_map": {},
6
+ "fusion_config_map": {},
7
+ "fusions": {}
8
+ },
9
+ "architectures": [
10
+ "BertModel"
11
+ ],
12
+ "attention_probs_dropout_prob": 0.1,
13
+ "classifier_dropout": null,
14
+ "hidden_act": "gelu",
15
+ "hidden_dropout_prob": 0.1,
16
+ "hidden_size": 768,
17
+ "initializer_range": 0.02,
18
+ "intermediate_size": 3072,
19
+ "layer_norm_eps": 1e-12,
20
+ "max_position_embeddings": 512,
21
+ "model_type": "bert",
22
+ "num_attention_heads": 12,
23
+ "num_hidden_layers": 12,
24
+ "pad_token_id": 0,
25
+ "position_embedding_type": "absolute",
26
+ "torch_dtype": "float32",
27
+ "transformers_version": "4.49.0.dev0",
28
+ "type_vocab_size": 2,
29
+ "use_cache": true,
30
+ "vocab_size": 31090
31
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.3.1",
4
+ "transformers": "4.49.0.dev0",
5
+ "pytorch": "2.5.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87c3ba46b2f5e2735f4f4fe344df88cbea4b3b45967de00dea95b3a158891308
3
+ size 439696224
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "101": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "102": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "103": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "104": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "model_max_length": 512,
51
+ "never_split": null,
52
+ "pad_token": "[PAD]",
53
+ "sep_token": "[SEP]",
54
+ "strip_accents": null,
55
+ "tokenize_chinese_chars": true,
56
+ "tokenizer_class": "BertTokenizer",
57
+ "unk_token": "[UNK]"
58
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff