gte-small-finetuned / README.md
deepapaikar's picture
Upload folder using huggingface_hub
74e0d23 verified
---
base_model: thenlper/gte-small
library_name: sentence-transformers
pipeline_tag: sentence-similarity
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:4319
- loss:MultipleNegativesRankingLoss
widget:
- source_sentence: whine stare jelli comfort fairmount former poni guttur innoc latitud
ceas firm spoil impress base sentiment aeroplan globe usurp monogram keen frau
opposit reimburs express ever craze oil bade directric save notic helmet proper
schmierkäs pale engrav chateau pair sleep cautious audibl squar disinclin keeper
gosh ravag brigandish stammer death colleg suit treason year storekeep rib wake
gaunt appear perforc afterward intim noisi goodi fear illumin marvel mantel volum
health belief soften conclus rode social sip tip mina wing suppli tommi slight
pomatum bombard simpli foreign mustard leg semi twist bend helen flush rough wound
rifl unqualifi shill soon enorm gloomi depress scratch soil car broken slam count
grace millineri pour swirl depriv smell strode direct wholli method modest shift
immigr finger leon differ worker fetch advis hord affair ardmor rivet white four
occup burst ridg exclam knuckl interest dead dentist wreckag drizzl kraus famous
chariti vite permiss jabez written twice pile loyalti frenchman shine wave meat
rang civilian deserv forev slip halt see charact feed foremost uniform chime within
suspect mount bash often lath outlin aim straw distract past erupt seren whoever
whenev fall doughboy remain scoot lawn task spectat altar bavarian spirit either
sold individu hillsid easili wick dream brain guess floor start decor incid dramat
condit guest accus monteith rumbl camp wherev truer blanket brown puzzl distanc
sight addit head withdrew everybodi ahead morn sought lost besid fli snatch local
mutil contribut pat cover behalf contest whistl futur hope patriot weari law area
brancardi block nose push girl conceiv offer fisherman stroll appeal bail stone
suppos caught amaz subsid bundl tack poorest roadsid princ miss clamor held retir
felt signific annex 1918 wild come whether demur mutter hesit panopli hetti soundless
felin patri bombast dyke concoct busili result impati covert stall mademoisell
danger moon grave abrupt dim goggl light overturn brought imaginari statement
commiss unbroken found
sentences:
- Is the content related to fiction genere
- Is the content related to non-fiction genere
- Is the content related to fiction genere
- source_sentence: condemn elaps reunion sword swept brow file kept hors high love
laughter phase heavier roof screen process fire skill deck moan remov moqui turn
welfar suspici buri countersign accept shrewd life cloud enjoy stiff rave ripe
insist shut pressur rule barb contact horizont handcart nutshel circumst oblig
attract summer brew sens gas raptur lest glimps depart ought rattler scene boat
tone price good famili valu wooden machin wheelwright dismiss instrument soul
self cannon scotch reconsid cling pall unfold temporarili influenc esteem astir
first tide stock lamp decid hush sign none choic note particl fizzl call entrench
steel retail horribl retain throw els temptat follow terribl labor kiss vers prouder
dawn cane anthoni wear quarter ransom flea unkind charg stream namur anoth skillet
glad bustl woebegon greater emin cord batch dame badg briefli shini answer lodg
bolt east suspicion milk lookout pronounc detect villa conclud hurt heard sprinter
charl neither wrinkl look associ reach pilot spite furi messag copybook dispos
rush plummet commenc adventur eventu left emperor strongest thank popul truth
chooser feroci gruel smash without halfbre bled possess engin rub athlet sympathet
riski bold thirti undertak surmount astronomi told citizen furnitur tenac do laddi
albert netherland princip assail brief poss hero drew swift rake philip victim
broil rate object thereof higher press discov conjectur cement clumsi tribut whirl
unaid great assur burnt alter bridg invad sprawl succeed think valuabl conquer
billow molest shaki motiv develop ruefulli bullet pretend dread special unruli
insinu confin vein reckon defi supplement sale popular ghost unpleas opportun
zeppelin heap pigskin readi fame sore forlorn seventh luncheon difficulti oven
sledg meredith interrupt linen sank live mistak hast cherish ambuscad mistaken
egg bridl whole neck snake pulp even cours gallant vocabulari protest repent tubbi
anchorag stay shuttl import allay plenti convict blindfold thousand timber crown
owner boundari echo suffic poke nearer
sentences:
- Is the content related to romance genere
- Is the content related to romance genere
- Is the content related to non-fiction genere
- source_sentence: hugo fume immortell memori shrine salut end withdraw potenti famin
stain scrawl gross avranch accomplish forgo queen tardet torn spoke rhone freedom
anglo priest boar bohemia rubicon discern collar myth lie captur cite uninstruct
waterwork child exist recoup eav commonwealth algebra hundr uphil whimsic wit
dignifi agreement trap draft ripost strait excit asylum conceal alfr nervii william
ask footnot inmost astonish hypothesi crisi spur deepest coalesc bottl fabric
pillar domqueur omnibus provoc cliff stuf spectacl rare mosqu hunger upkeep visit
magic scot melbourn frozen bind visigoth band portus complain smallest glasgow
irrig obtain starv hoof mould passion trial russia lad signal acquiesc john keat
sheet huntsman acceler immun easier trouser papaci mental expans ear roncesvall
topograph napl increas stupend bayeux fulli household tomfooleri briberi argonn
iliad contour elsewher inevit curios clean exterior snow generous normal leather
1030 mankind agincourt product apologia laughabl outer northeast stead appal kick
indic limeston algier river stage bout indomit residenti measur paradis vengeanc
statist observ ball aumal scientif guardian impecc rhetor crimin majest visibl
doorstep dauphin blown mantelshelf bethlehem earlier peculiar compli snort rocquefort
snowstorm tore llygnant expound unroof thenc misfortun tenpenc bookstal crosier
bowl reform expect fantast £100 barg prima notabl rock malign croker southward
scoundrel margin militari capstan injur decay prime amount sprang decis attitud
incapac brood monday econom valparaiso evid dimmer drought receptacl aesthet walk
fraud truli 1905 volley allen leader rumin troubl activ irredeem harmoni headlong
slant prig guadarrama peasantri began coupl climax british profound ayrshir tenth
convey platform will drunk proport cinematograph imago mother prolong equabl brick
approxim search foothil cun truest crusad report woman step tenur might anim frame
gaze threaten bosh renom explan whose crier parasit compel quest surviv lisieux
monograph hunt real
sentences:
- Is the content related to non-fiction genere
- Is the content related to romance genere
- Is the content related to romance genere
- source_sentence: argu hubertmil grace copious alphabet plombier beaumont fete fontanel
profus treasuri boyer fleurieu unworthili subprefect stuttgart majesti disinterest
hilar perish finger valencienn hord quell occup varieti madman diabl preoccupi
interest gras ordeal legion soap nail tract ostrich infanta imbu swaddl picturesqu
cent canon robust blasphemi titl pfister slay provenc outlin award indol past
pillag erupt sweat lawn remain fortif spectat pallid decreas resum gros heartrend
dizzi costaz hillsid guess sourc umbrella talleyrand kremlin overshadow dramat
condit blotch purport drank flore wherev circus chessman addit withdrew vien regim
benign 1792 brenta nazzolini cover snatch local contest contribut sabin victoir
walsh joiner necessarili wealth law talisman payment ivri vent raider push pichegru
beri conceiv pleiss haversack epidem steepl allot subsid embroid religi volney
broadest clamor summari wild inexhaust serent come transmiss vase midway danger
lill warrior guastalla gravest canouvill stormi partridg solemn drawer bray benefact
hurri ladl uphold chicken bereft ghiesubel let cower pacifi dragoon element smother
inexcus humili yield engag intermingl camel pillow must adversari modern heavili
writer porcelain harvill thing leav horizon baudemont freder carnot relat phrase
mistress adolph 1814 postur sordi moskwa quadrill alp arrang buit devast renown
deepli wismar three belliger domin ness disappoint cordial mahogani sampl 1788
poultri orosman surrend succumb commerc late promulg reput bouill legislatur compar
finess fair carbonari round danc crush particip move mold huddl benumb overlook
cleopatra varengo erad delic rais trumpet emili xviii calcul specimen suburb geograph
ukrain surveil disparag sceptr hinguerlot enlarg discus steer aright assum maul
loud oper cher immedi exploit fesch wealthi filial lieg imperator sleepi marriag
nich rise pend baudin revolv stupor fool voic manner malet thereupon cargo orfevr
mountain general chariot moldrecht immens amend sulmett allevi flew intox poet
laid indemn ugli
sentences:
- Is the content related to romance genere
- Is the content related to non-fiction genere
- Is the content related to romance genere
- source_sentence: celebr hitt correspond windmil doivent take june hove sequel petition
hamlet crash mond knotti grudg sportsman prowl morrow semblanc jargon reap full
ancestress cheruel manabozho merit buoy governor dine plain misstat grand dwelt
fir kind joint around hound san moranget cricket confirm frosti balk straggl regret
tenant invoc crop fervent tie uncharit savag omaha chassagoac conqueror infer
repast crack répondu mèmoir splendor anywher match sept divan prey caus pratiqu
theft dot disguis crime chaff incubus ouabouskiaou strike regardless disk croyant
auec top droitur brulé 1701 much infuri morass misconceiv back rigg midnight atroci
femm audess disput avail reluct tree shield andast peac solac utica set déchargent
ouasi resté lock nativ kaskaskia negoti renounc confeder crude luth part horseback
treacher orang réserv sit speedili mohegan enmiti pretens motionless giraff platt
estr clap accliv proceed pervers access fish probabl ambassador faillon visag
extend bow ottawa islinoi vexilla diver foment accuraci canton loutr bark level
spring asthmat carolina term assent antonio considér jesuit bishop disprov daumont
aver tangibao seneca amiti defect letter confluenc french dabbl threshold tomb
inquiri travel proprieti bush espèc idl dreami document descend courag foray downward
fring sandston incorrect parrot menez expressli displeasur eagl sépultur indec
escarpé dens strip quiet mush eastern evinc natur pick honnêt coureur 83me eighti
lichen toriman bell cachent confer stealthili spear waist catharin transfer merg
ferland gratitud blue friabl paw forget prochain risk caution still generos awar
burlesqu concentr mingl cinquièm pourtant altern us somebodi suppress unscrupul
discord coat dog pierron loup campaign mangèrent cloth theme rope unnatur discipl
haw battl superfici spendthrift empti tavern threat épuisé deliv deceas vicious
employ trunk endow notwithstand jansenist baptism offend sustain complic almost
larger commit villag invect green careen ownership request lightn braveri sunday
remedi current
sentences:
- Is the content related to romance genere
- Is the content related to non-fiction genere
- Is the content related to romance genere
---
# SentenceTransformer based on thenlper/gte-small
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [thenlper/gte-small](https://huggingface.co/thenlper/gte-small). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [thenlper/gte-small](https://huggingface.co/thenlper/gte-small) <!-- at revision 50c7dd33df1027ef560fd504d95e277948c3c886 -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 384 tokens
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'celebr hitt correspond windmil doivent take june hove sequel petition hamlet crash mond knotti grudg sportsman prowl morrow semblanc jargon reap full ancestress cheruel manabozho merit buoy governor dine plain misstat grand dwelt fir kind joint around hound san moranget cricket confirm frosti balk straggl regret tenant invoc crop fervent tie uncharit savag omaha chassagoac conqueror infer repast crack répondu mèmoir splendor anywher match sept divan prey caus pratiqu theft dot disguis crime chaff incubus ouabouskiaou strike regardless disk croyant auec top droitur brulé 1701 much infuri morass misconceiv back rigg midnight atroci femm audess disput avail reluct tree shield andast peac solac utica set déchargent ouasi resté lock nativ kaskaskia negoti renounc confeder crude luth part horseback treacher orang réserv sit speedili mohegan enmiti pretens motionless giraff platt estr clap accliv proceed pervers access fish probabl ambassador faillon visag extend bow ottawa islinoi vexilla diver foment accuraci canton loutr bark level spring asthmat carolina term assent antonio considér jesuit bishop disprov daumont aver tangibao seneca amiti defect letter confluenc french dabbl threshold tomb inquiri travel proprieti bush espèc idl dreami document descend courag foray downward fring sandston incorrect parrot menez expressli displeasur eagl sépultur indec escarpé dens strip quiet mush eastern evinc natur pick honnêt coureur 83me eighti lichen toriman bell cachent confer stealthili spear waist catharin transfer merg ferland gratitud blue friabl paw forget prochain risk caution still generos awar burlesqu concentr mingl cinquièm pourtant altern us somebodi suppress unscrupul discord coat dog pierron loup campaign mangèrent cloth theme rope unnatur discipl haw battl superfici spendthrift empti tavern threat épuisé deliv deceas vicious employ trunk endow notwithstand jansenist baptism offend sustain complic almost larger commit villag invect green careen ownership request lightn braveri sunday remedi current',
'Is the content related to romance genere',
'Is the content related to romance genere',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 4,319 training samples
* Columns: <code>anchor</code> and <code>positive</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive |
|:--------|:--------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
| type | string | string |
| details | <ul><li>min: 449 tokens</li><li>mean: 506.92 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 10 tokens</li><li>mean: 10.73 tokens</li><li>max: 12 tokens</li></ul> |
* Samples:
| anchor | positive |
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------|
| <code>assum discredit loud immedi incumb wealthi speck flare sleepi marriag intang rise revolv stupor fool voic manner thereupon abhorr mountain general amend flew posi intox poet laid tel ugli issu insult armament assert croak illus deign discourag trust fund pray irregular aristocraci shoulder overcom dumb devil pas grass unnecessari heat event factotum shot stabl innumer fleshi later struggl vike arrog orchardward tune dissatisfact presum reclus seven behavior fine hebe hind ripen irrate brother annoy whitewash sunris curtain indulg delirium youth labori would unlucki unwrinkl initi hark bliss occas everyth folli subordin stamp glossi finish consist hall cave insight forg matter forward familiar hidden sandi noblest undevelop acr masonri wand took endeavor joke standpoint loveli picket caress nicknam coil temper unknown pledg sunk looker abil subterranean wari effemin go spit denounc recoveri violenc moorish gloomili wind stove religion senior stiffli shudder lean encount luckili pull weld approach liveli glyphi plagu funnel soulless inquir pearl tenabl unsaf justifi unhero curious subject laboratori societi afford dose hundredth thief tremor grizzl villan tumult knocker rainbow boy drama pitiless cynosur demeanor communic ironi lurk loftiest freshen offenc environ mixtur habitu blunt shirt straightway lieuten sofa lineament poison hypothet nonsens censor æon applaus blew blade sanguin caller heavenward resist readili tempor hatr rivalri purpl coward barber damask dialogu carpet seat disadvantag gad littl insignific rather apolog surpris frivol aloft uproari boot review ad thrown lavish trod curv join infirm wise undecid seclud protector humorist quiver peep repossess transit brewer warn swimmer reproduc failur upon rob draw wrist triumphant horror unusu leastway larg field rig durabl lord brink barrist show probe grow redund jacob sincer work twain sleev betroth anyon undo sadden darksom satin saint entreati central breez unconsid permit intellig gallon photograph whenc asid aristocrat taint ceil aloud</code> | <code>Is the content related to non-fiction genere</code> |
| <code>last highest gynê smoke proximum inclin synapteon gladden ekeinên flutter could ænian lead exact sleeper ascend faithless alik satisfi orcus merus nave frustra delphi muse balm realli regain arist convoy formid sell recal surest blast respect carnean mead envelop better dare moriar reduc talk glori mightest dicendum shrink abroad calm altisono sin ultima xxviii pous subterran kisso rage entha marri naught seldom upros race taphian restor elthont weather bewar forcibl lydian serm xenophon rest xeinôn rebuk spectr verum consilio satisfactorili medicin unfavor anthrôpoi prodess ætas 1437 lighten epebaiên across practic taken seer recommend dramont handsom tenor lepton hydatôn hêtis rose ill audiat mempto scalig propos suspicient falsehood long wetstein unintellig pluto enslav agit cross continu size lamb latebo ktypou cloudi like superstiti perchanc account colchian oaken euripidê delight infidel wed pitnonta excito mate liber discreet libya unpract whither gall murder weapon mean subsist cityless sepulchra nêpie eurota hyperechthairei antiop stop prosgelai earli achill metr suffici mellonta spot abiôton arbylê aveng catastroph kephalêi natal argous beyond sped known substant line parallel aeri given hew pavement euergesiôn egomet atmospher titan peal flatteri pheroean hygrotêt inclos givest tempt endear ôkeanou onta assonat payest realiti congeni sound pella unto advantag dynatai skimpôni apt expedi patro horrent illustri libri nautic beard stab seem situat lesser floweri success odyssey commemoratio unsulli palla lyei 1209 singular mellein unhonor languag surg regular eriosteptoi assertest gynaiko populac daphni scandal allianc stroke monk aught counter putter extinguish varianc elegi polydor pedest per fright bridegroom stadii unfortun skeptic horai solicitud publish offici kachla 1840 nation korytha corruptus kain topôn lament uncal olympus reveng cineri charon remittest length sipylus lolaus greatest unadorn shoot kalyptê nowher hospit blomfield promiscu iron shelter tipto stori unquest penthêrê</code> | <code>Is the content related to non-fiction genere</code> |
| <code>sank driven interrupt linen live sledg hast mistak alban cherish egg rhyme chief ezekiel whole excess neck shepherd robber snake even cours 160th neckti vocabulari wherefor vibrat protest repent stay import fanat pedestrian plenti convict threw thousand net timber crown owner echo poke battlement bugl nearer tole blush fresh darrel sail client warden happi colli strand congress eastward run limit scamp liberti celebr sacr squint treat outbreak dost offic hear bedroom brakeman correspond guilt glibli gabl son take jolli june mullen depot havin septemb leech guard bard extraordinari hamlet scarf tender juri knotti thurst unfad helpless strap hole rous slow shallow frequent morrow jargon befriend reap ocean spatter slaveri caesar isaiah forrest mile eliot full win pan wrong confront knee shear nice slid arrear angrili fourteen tentat merit governor bear togeth shook dine sermon fortitud web plain banker thrash sixteen grand grim forsooth railroad dwelt harrow burglar fir kind sober expector around hound joint hypocrit question clover skull snap bulli upper undu forehead sum cuff tramp cricket float speaker invis gestur mebb tax skeleton volcano drill tellin foreclosur editor confirm frosti scrambl regret ravel fiction hous holiday break schoolhous card pretenc crop fervent vittl tie mire whereon haughti fellow choos manag dinner infer crack dig index uneasi done drover foot agre studious verdict hand feat graven counterfeit brindl anywher fore thrill wolf partner heartbroken match martha prey caus imit muzzl public chalk beat welcom root celtic fifti person ladi excel confidenti jealousi damnabl xvii unutt sharpli crime sower train wrung manhood sunlight darken sharper secret grill elizabethan handwrit lay minut heav strike stalk horn amber near beg preacher loos christma discont rugos sleepless america tast consider top kidnap power buck much wreck ring merrier trick hard mischiev dagger mouth back knife prospect tear midnight cocoanut best pike abe gust dungeon poverti bond cassia gobbler exercis eben</code> | <code>Is the content related to fiction genere</code> |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
```
### Evaluation Dataset
#### Unnamed Dataset
* Size: 1,234 evaluation samples
* Columns: <code>anchor</code> and <code>positive</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive |
|:--------|:-------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
| type | string | string |
| details | <ul><li>min: 453 tokens</li><li>mean: 507.1 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 10 tokens</li><li>mean: 10.71 tokens</li><li>max: 12 tokens</li></ul> |
* Samples:
| anchor | positive |
|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------|
| <code>domest creed valentinian tone proclam peaceabl 1843 weakest incompet proscript realm esteem brigandag stock none incom authent competit follow labor vers wear ensembl impair student unalter glad cisalpin damocl sang perfidi pardon impera stupefi villa monopoli charl look link adag monomania messag hypocrisi priori counterpois publica gorgia redeem thank uncivil unwound fetter pascal serpit honorari maim superintend told homo promenad furnitur brief extract nehemiah furthermor competitor billion teas victim rate terminus higher mariti sacrileg behold bridg predecessor episcopi billow annot develop yardstick pretend special insinu kingship francai reckon sale devoid ghost difficulti driven falsifi pattern chief fatten contin retract dido repent thousand scholast ell librarian owner suffic fresh changer cartesian journeyman run treat offic ingenu war spontan bard extraordinari telescop extort assumpt gracious strategi frequent shallow aliquid manufactori ocean sibyl augustus mile galvan wrong usucapio knee beautifi wardenship bear togeth cart shook executor allobrog auger chapsal fortifi budget question implac entwin arbitrari float facto dearest logic commandit apprentic fiction advent traffic choos incred foot partner wolf evalu noel rioter muzzl root 1862 florenc manhood geometr nostra horn theseus beg overs melodrama inscript habent refrain helvetius disagre nodier similitud blanqui unemancip pike exercis obvious alli preambl wife ostens conquest compens coars cherbourg grantor invent duti epicur loss futil evapor gaul raison approb athenian insincer asham whim purpos unchang destruct imposit lacedaemonian wish conson pocket boobi commune relish ablest track cook blow friend geometri railway tiberius wash detriment meyer render teller ess amen arous idea personag sacrif repres stood david confrer fond sad cratch doubli attain advic vineyard pound habetur urgent britain communiti majorat juggleri biblic trim equal villein hazard expropri selfish declar taught ingratitud satisfact deliber wiser enthusiast</code> | <code>Is the content related to non-fiction genere</code> |
| <code>conscious chronolog leapt close sis drift lump station rank destitut contriv swivel grate stuck spare monoton thicket mesh yellow air fault choost reward scorn intent applic pestilenti contemptu greenhous mix pipe persuad plung avoid displac trustet ahoy concern critic sowsand name jounc downtown involuntari establish peril also settl flash voter mighti bang necess vial bewitch characterist adorn beauti sate decrepit citronell naturalist know conscienc fontenett laden strock deceiv inde pursuer xxii aimless moonlight archangel detain infatu frighten bought drows lucki pine trickl juic owfool pathet sunbeam tent needl gusti twas clung worthi diseas outrag recov made exhaust second begrudg cobweb privat corridor speak seventi bawn undress tarri remind enamour prompt lip graver ventur obedi basement forgotten crowd other sing incident breakwat excus wile rebound entangl philosophi flabbi deliver believ outang affect arriv vision soak bug realiz cruel frock promis pahdon everi modesti suzann fickl le african relief fortun laundri serenest ash straight damp awri lessen evil loudeh fonteett tardi spill hale hostess ladder avow medit seal longer well rebuff maintain quicker exclus donkey season hug wreath emphasi fill flag devious disturb bit tiger stolen intend drench unclean deep flourish apprehend admir veight flesh shiltren week anxieti violet how richard unbear everywher prefac conduct saunt stumbl though peopl sinc someth despair obey moor moral sill strang kine compassion mark doze flow dreamless wors crouch acquaint sugar typic doorsil leafi redempt unchalleng delug tarpaulin troop circumv hither reserv wander dirti crib cistern plead ruin serious slept scholar gradual drove fan mellow meet entertain till mantlepiec fairili sorri gasp southern heighten seed attempt joseph drown notion fascin constel rich consent speech teeth tire glorious pencil convuls glisten diffid lose citat dappl feast sooner belong splendid cigarett hoist sick midday tail fairst honor scorch savedt apathi color alvay inspect</code> | <code>Is the content related to romance genere</code> |
| <code>greatness late reput alarum compar fair rediscov realis round swell danc sayl crush particip move huddl materialis benumb prophesi infel unlaw rais suspit trumpet canibal herculean calcul yong specimen superiour forbear encreas fairest enlarg steer fama barrisor pediss preval shepheard umbra altum tergo catholiqu voluntatem assum bethink spar perplext oper immedi crab exploit wealthi catterpillar marriag labour rise revolv fool voic manner thereupon impo recevra montsureau abhorr enseam rendrer mountain general chariot amend flew poet laid tel issu insult radical assert bounti illus attyr discourag trust eundo penian mishap pray predict irregular unright lot expon shoulder overcom outright dumb treilli devil embrew enterd pas massing cognoscer essay splene exemplari grass traiter unnecessari prix scape heat event shot usd stabl highness syrtibus bel fleshi monsurri obsequi later struggl arrog intervent tune presum throne mess indur seven afflig judicial cyclop shakespearean fine sori prompter hind brother annoy inordin whosoev indulg lachesi perus youth span juli sadness 1888 would administr initi greediness obay hark bliss vellet occas folli subordin palladi stamp glossi finish consist hall cave lettr mercer forg 1865 gondomar matter forward niec stomack familiar noblest audaci thrid scap cure goos took crestfaln dispenc acheson zeal bodenstedt £300 temper pindus stephen barricado unknown elucid hostag desertful delighteth cruell 1681 clapdish wari go existen spit denounc violenc familia occisi wind religion eie lean oppidani encount bussii quellen desart cornhil pull approach haut plagu leas ornaverat epictetus nere pearl spenser sicil arrogantia justifi riot curious skirmish overlap subject faciebat societi afford celestial poultron villan humer jigg emrod 1903 boy drama dan communic 3830 offenc shaksper environ habitu blunt crafti down inviol men lieuten coit poison nonsens legitimaci scoff applaus blew letcher 1557 resist readili uncredit tempor hatr rivalri purpl coward librari errour sphære</code> | <code>Is the content related to romance genere</code> |
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 16
- `per_device_eval_batch_size`: 16
- `num_train_epochs`: 2
- `warmup_ratio`: 0.1
- `fp16`: True
- `batch_sampler`: no_duplicates
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 16
- `per_device_eval_batch_size`: 16
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `learning_rate`: 5e-05
- `weight_decay`: 0.0
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 2
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: True
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: False
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `batch_sampler`: no_duplicates
- `multi_dataset_batch_sampler`: proportional
</details>
### Training Logs
| Epoch | Step | Training Loss | loss |
|:------:|:----:|:-------------:|:------:|
| 0.3704 | 100 | 1.0978 | 0.9591 |
| 0.7407 | 200 | 1.089 | 1.0138 |
| 1.1111 | 300 | 1.0538 | 0.9570 |
| 1.4815 | 400 | 1.0502 | 0.9178 |
| 1.8519 | 500 | 1.0611 | 0.9197 |
### Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.1.0
- Transformers: 4.42.4
- PyTorch: 2.3.1+cu121
- Accelerate: 0.32.1
- Datasets: 3.0.0
- Tokenizers: 0.19.1
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### MultipleNegativesRankingLoss
```bibtex
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->