lucataco / nomic-embed-text-v1

nomic-embed-text-v1 is a text encoder with an 8192-token context length that outperforms OpenAI's text-embedding-ada-002 and text-embedding-3-small on short- and long-context tasks.

  • Public
  • 3.6K runs
  • T4

Input

Sentence list (string, required): the input sentences, one per line, separated by newlines.
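A minimal sketch of how the newline-separated input might be assembled from a Python list. The helper name `build_sentence_input` is hypothetical and not part of the model or of any client library; it only shows the one-sentence-per-line format described above.

```python
def build_sentence_input(sentences):
    """Join sentences into a single string with one sentence per line.

    Hypothetical helper: the model page only states that sentences
    should be split by a newline.
    """
    cleaned = []
    for sentence in sentences:
        # Collapse internal line breaks and extra spaces so that each
        # sentence occupies exactly one line of the final string.
        one_line = " ".join(sentence.split())
        if one_line:  # skip empty or whitespace-only entries
            cleaned.append(one_line)
    return "\n".join(cleaned)


if __name__ == "__main__":
    text = build_sentence_input([
        "The quick brown fox",
        "jumps over\nthe lazy dog",
    ])
    print(text)
```

Collapsing internal line breaks matters here because a stray newline inside a sentence would otherwise be treated as a sentence boundary.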

Output

[ 0.01095135323703289, 0.05741468071937561, -0.011036418378353119, -0.05894974619150162, 0.004029006231576204, -0.00038855959428474307, -0.019942041486501694, 0.06001022085547447, -0.06366678327322006, 0.016280679032206535, 0.0017858387436717749, -0.013357186689972878, 0.010486808605492115, -0.066402867436409, 0.04116280749440193, -0.0233303215354681, 0.025054315105080605, 0.05815963074564934, -0.024944618344306946, 0.020885661244392395, -0.011214514262974262, -0.03022485040128231, -0.0021379825193434954, -0.04964024946093559, 0.07829265296459198, -0.01828112080693245, 0.0548701211810112, 0.03149222582578659, 0.03443304821848869, 0.012447149492800236, 0.013194468803703785, -0.0712825357913971, 0.00815077219158411, -0.0022494641598314047, -0.007124587893486023, 0.0416703000664711, 0.0017338439356535673, -0.04015035182237625, -0.005192411597818136, -0.0005916599184274673, 0.003866987070068717, -0.09278689324855804, -0.03890141844749451, -0.030062643811106682, 0.061590246856212616, 0.017495594918727875, 0.002977917902171612, -0.010158374905586243, -0.020877212285995483, -0.007913591340184212, -0.09182387590408325, 0.011017527431249619, 0.03534690663218498, 0.0010791081003844738, 0.009003995917737484, 0.009348420426249504, 0.015054607763886452, -0.015587196685373783, 0.07030607014894485, 0.004751214757561684, -0.04396119713783264, 0.045153941959142685, -0.03173980861902237, 0.035106077790260315, 0.017179347574710846, -0.04437899589538574, -0.03213460370898247, 0.0415092334151268, 0.04083598405122757, 0.013065253384411335, -0.01577712409198284, -0.010894079692661762, 0.05594545230269432, -0.0054020872339606285, 0.021164188161492348, -0.026833508163690567, -0.004026634152978659, -0.0971449613571167, -0.0841112732887268, 0.062138259410858154, -0.022845910862088203, 0.003857814474031329, -0.028686009347438812, 0.01740163378417492, 0.01658797264099121, -0.006137331016361713, -0.0065208119340240955, -0.015026402659714222, -0.049816668033599854, 0.07368003576993942, 
0.026145759969949722, 0.048196159303188324, 0.0052910721860826015, 0.015295464545488358, -0.042671915143728256, -0.04825323820114136, 0.055762793868780136, 0.005118346307426691, 0.01936238445341587, -0.010363892652094364, -0.017021654173731804, 0.026527564972639084, -0.028245966881513596, -0.021725554019212723, 0.03361367806792259, 0.036614686250686646, -0.028702042996883392, 0.0012743142433464527, 0.025049567222595215, 0.02318190224468708, 0.006102026905864477, 0.015066496096551418, 0.00999217014759779, -0.016967011615633965, -0.012381822802126408, -0.020804187282919884, 0.0454832948744297, -0.018004948273301125, -0.011647653765976429, 0.009738769382238388, -0.01835423707962036, -0.00019434701243881136, 0.0013541621156036854, 0.036537837237119675, 0.011912787333130836, 0.0334969237446785, -0.014243103563785553, -0.007289757486432791, -0.029029762372374535, -0.010468755848705769, -0.03548333793878555, 0.0114112738519907, 0.011792461387813091, -0.041863810271024704, 0.0260318610817194, 0.052626579999923706, -0.07074104249477386, -0.04269999638199806, 0.018683597445487976, 0.03239864110946655, -0.004339292179793119, 0.009157787077128887, 0.029688680544495583, -0.0232163667678833, 0.013670137152075768, -0.015611213631927967, -0.029081260785460472, -0.006004523951560259, -0.014084621332585812, -0.009664948098361492, 0.026097219437360764, 0.02938411384820938, -0.021061971783638, -0.04682563990354538, -0.03028072789311409, -0.06160460785031319, 0.009697330184280872, -0.017547262832522392, 0.051340654492378235, -0.002938435645774007, 0.006194668356329203, 0.017610281705856323, -0.01250655297189951, 0.04507853463292122, -0.039850782603025436, -0.014003785327076912, -0.021806150674819946, 0.015212069265544415, 0.021924082189798355, 0.006270602345466614, -0.015489873476326466, -0.025726355612277985, -0.0219804048538208, -0.013452148996293545, 0.02987339347600937, 0.04142092168331146, -0.02970430813729763, -0.005046514328569174, 0.06513293832540512, 0.01007703598588705, 
0.015709707513451576, 0.032798197120428085, 0.060665108263492584, 0.08495338261127472, -0.05383361503481865, 0.0024632923305034637, -0.007010897155851126, -0.005284905433654785, 0.03133589029312134, -0.020456083118915558, 0.029202604666352272, -0.05406308174133301, -0.006425356958061457, -0.01863768696784973, 0.009493045508861542, -0.04606451466679573, 0.007750218268483877, -0.022219300270080566, 0.03089388832449913, -0.017030097544193268, -0.036510322242975235, 0.0033556867856532335, -0.05742275342345238, 0.019918467849493027, -0.024453848600387573, 0.0028127767145633698, -0.03847751393914223, 0.012979201041162014, 0.010507640428841114, 0.04476174712181091, 0.06809226423501968, 0.031181611120700836, 0.01118194404989481, -0.012961296364665031, -0.04794774204492569, -0.03537052124738693, -0.025902751833200455, -0.012294700369238853, -0.022769780829548836, -0.023255184292793274, -0.03107871487736702, -0.07981742173433304, 0.00447813468053937, 0.01950179412961006, 0.022204559296369553, -0.056188084185123444, 0.044839367270469666, -0.022334588691592216, -0.029524069279432297, -0.00804254598915577, -0.03299224004149437, -0.028446244075894356, 0.0036380512174218893, -0.0035651475191116333, -0.024932485073804855, -0.021709932014346123, 0.023738160729408264, 0.03356711566448212, 0.0015305689303204417, 0.0196834784001112, -0.007776692975312471, 0.06507500261068344, -0.0385550893843174, -0.04127439856529236, -0.03507571294903755, 0.04378141835331917, -0.02261347509920597, -0.031119758263230324, 0.009881269186735153, 0.0021211784332990646, -0.04353616014122963, -0.009117803536355495, -0.04359005019068718, 0.011464567855000496, 0.024521177634596825, -0.02393646165728569, -0.023444954305887222, 0.0667286217212677, -0.0017716558650135994, 0.004849900957196951, 0.0295844879001379, 0.0681636705994606, 0.028212470933794975, -0.03865675628185272, 0.024017414078116417, 0.01685112528502941, 0.03454534336924553, 0.009134492836892605, 0.037454914301633835, -0.053639136254787445, 
-0.02899150177836418, 0.03195880725979805, 0.002846618415787816, -0.0064763957634568214, -0.02407020889222622, -0.03450958430767059, 0.06165960803627968, 0.007945283316075802, -0.0988699346780777, 0.018333332613110542, 0.028297564014792442, 0.04458598420023918, 0.023490583524107933, 0.014098381623625755, -0.003519838210195303, 0.02807518281042576, 0.06878983974456787, -0.06952008605003357, -0.00370450085029006, 0.008531334809958935, -0.022528856992721558, -0.0410970076918602, 0.03746335580945015, 0.03240613639354706, -0.022129975259304047, 0.04243621230125427, 0.023338552564382553, 0.024562601000070572, 0.023991748690605164, 0.004718718118965626, -0.01301904022693634, 0.05135525390505791, 0.04467343911528587, 0.03903888911008835, 0.0186002179980278, 0.01576736569404602, -0.0030277366749942303, -0.025646040216088295, 0.04754107818007469, -0.014813698828220367, -0.057426635175943375, 0.009132984094321728, -0.006039661820977926, 0.02352750673890114, -0.011951982043683529, 0.0810471773147583, 0.009133119136095047, 0.03146473318338394, 0.017106210812926292, -0.022793369367718697, 0.0813884511590004, 0.0019824292976409197, 0.018278062343597412, -0.030706653371453285, 0.018740728497505188, 0.002366192638874054, 0.025899942964315414, 0.06023377925157547, -0.04771910235285759, -0.04957083612680435, -0.0043039750307798386, 0.014273888431489468, -0.016368649899959564, -0.022631095722317696, -0.001251550274901092, 0.06406429409980774, 0.03341543301939964, 0.07553271949291229, 0.04653846472501755, 0.025847401469945908, -0.04633196070790291, -0.0572177916765213, -0.02679474651813507, 0.0009575766162015498, 0.008013112470507622, -0.045646268874406815, -0.07301680743694305, -0.032558854669332504, 0.023830702528357506, -0.037694089114665985, -0.004952005576342344, 0.02268902398645878, -0.011964686214923859, 0.0050571383908391, -0.06218785420060158, 0.0002706383529584855, 0.02045312151312828, -0.06122603267431259, 0.010259377770125866, -0.03296119347214699, 0.015623349696397781, 
-0.0560494139790535, -0.008739670738577843, -0.015469438396394253, 0.01494138315320015, 0.01980438269674778, 0.06613612920045853, 0.05736435577273369, 0.08562860637903214, -0.04306644946336746, -0.07399771362543106, -0.005157689098268747, -0.008684834465384483, 0.038428325206041336, 0.02072681300342083, 0.00744375865906477, -0.044580549001693726, -0.004789096303284168, -0.05440279096364975, 0.014681629836559296, 0.0352068766951561, -0.02680492401123047, -0.00879183504730463, 0.06814570724964142, 0.028043227270245552, -0.06403806060552597, -0.06723233312368393, -0.047958552837371826, -0.04751108959317207, 0.011454190127551556, -0.020288053900003433, -0.07312195748090744, -0.023174870759248734, 0.0766587033867836, 0.04014214500784874, -0.00465058209374547, 0.024160558357834816, -0.013461316004395485, -0.011761688627302647, -0.04329565167427063, 0.04425843060016632, -0.08991681784391403, -0.014513592235744, 0.030178304761648178, -0.028338417410850525, 0.014233969151973724, 0.017719777300953865, -0.015995610505342484, 0.04840908944606781, 0.05448590964078903, 0.04405346140265465, 0.06311886757612228, 0.0454021580517292, -0.04756806418299675, 0.01838882640004158, -0.019803250208497047, 0.02567993849515915, 0.008955170400440693, 0.004070349037647247, -0.015512282960116863, -0.017582112923264503, 0.050298433750867844, 0.005353149957954884, 0.025372033938765526, 0.03057747706770897, -0.004827969707548618, 0.011461003683507442, -0.02270055003464222, 0.0050400616601109505, -0.03626517206430435, -0.021481184288859367, 0.0033793365582823753, 0.010307487100362778, 0.04814928025007248, -0.020942287519574165, 0.018860695883631706, 0.05137651041150093, -0.04099107161164284, -0.05171844735741615, 0.01339589711278677, 0.00700850784778595, -0.09815945476293564, -0.011868336237967014, -0.004565415438264608, 0.017416823655366898, 0.043746646493673325, 0.05679110437631607, -0.04089980944991112, 0.03040848672389984, 0.03710726648569107, 0.047923896461725235, -0.027744831517338753, 
0.023457149043679237, 0.04731198027729988, 0.0411817841231823, 0.004236286506056786, -0.05223522335290909, -0.0450056828558445, -0.027056913822889328, -0.027974342927336693, 0.06804237514734268, 0.009149203076958656, -0.003521802369505167, -0.030428705736994743, 0.000691404624376446, -0.0029757048469036818, 0.012632301077246666, 0.019209904596209526, 0.03216187283396721, -0.049256831407547, 0.04627080634236336, 0.024562722072005272, 0.0355701670050621, 0.035221293568611145, -0.07311935722827911, -0.024610735476017, 0.030660677701234818, -0.052247967571020126, -0.04667264223098755, 0.05237388610839844, 0.01180503610521555, 0.004114583134651184, -0.060087088495492935, 0.009188946336507797, -0.04129689559340477, 0.0280842836946249, -0.0014345082454383373, -0.01582113839685917, 0.04283314570784569, -0.010187298990786076, 0.0279100202023983, 0.01842082105576992, 0.03190162777900696, -0.02242290787398815, -0.026790481060743332, -0.018636655062437057, 0.019975779578089714, 0.023535508662462234, -0.015892142429947853, -0.0007095849141478539, 0.00034376970143057406, -0.00022445543436333537, 0.021103084087371826, 0.05670653283596039, 0.049382057040929794, -0.013828130438923836, -0.027455484494566917, -0.04472290351986885, -0.011652160435914993, -0.04819415882229805, -0.045328289270401, -0.01144829485565424, 0.016514906659722328, 0.0417681448161602, -0.051248159259557724, -0.01784973405301571, 0.047419190406799316, -0.0452401228249073, -0.07463189959526062, 0.007753925863653421, 0.012752576731145382, 0.024080149829387665, 0.0018788465531542897, 0.023718222975730896, -0.09520231932401657, -0.013076655566692352, 0.009152228012681007, -0.014780477620661259, 0.027227699756622314, 0.012050078250467777, 0.034551024436950684, -0.05476585030555725, 0.029133930802345276, -0.01622489094734192, -0.013028962537646294, 0.02777349203824997, -0.03841657564043999, 0.08191419392824173, -0.010105338878929615, -0.02315670996904373, 0.014560697600245476, 0.012290042825043201, 
0.0016529938438907266, -0.01192073617130518, 0.04462875425815582, -0.04701828211545944, -0.03582185134291649, 0.05610434710979462, -0.09559338539838791, -0.010402658022940159, 0.04430545121431351, -0.03543740510940552, 0.057771533727645874, -0.0778145119547844, 0.08405705541372299, -0.02349276840686798, -0.03435845673084259, -0.031138814985752106, -0.000004983299731975421, -0.021359922364354134, 0.006968296132981777, -0.004981718026101589, -0.007418072782456875, 0.00838849600404501, -0.00045239622704684734, -0.044645775109529495, -0.0014725091168656945, -0.014690691605210304, 0.0033954575192183256, -0.00005948846592218615, -0.04214471951127052, -0.020198984071612358, -0.016358090564608574, -0.08050812780857086, -0.023891855031251907, -0.027170846238732338, 0.02181682363152504, -0.06833459436893463, -0.010455570183694363, 0.036701783537864685, 0.0009135351283475757, -0.06084134429693222, -0.0378529392182827, -0.05277568846940994, -0.0029317019507288933, 0.013544419780373573, -0.03654816374182701, -0.03897010162472725, 0.0018767787842079997, 0.01530505996197462, -0.006521572358906269, -0.003759671002626419, -0.011109281331300735, 0.035125330090522766, 0.029497243463993073, 0.005309949163347483, -0.03112056478857994, -0.05744868889451027, 0.017789775505661964, 0.0025538725312799215, -0.047091901302337646, 0.022515304386615753, 0.043565377593040466, -0.032728057354688644, -0.033630453050136566, 0.07852070033550262, -0.027601078152656555, 0.018232954666018486, -0.021809548139572144, -0.04005248099565506, 0.07040561735630035, -0.0008223121985793114, -0.05039757862687111, -0.021871237084269524, -0.02111952379345894, 0.0002747916441876441, 0.07178083807229996, -0.00583243602886796, 0.01804327964782715, 0.0008332610595971346, 0.014933548867702484, -0.02831130288541317, 0.06816618144512177, 0.02935623563826084, 0.051236994564533234, -0.028796883299946785, -0.010663599707186222, 0.008187556639313698, 0.0016748667694628239, 0.07402241975069046, -0.013136499561369419, 
0.018661554902791977, -0.019921066239476204, -0.022850919514894485, -0.058953870087862015, -0.02801273576915264, 0.016860956326127052, -0.040944550186395645, 0.008239670656621456, 0.08554871380329132, -0.01760023459792137, 0.00490532536059618, 0.006614720448851585, -0.023586684837937355, 0.032409973442554474, 0.0005555882817134261, 0.01278941985219717, -0.026426831260323524, -0.008805539458990097, -0.032470446079969406, 0.05493434518575668, 0.035844068974256516, -0.03385930508375168, 0.039209891110658646, 0.04573044553399086, -0.013734528794884682, 0.04512805491685867, -0.019866952672600746, -0.028991734609007835, -0.010187303647398949, -0.11430353671312332, -0.03440805897116661, 0.005934761371463537, -0.0036425774451345205, 0.024203326553106308, -0.02618531510233879, -0.044007983058691025, -0.04432472214102745, -0.02462209388613701, 0.008616941049695015, 0.031241875141859055, -0.018664278090000153, 0.029299136251211166, 0.03314938768744469, -0.006805418990552425, -0.028522077947854996, -0.004853042773902416, 0.03155948221683502, 0.018128950148820877, 0.028947411105036736, -0.007338481489568949, 0.050942953675985336, 0.0385914109647274, -0.01340820174664259, 0.02782238833606243, -0.016686392948031425, 0.0034198290668427944, 0.03323481231927872, 0.036282286047935486, -0.059357598423957825, 0.026725489646196365, 0.03615732491016388, -0.03119989112019539, 0.00005066340963821858, 0.062447380274534225, 0.044097404927015305, -0.020388146862387657, 0.03097723424434662, -0.03796424716711044, 0.04448886960744858, -0.015311235561966896, 0.10095039755105972, -0.04681314900517464, 0.030279578641057014, -0.01943371631205082, 0.025086017325520515, 0.05732494965195656, 0.007536051794886589, -0.010740718804299831, 0.018136056140065193, -0.0487496443092823, 0.07054506242275238, 0.08814582973718643, 0.07161413133144379, 0.03336310014128685, -0.004511588718742132, 0.0025663338601589203, -0.04977448284626007, -0.014745009131729603, 0.013455641455948353, 0.03375483676791191, 
0.0068253036588430405, -0.0507630929350853, 0.05863906443119049, -0.06384045630693436, -0.04259096086025238, -0.08660164475440979, 0.007497888058423996, 0.12273633480072021, 0.039327751845121384, -0.07067286223173141, 0.02883453108370304, 0.016115861013531685, 0.02388802543282509, -0.04639867693185806, 0.07748599350452423, 0.05080137401819229, -0.006818047259002924, -0.0504932701587677, 0.018026666715741158, 0.03747960552573204, -0.03037784807384014, 0.03645424544811249, 0.00017750763799995184, -0.014108170755207539, -0.046832870692014694, -0.017526039853692055, -0.004456405993551016, -0.016885753720998764, -0.03272240608930588, 0.0012103213230147958, -0.024431630969047546, -0.007181975059211254, 0.04176744446158409, 0.011346409097313881, 0.012977346777915955, -0.019861262291669846, 0.038307592272758484, -0.013685143552720547, 0.007373974658548832, -0.028337836265563965, 0.019534964114427567, 0.024852126836776733, 0.03777310997247696, 0.0005893092602491379, 0.000035152341297362, -0.028092166408896446, -0.021599844098091125 ]
[ -0.013367011211812496, 0.027091309428215027, -0.023367367684841156, -0.02931433543562889, -0.029378587380051613, 0.03030582331120968, -0.03322562575340271, 0.010936104692518711, 0.01639264076948166, -0.05841662362217903, -0.06689992547035217, 0.01468091644346714, 0.052607543766498566, -0.02454628050327301, 0.025613918900489807, 0.015576314181089401, -0.07304364442825317, -0.021631868556141853, -0.051586903631687164, -0.0490972101688385, -0.04658475145697594, -0.0026675958652049303, -0.0041494183242321014, -0.04526655375957489, 0.0824025347828865, 0.008150441572070122, 0.016687817871570587, 0.05063367635011673, -0.0162818543612957, 0.016138000413775444, 0.04551561921834946, -0.05062474310398102, -0.05022260919213295, 0.000929061439819634, -0.011988187208771706, -0.042469847947359085, -0.028033778071403503, -0.034987445920705795, -0.039099086076021194, 0.00407467782497406, 0.049649935215711594, -0.027287110686302185, -0.015809480100870132, -0.028346998617053032, -0.03654203191399574, 0.0077153765596449375, -0.05949987843632698, -0.018894322216510773, -0.02800183743238449, 0.025681987404823303, 0.01324142049998045, -0.031840551644563675, 0.0006436983239836991, -0.028477419167757034, -0.018753062933683395, 0.009581766091287136, 0.0011196398409083486, 0.0386163555085659, 0.027134839445352554, -0.009685776196420193, 0.025369644165039062, -0.0017950370674952865, -0.05807509645819664, 0.015584784559905529, 0.004428081680089235, -0.057285163551568985, -0.05581028014421463, 0.016089925542473793, 0.01491198968142271, -0.08602361381053925, 0.008599081076681614, -0.03273872286081314, 0.01816224679350853, 0.009296800941228867, 0.01626083441078663, -0.01852138713002205, -0.02180669456720352, 0.0053992681205272675, 0.022959381341934204, 0.020627690479159355, 0.01704055443406105, -0.003142870031297207, 0.017939621582627296, 0.06219086796045303, 0.052361421287059784, 0.028135884553194046, 0.0490267314016819, -0.0010522350203245878, -0.04324304312467575, -0.0060428171418607235, 
0.025130659341812134, -0.005372289102524519, 0.02188788540661335, -0.030034316703677177, -0.04676956683397293, 0.011841668747365475, -0.0017237251158803701, -0.011886943131685257, -0.03777362406253815, -0.012695327401161194, -0.0012145627988502383, 0.01872056908905506, -0.06419260799884796, 0.009192639961838722, 0.04412635415792465, 0.0013409190578386188, 0.05110359191894531, 0.018565203994512558, 0.01053679920732975, 0.018642578274011612, 0.0195607990026474, -0.03315857797861099, 0.005079865921288729, -0.028002705425024033, -0.06296424567699432, -0.04396319389343262, 0.059086523950099945, -0.007648728787899017, 0.016416465863585472, 0.029702961444854736, 0.01353293564170599, -0.016358062624931335, -0.006039596628397703, 0.04610564187169075, 0.05226119980216026, -0.012794599868357182, -0.012460069730877876, -0.015195890329778194, 0.034715134650468826, 0.030447043478488922, -0.015420882031321526, 0.011023534461855888, 0.0019727249164134264, -0.026603301987051964, 0.023869933560490608, 0.01619776152074337, 0.017990905791521072, 0.03754420951008797, 0.013901847414672375, -0.006657007150352001, -0.043388914316892624, 0.01075967587530613, -0.005554885603487492, 0.01976935938000679, 0.016803156584501266, -0.04010647535324097, 0.05328849330544472, 0.03468293696641922, -0.04985842853784561, 0.029320836067199707, -0.01585582084953785, 0.012560023926198483, -0.005726690869778395, -0.03093198500573635, -0.058581575751304626, -0.05476389080286026, -0.051109105348587036, 0.03417803347110748, 0.011891189031302929, 0.0033439071848988533, 0.034214723855257034, 0.0635485127568245, -0.01307082548737526, -0.0002516607928555459, -0.051131948828697205, 0.03350355476140976, -0.024761168286204338, 0.052967168390750885, 0.017504673451185226, 0.011198264546692371, -0.020892567932605743, -0.010587452910840511, -0.00026501421234570444, -0.004447563551366329, 0.04966898635029793, -0.0202559195458889, -0.05135452374815941, 0.026456469669938087, 0.04309443011879921, 0.02888336405158043, 
0.021896475926041603, -0.04728158563375473, 0.021502986550331116, 0.03197498247027397, -0.04003562033176422, 0.042076475918293, -0.028709886595606804, -0.009980851784348488, 0.019864916801452637, -0.05299125984311104, -0.013078444637358189, 0.03484668582677841, 0.00813272688537836, -0.012707322835922241, -0.040900807827711105, 0.03332795202732086, 0.006207541562616825, -0.06641523540019989, -0.006586514413356781, 0.017960786819458008, -0.09790168702602386, -0.03846385329961777, 0.05406108498573303, 0.05142273008823395, 0.032307226210832596, -0.0008202686440199614, -0.0017560627311468124, 0.023240385577082634, 0.00521981343626976, 0.02373497188091278, 0.0549178309738636, 0.026087051257491112, 0.05340711027383804, 0.06624624133110046, -0.04006277769804001, -0.0168919637799263, -0.0023763685021549463, 0.020568957552313805, -0.00032972136978060007, -0.020321285352110863, 0.023252444341778755, 0.03293822705745697, -0.020278438925743103, -0.016082152724266052, 0.014716453850269318, -0.03914165869355202, 0.06600044667720795, -0.023393385112285614, -0.05196664109826088, -0.008601250126957893, -0.009670604951679707, 0.022531645372509956, 0.06830757111310959, 0.025388330221176147, -0.0030663262587040663, -0.035304825752973557, -0.06941498070955276, 0.07528544962406158, -0.0004654425720218569, 0.041580066084861755, 0.002672195201739669, 0.038263384252786636, -0.07213281095027924, 0.038647737354040146, 0.030739326030015945, 0.03493083268404007, -0.038649145513772964, -0.009652426466345787, -0.011662045493721962, 0.017665961757302284, 0.010481348261237144, -0.03348642215132713, 0.002821381203830242, -0.011960468254983425, -0.016304513439536095, -0.018019307404756546, -0.00804162584245205, 0.037634167820215225, -0.016669118776917458, 0.024196676909923553, 0.0781722366809845, 0.009995926171541214, 0.041603993624448776, 0.003320380114018917, -0.028482327237725258, 0.008030678145587444, -0.07800203561782837, 0.03985422104597092, 0.03820877522230148, -0.011566279456019402, 
0.047890398651361465, 0.029712477698922157, 0.012603509239852428, 0.05557284504175186, 0.007490682415664196, 0.0028987322002649307, 0.010545793920755386, -0.023050876334309578, -0.03927967697381973, 0.021380530670285225, -0.04875064641237259, 0.02948448434472084, -0.003296519862487912, -0.09237199276685715, -0.019150182604789734, 0.07859811186790466, 0.021795369684696198, -0.022578855976462364, 0.06711656600236893, -0.0024967847857624292, -0.012966042384505272, -0.0017742767231538892, -0.025218969210982323, 0.053635794669389725, 0.015528887510299683, 0.022969527170062065, 0.04157904535531998, 0.029676327481865883, -0.03682582825422287, -0.07591484487056732, 0.053010571748018265, -0.007560709957033396, 0.019736051559448242, 0.013470569625496864, 0.04742124304175377, -0.05009404942393303, 0.03461810201406479, 0.006931418552994728, 0.076551154255867, -0.015857763588428497, -0.03384680300951004, -0.008442088030278683, 0.029287857934832573, -0.07256081700325012, -0.0420292466878891, 0.00005552463699132204, 0.006595691666007042, 0.019360650330781937, 0.0039208210073411465, 0.010030201636254787, 0.05568426474928856, -0.005340257193893194, -0.004507614765316248, -0.04510500654578209, -0.007215121295303106, 0.0126324612647295, 0.007465405855327845, -0.04476967081427574, -0.028459027409553528, 0.04519397020339966, -0.025224808603525162, 0.06098664924502373, 0.013003269210457802, -0.0535307303071022, 0.006370587274432182, 0.03567911684513092, -0.056454136967659, 0.015805166214704514, 0.024966605007648468, 0.030049480497837067, -0.06974119693040848, -0.010202920064330101, -0.04017848148941994, -0.0030107253696769476, -0.014565221965312958, -0.010480005294084549, -0.03358684480190277, -0.059079449623823166, 0.05635131895542145, -0.045069146901369095, -0.06790684908628464, 0.07365623116493225, 0.025703681632876396, 0.048897549510002136, -0.03670347481966019, 0.04037290811538696, 0.003536506788805127, 0.02769387513399124, 0.002599050523713231, 0.02799932286143303, 
0.002108124317601323, -0.031387727707624435, -0.02956802025437355, -0.014063182286918163, 0.0011730255791917443, 0.028059253469109535, 0.04766172543168068, 0.061100371181964874, 0.004101407248526812, -0.031233012676239014, 0.008680857717990875, -0.031279556453228, 0.09263414889574051, 0.05368093401193619, 0.07667143642902374, 0.00827677734196186, -0.03903355821967125, -0.026648467406630516, -0.02639271318912506, 0.024951182305812836, 0.033475425094366074, 0.06577513366937637, 0.009616753086447716, -0.014587080106139183, 0.046099986881017685, -0.02811550535261631, -0.05149431154131889, -0.02771199494600296, 0.04142490774393082, -0.05639816075563431, -0.031960517168045044, -0.03455748409032822, -0.04575162008404732, 0.024492980912327766, -0.0008085657027550042, 0.00743621913716197, 0.03139351308345795, 0.0483773909509182, 0.02181151695549488, 0.013271701522171497, 0.01837030239403248, -0.08332404494285583, 0.016867714002728462, -0.021527396515011787, -0.0035130439791828394, 0.03877176716923714, -0.015405097976326942, -0.05093829333782196, 0.05890645831823349, 0.03812996670603752, 0.010277321562170982, -0.001411952544003725, -0.003806103253737092, -0.08194884657859802, -0.04012405499815941, -0.0026121812406927347, 0.07104285061359406, -0.0028507711831480265, -0.006285722833126783, 0.022433243691921234, -0.01879546232521534, 0.0780840590596199, -0.04958685114979744, 0.0032874057069420815, 0.003366191405802965, 0.04245126619935036, 0.04057992622256279, -0.06513143330812454, 0.02286355197429657, -0.026958977803587914, 0.051929231733083725, 0.026219407096505165, 0.006591421086341143, 0.07260095328092575, 0.0063383923843503, 0.04895593971014023, 0.04371348395943642, -0.003942361567169428, 0.00266691530123353, 0.05101630836725235, 0.000029922137400717475, -0.15343229472637177, -0.05926795303821564, -0.005357218440622091, 0.016813958063721657, -0.008059310726821423, 0.0691450759768486, -0.09572381526231766, -0.03562704473733902, -0.0028729091864079237, 0.04939509928226471, 
-0.019673597067594528, -0.016620393842458725, 0.032333970069885254, 0.07009420543909073, -0.022275850176811218, -0.06902893632650375, -0.11016625910997391, 0.01840086653828621, 0.013967333361506462, -0.017528291791677475, 0.03472188860177994, -0.0002349854476051405, 0.04924885928630829, 0.026520662009716034, -0.00042489677434787154, -0.016507143154740334, -0.018684957176446915, -0.01872895658016205, 0.05920778959989548, 0.006679943297058344, 0.021617356687784195, 0.041553858667612076, 0.03394605219364166, 0.04574620723724365, 0.02542225830256939, 0.05808807909488678, -0.026744602248072624, 0.035630788654088974, 0.036318663507699966, 0.043391235172748566, -0.02441922202706337, -0.021019821986556053, -0.06524381041526794, -0.05827146768569946, 0.04211366921663284, -0.0125234704464674, 0.0013157438952475786, -0.009494279511272907, -0.03717602416872978, 0.023060737177729607, -0.014938377775251865, 0.011157751083374023, -0.006688161753118038, 0.019949669018387794, 0.02413453347980976, 0.043268512934446335, 0.012183606624603271, 0.06102009490132332, -0.028530608862638474, -0.03634338825941086, -0.001635472639463842, 0.0033934451639652252, 0.0031277902889996767, -0.026029299944639206, 0.004196987487375736, -0.0068113310262560844, -0.03420601040124893, 0.0151924267411232, 0.02164824679493904, -0.05556434392929077, 0.06548451632261276, -0.0006622567889280617, 0.07589879631996155, -0.041694704443216324, -0.05841227248311043, 0.08571728318929672, -0.01071757823228836, -0.056938499212265015, 0.05017666518688202, -0.02987842634320259, 0.006362335756421089, -0.006134297698736191, 0.052562639117240906, -0.008783246390521526, -0.0029599228873848915, -0.014119681902229786, -0.018933456391096115, -0.0495394691824913, -0.03432367742061615, 0.03581317886710167, -0.0056211333721876144, -0.0024067258927971125, -0.023786252364516258, 0.03770841285586357, -0.05880453437566757, -0.039026644080877304, 0.03691806271672249, -0.00016758378478698432, 0.0183741245418787, 0.004603371489793062, 
-0.05707313492894173, 0.01410890743136406, -0.00782035756856203, 0.012815277092158794, -0.028129354119300842, -0.008286369033157825, 0.02819700725376606, -0.03927973657846451, 0.006148859858512878, 0.01959318108856678, -0.025396274402737617, 0.012298468500375748, -0.09595537185668945, 0.0047227670438587666, -0.03526449203491211, -0.057474374771118164, -0.07402008026838303, 0.024349568411707878, -0.03168272227048874, -0.007614485919475555, -0.008284451439976692, -0.006951712537556887, -0.02604733780026436, -0.025470634922385216, -0.03699528053402901, -0.015697602182626724, 0.03308829292654991, -0.01758158951997757, -0.04863951355218887, 0.0038741808384656906, -0.02015613578259945, -0.004733735229820013, -0.018855364993214607, -0.02182484231889248, -0.00022339388669934124, 0.01831820420920849, -0.018878508359193802, -0.012772711925208569, -0.005890439264476299, -0.0045585003681480885, -0.05636773258447647, -0.01608840748667717, -0.015610920265316963, -0.018006136640906334, 0.028169414028525352, 0.0003899271250702441, -0.03494356945157051, 0.015711816027760506, -0.06495672464370728, -0.051817432045936584, 0.014244604855775833, -0.02853020466864109, 0.04265983775258064, -0.006204280070960522, 0.03492254018783569, 0.007314922288060188, -0.028975356370210648, 0.03774525597691536, 0.005107431206852198, 0.03489163890480995, 0.0036567712668329477, -0.0473654568195343, -0.056677885353565216, -0.01815849356353283, -0.02115686610341072, -0.0013136386405676603, 0.056293804198503494, -0.031144550070166588, 0.015903647989034653, 0.007048492319881916, 0.01291101984679699, -0.036876216530799866, -0.034796588122844696, 0.02397921308875084, -0.0019162662792950869, 0.07357223331928253, 0.05003762245178223, -0.03789607062935829, -0.04260150343179703, -0.009326632134616375, -0.003326219506561756, -0.016510874032974243, -0.00461015896871686, -0.02242056466639042, -0.059765852987766266, -0.0892246887087822, -0.03362157195806503, -0.02304922789335251, 0.04977438598871231, 
0.0051648844964802265, -0.004510499071329832, -0.03382117301225662, -0.011208691634237766, 0.007185440510511398, 0.04579570144414902, -0.02245352976024151, -0.006119054276496172, 0.03581996262073517, 0.014924010261893272, -0.0234354380518198, -0.06068423017859459, -0.01039345096796751, 0.02547307126224041, 0.02183200605213642, -0.006565154530107975, -0.005035614129155874, -0.05343769118189812, 0.008762285113334656, 0.019363025203347206, 0.036154016852378845, 0.05440828576683998, 0.005783139728009701, 0.02355520986020565, 0.05823527276515961, 0.02451219968497753, 0.04743995890021324, -0.002265411661937833, -0.02139786258339882, -0.04496082291007042, -0.023157218471169472, 0.010303998365998268, 0.03627569600939751, -0.04733748733997345, 0.03725633770227432, -0.030602918937802315, -0.08086033165454865, -0.008938710205256939, 0.03394116833806038, -0.029304368421435356, 0.027318891137838364, -0.017679866403341293, -0.011306281201541424, 0.023995984345674515, -0.04946554824709892, -0.033561088144779205, 0.008230587467551231, 0.10280811786651611, -0.030022162944078445, -0.03447675332427025, -0.0120124202221632, 0.07735978066921234, 0.008378326892852783, -0.06732465326786041, 0.028196033090353012, -0.03044634871184826, 0.01848595403134823, -0.012632833793759346, 0.06076112017035484, -0.020371729508042336, 0.004058495629578829, 0.01929631642997265, -0.003344331169500947, -0.02968870848417282, 0.02977297455072403, 0.010498066432774067, -0.04256574809551239, 0.007990468293428421, 0.031897298991680145, 0.07808470726013184, -0.017049040645360947, 0.033926066011190414, 0.004207438789308071, 0.0440499372780323, -0.0443359911441803, 0.03048410639166832, 0.03308941423892975, 0.022352227941155434, -0.02332272008061409, 0.012854666449129581, -0.07122018188238144, -0.0006000956636853516, 0.04291792958974838, 0.062385495752096176, 0.028889354318380356, 0.013569467701017857, -0.043718840926885605, -0.07419683784246445, -0.03419077396392822, 0.07189968228340149, 0.04693587124347687, 
-0.010709425434470177, -0.018744129687547684, 0.009260585531592369, 0.008651681244373322, -0.06850480288267136, 0.002649689093232155, -0.0005230620736256242, 0.002815444953739643, 0.024545013904571533, -0.010616053827106953, 0.011593087576329708, 0.021925682201981544, 0.05801421031355858, -0.03604137897491455, -0.012206273153424263, 0.0009784932481124997, 0.026309004053473473, 0.022988567128777504, -0.010060916654765606, -0.017318347468972206, -0.014360317960381508, 0.04418138414621353, 0.031224694103002548, 0.02684205211699009, 0.015931297093629837, -0.04784568399190903, -0.008138603530824184, 0.03213050216436386, -0.015607457607984543, 0.0052072424441576, 0.050294168293476105, -0.021515650674700737, -0.02046050690114498, 0.013918804936110973, 0.028182370588183403, -0.03352099657058716, 0.07039376348257065, 0.029620075598359108, -0.03943828120827675, -0.007821516133844852, -0.045488372445106506, -0.005765561945736408, 0.05133604630827904, -0.008799396455287933, 0.02879945933818817, -0.010674664750695229, 0.028820769861340523 ]

Run time and cost

This model costs approximately $0.0037 to run on Replicate, or 270 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
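As a rough illustration of the arithmetic above, the runs-per-dollar figure follows directly from the approximate per-run price. The function names and the batch of 1000 runs are made up for this sketch, and real costs vary with the inputs.

```python
def runs_per_dollar(cost_per_run: float) -> int:
    """Whole number of runs that fit in one dollar at the given per-run cost."""
    return int(1 / cost_per_run)


def estimated_cost(num_runs: int, cost_per_run: float = 0.0037) -> float:
    """Approximate total cost in dollars for num_runs predictions."""
    return num_runs * cost_per_run


print(runs_per_dollar(0.0037))          # 270
print(round(estimated_cost(1000), 2))   # 3.7
```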

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 17 seconds. The predict time for this model varies significantly based on the inputs.

Readme

nomic-embed-text-v1: A Reproducible Long Context (8192) Text Embedder

nomic-embed-text-v1 is a text encoder with an 8192-token context length that surpasses the performance of OpenAI text-embedding-ada-002 and text-embedding-3-small on short- and long-context tasks.

Name                        SeqLen  MTEB   LoCo   Jina Long Context  Open Weights  Open Training Code  Open Data
nomic-embed-text-v1         8192    62.39  85.53  54.16
jina-embeddings-v2-base-en  8192    60.39  85.45  51.90
text-embedding-3-small      8191    62.26  82.40  58.20
text-embedding-ada-002      8191    60.99  52.7   55.25

Hosted Inference API

The easiest way to get started with Nomic Embed is through the Nomic Embedding API.

Generating embeddings with the nomic Python client is as easy as:

from nomic import embed

output = embed.text(
    texts=['Nomic Embedding API', '#keepAIOpen'],
    model='nomic-embed-text-v1',
    task_type='search_document'
)

print(output)

For more information, see the API reference.

Data Visualization

Click the Nomic Atlas map below to visualize a 5M sample of our contrastive pretraining data!


Training Details

We train our embedder using a multi-stage training pipeline. Starting from a long-context BERT model, the first unsupervised contrastive stage trains on a dataset generated from weakly related text pairs, such as question-answer pairs from forums like StackExchange and Quora, title-body pairs from Amazon reviews, and summarizations from news articles.

The second finetuning stage leverages higher-quality labeled datasets, such as search queries and answers from web searches. Data curation and hard-example mining are crucial in this stage.

For more details, see the Nomic Embed Technical Report and corresponding blog post.

The training data for the models is released in its entirety. For more details, see the contrastors repository.

Usage

Note: nomic-embed-text requires prefixes! We support the prefixes [search_query, search_document, classification, clustering]. For retrieval applications, prepend search_document to all your documents and search_query to your queries, each followed by a colon and a space (for example, 'search_query: What is TSNE?').
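A minimal sketch of applying these prefixes before encoding. The helper name add_prefix and the sample document sentence are hypothetical and only illustrate the 'prefix: text' format used in the examples in this README.

```python
# Task prefixes listed in the note above.
TASK_PREFIXES = ("search_query", "search_document", "classification", "clustering")


def add_prefix(texts, task):
    """Return a new list with '<task>: ' prepended to every text (hypothetical helper)."""
    if task not in TASK_PREFIXES:
        raise ValueError(f"unknown task prefix: {task!r}")
    return [f"{task}: {text}" for text in texts]


# Documents and queries get different prefixes in a retrieval setup.
documents = add_prefix(["TSNE is a dimensionality reduction algorithm."], "search_document")
queries = add_prefix(["What is TSNE?"], "search_query")
print(documents)
print(queries)
```

The prefixed strings can then be passed to any of the encoding snippets in this section in place of the raw text.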

Sentence Transformers

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("nomic-ai/nomic-embed-text-v1", trust_remote_code=True)
sentences = ['search_query: What is TSNE?', 'search_query: Who is Laurens van der Maaten?']
embeddings = model.encode(sentences)
print(embeddings)
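Once embeddings are computed, a common way to compare them is cosine similarity. The README does not prescribe a similarity measure, so treat this as one reasonable choice and not the authors' method. The sketch below uses plain Python and toy 3-dimensional vectors as stand-ins for real embeddings, and the function names are made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar to the query."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


# Toy vectors standing in for a query embedding and three document embeddings.
query = [1.0, 0.0, 0.0]
docs = [[0.0, 1.0, 0.0], [0.9, 0.1, 0.0], [0.5, 0.5, 0.0]]
print(rank_documents(query, docs))  # [1, 2, 0]
```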

Transformers

import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModel

def mean_pooling(model_output, attention_mask):
    token_embeddings = model_output[0]
    input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
    return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)

sentences = ['search_query: What is TSNE?', 'search_query: Who is Laurens van der Maaten?']

tokenizer = AutoTokenizer.from_pretrained('bert-base-uncased')
model = AutoModel.from_pretrained('nomic-ai/nomic-embed-text-v1', trust_remote_code=True)
model.eval()

encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')

with torch.no_grad():
    model_output = model(**encoded_input)

embeddings = mean_pooling(model_output, encoded_input['attention_mask'])
embeddings = F.normalize(embeddings, p=2, dim=1)
print(embeddings)
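To make the mean_pooling step above more concrete, here is a plain-Python sketch of the same idea for a single sequence: token vectors are averaged, and padding positions (attention mask 0) are excluded. The function name masked_mean and the toy values are illustrative only; the real code operates on batched torch tensors.

```python
def masked_mean(token_embeddings, attention_mask):
    """Average the token vectors of one sequence, skipping padding (mask == 0)."""
    dim = len(token_embeddings[0])
    totals = [0.0] * dim
    for vector, keep in zip(token_embeddings, attention_mask):
        for j in range(dim):
            totals[j] += vector[j] * keep
    # Guard against division by zero, mirroring torch.clamp(..., min=1e-9).
    count = max(sum(attention_mask), 1e-9)
    return [t / count for t in totals]


# Two real tokens followed by one padding token that must not affect the mean.
tokens = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
mask = [1, 1, 0]
print(masked_mean(tokens, mask))  # [2.0, 3.0]
```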

The model natively supports scaling of the sequence length past 2048 tokens. To do so, change the tokenizer and model loading lines as follows:

- tokenizer = AutoTokenizer.from_pretrained('bert-base-uncased')
+ tokenizer = AutoTokenizer.from_pretrained('bert-base-uncased', model_max_length=8192)


- model = AutoModel.from_pretrained('nomic-ai/nomic-embed-text-v1', trust_remote_code=True)
+ model = AutoModel.from_pretrained('nomic-ai/nomic-embed-text-v1', trust_remote_code=True, rotary_scaling_factor=2)

Transformers.js

import { pipeline } from '@xenova/transformers';

// Create a feature extraction pipeline
const extractor = await pipeline('feature-extraction', 'nomic-ai/nomic-embed-text-v1', {
    quantized: false, // Comment out this line to use the quantized version
});

// Compute sentence embeddings
const texts = ['search_query: What is TSNE?', 'search_query: Who is Laurens van der Maaten?'];
const embeddings = await extractor(texts, { pooling: 'mean', normalize: true });
console.log(embeddings);


Citation

If you find the model, dataset, or training code useful, please cite our work:

@misc{nussbaum2024nomic,
      title={Nomic Embed: Training a Reproducible Long Context Text Embedder}, 
      author={Zach Nussbaum and John X. Morris and Brandon Duderstadt and Andriy Mulyar},
      year={2024},
      eprint={2402.01613},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}