Lobste.rs

Abstract: Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being proposed without proper evaluation. To solve this problem, we introduce the Massive Text Embedding Benchmark (MTEB). MTEB spans 8 embedding tasks covering a total of 58 datasets and 112 languages. Through the benchmarking of 33 models on MTEB, we establish the most comprehensive benchmark of text embeddings to date. We find that no particular text embedding method dominates across all tasks. This suggests that the field has yet to converge on a universal text embedding method and scale it up sufficiently to provide state-of-the-art results on all embedding tasks. MTEB comes with open-source code and a public leaderboard at this https URL.

36

4

AI search should sustain open source, not just index it (www.ericholscher.com)

submitted 2 days ago by [email protected] to c/[email protected]

1 comment fedilink

37

2

Highlighting Text in Links with Text Fragments (calebhearth.com)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

38

1

Principles of Dependent Type Theory (www.danielgratzer.com)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

39

2

Magical Fibonacci Formulae (orlp.net)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

40

1

A conceptual model of ATProto and ActivityPub (fediversereport.com)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

41

1

Atomic Attributes in Local-First Sync – Adam Wulf (adamwulf.me)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

42

1

Object Pools (famicom.party)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

43

1

In Search of Types (www.humprog.org)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

Abstract: The concept of “type” has been used without a consistent, precise definition in discussions about programming languages for 60 years. In this essay I explore various concepts lurking behind distinct uses of this word, highlighting two traditions in which the word came into use largely independently: engineering traditions on the one hand, and those of symbolic logic on the other. These traditions are founded on differing attitudes to the nature and purpose of abstraction, but their distinct uses of “type” have never been explicitly unified. One result is that discourse across these traditions often finds itself at cross purposes, such as overapplying one sense of “type” where another is appropriate, and occasionally proceeding to draw wrong conclusions. I illustrate this with examples from well-known and justly well-regarded literature, and argue that ongoing developments in both the theory and practice of programming make now a good time to resolve these problems.

44

1

jspin: GUI for running the SPIN model checker (github.com)

submitted 1 day ago by [email protected] to c/[email protected]

0 comments fedilink

45

2

A Language A Day - A Collection Of Brief Overviews To 21 Programming Languages (andrewshitov.com)

submitted 2 days ago by [email protected] to c/[email protected]

0 comments fedilink
