Ryan Wild @wild1145

0 posts0 participants0 posts today

**Piotr Migdał** @pmigdal@mathstodon.xyz · Jan 14

Piotr Migdał @pmigdal@mathstodon.xyz

I’m excited to share my newest blog post, "Don't sure cosine similarity carelessly"

https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity

We often rely on cosine similarity to compare embeddings—it's like “duct tape” for vector comparisons. But just like duct tape, it can quietly mask deeper problems. Sometimes, embeddings pick up a “wrong kind” of similarity, matching questions to questions instead of questions to answers or getting thrown off by formatting quirks and typos rather than the text's real meaning.

In my post, I discuss what can go wrong with off-the-shelf cosine similarity and share practical alternatives. If you’ve ever wondered why your retrieval system returns oddly matched items or how to refine your embeddings for more meaningful results, this is for you!
`
I want to thank Max Salamonowicz and Grzegorz Kossakowski for their feedback after my flash talk at the Warsaw AI Breakfast, Rafał Małanij for inviting me to give a talk at the Python Summit, and for all the curious questions at the conference, and LinkedIn.

p.migdal.plDon't use cosine similarity carelesslyCosine similarity - the duct tape of AI. Convenient but often misused. Let's find out how to use it better.

#cosineSimilarity #embedding #llm

**junwin** @junwin@photog.social · Dec 18, 2024

Dec 18, 2024

junwin @junwin@photog.social

wheres the damn key

A monochrome image of locked mailboxes in a post office. The composition highlights themes of security and hidden thoughts, with an array of lockers in focus, evoking a sense of mystery and introspection

The image shows a room filled with mailbox lockers at a post office. In the foreground, there is a large set of square lockers with metal locks. In the background, rows of smaller mailboxes line the walls. The space is brightly lit from overhead lights, and the floor is dark and shiny

ALT

#pattern #bnw #monochrome

**junwin** @junwin@photog.social · Dec 12, 2024

Dec 12, 2024

junwin @junwin@photog.social

evanston post office

the image shows a room with rows of uniform mailboxes lining the walls. In the center, there is a large, rectangular structure also made of metal boxes with locks. The room is well-lit with fluorescent lights, and the floor is shiny, reflecting the light above.

ALT

#postoffice #mailbox #pattern

**Patryk Krawaczyński** @agresor@infosec.exchange · Dec 3, 2024

Dec 3, 2024

Patryk Krawaczyński @agresor@infosec.exchange

TLSH – Trend Micro Locality Sensitive Hash dla malware i phishingu ( https://nfsec.pl/security/6469 ) #malware #phishing #similarity #detection #algorithms #twittermigration

https://www.youtube.com/watch?v=n3cu2kgkNTU

nfsec.plNF.sec – Bezpieczeństwo systemu Linux - TLSH – Trend Micro Locality Sensitive Hash dla malware i phishinguS króty kryptograficzne, takie jak MD5 i SHA[1/2] są wykorzystywane w wielu zastosowaniach związanych z eksploracją danych i bezpieczeństwem – służąc jako identyfikatory plików wykonywalnych i dokumentów tekstowych. Jednak ich rolą jest oznaczanie unikalności, nie podobieństw – jeśli zostanie zmieniony pojedynczy bajt pliku, wówczas skróty kryptograficzne dają zupełnie inne wartości. Ich działanie jest bardzo przydatne […]

**Stephanie** @HarmonthSeeker@caneandable.social · Nov 21, 2024

Nov 21, 2024

Stephanie @HarmonthSeeker@caneandable.social

In tonight’s tutorial, we talked about how psychology makes sense of emotions like love. Is it all subjective? Maybe not as much as you think. Love can be studied scientifically—not just through brain scans showing neurotransmitters like dopamine and oxytocin but also through observable behaviours.

Take reciprocity: when someone likes us, we’re more likely to like them back. Or similarity: shared values, interests, and beliefs often strengthen attraction. Add proximity and familiarity—the magic of repeated exposure—and you start seeing how relationships form. Then there’s complementarity: where differences between people don’t divide but balance and enhance connection.

What fascinates me is how psychology bridges the personal and the universal. Yes, love feels deeply subjective, but studies show patterns we can test and measure—observing behaviours, tracking emotional responses, and using physiological data to explore what we feel.

Have you noticed these dynamics in your relationships? #Psychology #Love #Relationships #Emotions #Reciprocity #Similarity #Proximity #Familiarity #Complementarity

**PsyPost** @PsyPost@mstdn.social · Nov 11, 2024

Nov 11, 2024

PsyPost @PsyPost@mstdn.social

Married couples’ vocabulary sizes align, hinting at selection based on intelligence cues https://www.psypost.org/married-couples-vocabulary-sizes-align-hinting-at-selection-based-on-intelligence-cues/?utm_source=dlvr.it&utm_medium=mastodon #MarriedCouples #VocabularySizes #IntelligenceCues #MaritalPartners #Similarity

M @m_enby@mastodon.social · Sep 13, 2024

Sep 13, 2024

M @m_enby@mastodon.social

Similarity and continuity are manifestations of difference. Just specific types of differences repeatedly constructed over a certain period of time-space.

#philosophy #difference #similarity

**Nicola Asuni** @nicolaasuni@mastodon.social · Jul 20, 2024

Jul 20, 2024

Nicola Asuni @nicolaasuni@mastodon.social

To manage passwords in #golang, check out the following #gosrvlib packages:

* Generate new #passwords: https://pkg.go.dev/github.com/Vonage/gosrvlib/pkg/random.
* Check for #similarity with existing passwords: https://pkg.go.dev/github.com/Vonage/gosrvlib/pkg/stringmetric.
* Compare passwords against #compromised ones (#pwned): https://pkg.go.dev/github.com/Vonage/gosrvlib/pkg/passwordpwned.
* Secure password #hashing, #storage, and #verification: https://pkg.go.dev/github.com/Vonage/gosrvlib/pkg/passwordhash (based on #OWASP Password Storage Cheat advice).

See also: #awssecretcache, #encrypt, #jwt, #redact, #randkey.

pkg.go.devrandom package - github.com/Vonage/gosrvlib/pkg/random - Go Packages

**RDN** @rdnielsen@floss.social · May 28, 2024

May 28, 2024

RDN @rdnielsen@floss.social

Just had my first opportunity to apply Mainali et al.'s new (2022) alpha-hat co-occurrence similarity index (https://www.science.org/doi/10.1126/sciadv.abj9204). It's pretty impressive, and somewhat chastening, that this new metric invalidates and replaces other similarity measures that have been used in ecology for decades (even more than a century).

#Science #Ecology #Similarity

**Gregory B Sadler** @GregSadler@metalhead.club · Mar 23, 2024

Mar 23, 2024

Gregory B Sadler @GregSadler@metalhead.club

Plutarch notes that friendship arises out of and is sustained by similarities of habits, character, outlooks, feelings, & other matters. Flatterers know this, and for that reason imitate similarity with their targets

https://youtu.be/uTw_6PBnOW4
#Video #Plutarch #Friendship #Flattery #Imitation #Similarity

YouTubePlutarch, How To Tell A Flatterer From A Friend | Friendship, Similarity, & Flattery | Core ConceptsBy Gregory B. Sadler

**रञ्जित (Ranjit Mathew)** @rmathew@mastodon.social · Mar 11, 2024

Mar 11, 2024

रञ्जित (Ranjit Mathew) @rmathew@mastodon.social

Interesting:

"The Tyranny Of The Algorithm: Why Every Coffee Shop Looks The Same", The Guardian (https://www.theguardian.com/news/2024/jan/16/the-tyranny-of-the-algorithm-why-every-coffee-shop-looks-the-same).

The Guardian · Jan 16, 2024The tyranny of the algorithm: why every coffee shop looks the sameBy Kyle Chayka

#Design #Interiors #InteriorDesign

**Hacker News** @ycombinator@rss-mstdn.studiofreesia.com · Feb 24, 2024

Replied to grimmiges

**grimmiges** @grimmiges@ecoevo.social · Feb 7, 2024

Feb 7, 2024

grimmiges @grimmiges@ecoevo.social

PS If they only had a slightly invested #phylogeneticist at hand; they easily could have learned a lot about the strengths and weaknesses of their data and preferred tree (a Bayesian MRC, by the way, is a summary tree of various competing topologies sampled in the MCMC chain, not a phylogenetic tree)

Here's a quick #NeighbourNet based on their "toutes" matrix (inferred in less than a minute), annotated.
Overall #similarity makes #clades, surprise, surprise.

#PhyloNetworks #linguistics

**Jacob Something** @jkanev@fediscience.org · Dec 22, 2023

Dec 22, 2023

Jacob Something @jkanev@fediscience.org

#Music #Hatikva #Similarity

Ist euch eigentlich mal aufgefallen, dass "Alle meine Entchen" in moll gesungen
die israelische Nationalhymne ergibt?

**Jef Allbright** @jef@mathstodon.xyz · Nov 22, 2023 *

Nov 22, 2023 *

Jef Allbright @jef@mathstodon.xyz

@johncarlosbaez @gregeganSF

Curse of Dimensionality
https://en.wikipedia.org/wiki/Curse_of_dimensionality

"Dimensionally cursed phenomena occur in domains such as numerical analysis, sampling, combinatorics, machine learning, data mining and databases. The common theme of these problems is that when the dimensionality increases, the volume of the space increases so fast that the available data become sparse. In order to obtain a reliable result, the amount of data needed often grows exponentially with the dimensionality. Also, organizing and searching data often relies on detecting areas where objects form groups with similar properties; in high dimensional data, however, all objects appear to be sparse and dissimilar in many ways, which prevents common data organization strategies from being efficient. "

en.wikipedia.orgCurse of dimensionality - Wikipedia

#analytics #similarity #optimization

**Lukas Oppermann** @lukasoppermann@mastodon.social · Nov 10, 2023

Nov 10, 2023

Lukas Oppermann @lukasoppermann@mastodon.social

#Similarity, a #gesaltPrinciple to make or break your layout: https://www.chrbutler.com/gestalt-principles-of-design-similarity

Similarity:
- creates text hierarchy e.g. headlines that are similar but body text that differs from headlines
- suggests equal importance, e.g. items that are the same size
- improves #scanability e.g. article previews that are similar are easy to scan

My take: Break similarity purposely to highlight e.g. your latest project

Example showing how similar items make websites easier to scan

ALT

**Karsten Schmidt** @toxi@mastodon.thi.ng · Oct 18, 2023

Oct 18, 2023

Karsten Schmidt @toxi@mastodon.thi.ng

#HowToThing #023 — Responsive & reactive image gallery with tag-based Jaccard similarity ranking/filtering using https://thi.ng/bitfield, https://thi.ng/rstream & https://thi.ng/rdom

A quite common comment about #ThingUmbrella is that people often have little idea what some of the ~185 packages are even good/intended for and/or how to synthesize solutions from these small, individual building blocks. IMHO this is less about these packages themselves and more down to existing blank spots about the underlying concepts, algorithms and their potential role/utility in a larger problem domain... So I very much hope this new example is also useful in this respect!

Alas, the full code for this got pretty long and contains a lot more UI stuff. I'm intending to develop this further for the new homepage to browse all ~135 #ThingUmbrella examples (and maybe even for parts of the https://thi.ng website itself)... For those of you interested in more "advanced" https://thi.ng/rdom examples, do check it out!

Background info:
https://en.wikipedia.org/wiki/Jaccard_index

Demo:
https://demo.thi.ng/umbrella/related-images/

Full source code:
https://github.com/thi-ng/umbrella/tree/develop/examples/related-images/src/

The important parts re: using compact binary encoding, bitfields & Jaccard similarity to find related items are here:

https://github.com/thi-ng/umbrella/blob/fc5db1c7a2b9083b40e9be5d6002db937b5a8267/examples/related-images/src/data.ts#L191-L225