four things tagged “lists”

Naughty Letter Frequencies in English

Here’s a community-maintained "List of Dirty, Naughty, Obscene, and Otherwise Bad Words" across various languages on Github. I was curious about a naïve frequency distribution of consonants across the English-language corpus (NSFW, obviously) and wrote a small script. Here are the results:

Letter Count
t 211
s 208
n 193
r 186
l 167
g 147
c 124
b 121
p 116
h 97
d 91
m 91
k 72
y 70
f 48
w 41
v 29
j 21
x 19
z 7
q 5

Not sure what I’m going to do with this information but here it is. 🤬

Netflix’s “Secret” Genre List

A list of sub-genres you cannot view easily on Netflix. From a Reddit thread on the subject:

This is ridiculous. What kind of hubris does Netflix have to think that their recommendation engine is better than browsing by category? Browsing by category has been the standard for browsing things since categories of things has existed. Some VP of product made his bonus by convincing someone that his ML team could do better. “Yeah, just remove it and let us populate 15 movies randomly in a whimsical fictitious category like ‘movies with dogs and music’. People will love it.”

and

Because the studios pay Netflix (via discounted licensing) for favorable placement on those “recommended viewing” lists.

Always follow the money.