Discussion about this post

User's avatar
The Strategic Linguist's avatar

My hero - thank you for this. I've spent the best parts of a decade trying to get people to understand the importance of human linguistics in tech and digital products. LLMs are large LANGUAGE models and yet we pour a lot of time and resources to understand computational lingusitics and conversational design (which are needed, don't get me wrong!) but there's more to language, we need to better understand the user language - human language, pragmatics, sociolinguistics, power, hierarchy, identity, authority.

Nazar Bartosik's avatar

I like the criticism of the blind scoring, but I don't quite understand the point of "smelling". If you have a tool advanced enough to catch those biases and formulate them so well, wouldn't it then do the translation itself applying those corrections already? I feel like this is the kind of test that works on texts made by less advanced tools, while translations made with a model that is aware of all these nuances wouldn't detect its own "smells". Or am I missing something?

3 more comments...

No posts

Ready for more?