Language – Page 2 – Hi, I'm Rauf Aliev.

Exploring English: Verbs, Misunderstandings, and Learning Through Contrast | March 06 2026, 23:57

About the English language. When Yuki sees another dog, he adorably places his chin on the ground and presses his paws to his face, but I have to tell him every time not to approach because once he lets them get closer, he suddenly starts growling and instigating a fight. And what verb would you choose for that?

Well, from school I knew that roar meant growl. And I even told everyone “roar” for the first week until I googled it and realized that in roar, it’s tigers, lions, and motorcycles, but for dogs, it’s growl or even snarl (with teeth showing).

Or take the phrase “cook food.” To cook comes to mind, but actually, to cook implies thermal processing (fire, stove). If you’re “cooking” a salad, tea, or a sandwich, a native speaker would say make. Saying “I’m cooking salad” is like you decided to boil it.

Or suppose you decided to watch a movie. In English, the choice of verb depends on where you are and how large the screen is. When you go to the cinema, you use the verb see. “Let’s go see the new Dune movie at the cinema.” If you say “I watched a movie at the cinema,” they’ll understand, but it sounds a bit technical, as if you were sitting there closely studying the screen like a security guard monitoring it.

But. When you turn on your television, laptop, or projector in your living room, watch comes into play. The verb watch implies extended attention to something on a smaller (relative to theater) screen. By the way, if the screen is off, you look at it (as an item). Once you turn it on and a picture appears, you start to watch it.

Generally, for an advanced level, it makes sense to attach each concept to a scale, to remember the words in shades of intensity. For example,

Cry -> Weep -> Sob.

Annoyed -> Irritated -> Angry -> Furious -> Livid.

Smile -> Chuckle -> Laugh -> Giggle -> Guffaw

Spitting -> Drizzling -> Raining -> Pouring

and so on.

And then further distinguish them by paired opposites, like the smile-cry from the example above.

It’s very easy to remember when put together.

But it’s necessary to try to apply them, otherwise it’s no good. Some words may be bookish, and here it’s important in what context it is said. If you told a friend in a pub: “I cannot comprehend this beer” – it would sound as if you’re writing a dissertation on that beer

Exploring LLMs and AI: Connecting Neural Processors to Natural Language Learning | February 15 2026, 15:41

Some thoughts on LLMs and artificial intelligence in general. And in the end about neuromorphic processors and Intel Loihi.

As you all know, fundamentally LLMs operate on the principle of “propose the likely next word using the context from the previous N words,” and then the word enters the context, and the process repeats all over again for the next word. Well, and the context is also processed considering the importance of words.

Now let’s think about how children were taught languages in primitive societies. There were no alphabets, nor grammar. But the grammar itself, according to estimates, was quite complex—based on observations of the small languages of small peoples. Simple grammar is modern when the language has spread to millions and billions.

That is, a child’s brain had to reconstruct grammar in its neurons simply from the flow of speech from those around and through testing the understanding of what was said. It’s likely that the child was corrected if they spoke incorrectly, but somehow this grammar and sound extraction had to settle in the brain—and here the same mechanism as in LLMs is used: which words/sounds go next in what context is determined by latent and uninterpretable rules, which each person in childhood creates in their brain in their own way. That is, roughly speaking, it trains the ML model every time from scratch on the flow of speech from those around. A child does not know what a “case” is, but feels what ending is statistically more likely in a given context.

Actually, modern cognitive science (Karl Friston’s theory) asserts that the brain is literally a “prediction machine.” We constantly generate hypotheses about the next sound or word and correct them when they don’t match (prediction error).

The peculiarity of LLMs is that for them, teachers are texts and images, but for a child’s brain, it’s the living world around, and if all the texts they hear were digitized, their volume wouldn’t even be enough to train a very weak model. LLM sees the word “apple” next to the word “red.” A child sees an apple, feels its smell, taste, weight, and simultaneously hears the sound. This “stitching” of different sensory channels allows building neural connections thousands of times faster than on plain text. That is, modern LLMs take a brute force approach—simply observing the speech of billions, not just their immediate environment. A good question is how the human brain manages to learn from a relatively small dataset. However, it’s a big question whether this dataset is small—for example, lip movements, facial expressions, context provide a lot for building this neural network in the biological brain.

About the context: unlike LLMs, a child understands the speaker’s intention. If mom looks at a cup and says “hot,” the child’s brain limits the search space of meanings to one cup. And if he didn’t understand, he’ll get burned and remember.

One might assume, of course, that the brain already has a ready network at birth. It’s true, but science can’t yet explain it properly. Our entire genetic program has about 20,000 genes encoding proteins, and these 20,000 are responsible for everything—where and how the lungs, heart, bones, blood should be built, and they themselves are of mind-boggling complexity, and somewhere among 3 billion nucleotides and 20,000 genes this information must be recorded.

Apparently, genes encode not a map but an algorithm of self-assembly. Essentially, the architecture of the neural network is built dynamically, and this process begins long before birth. Then it is calibrated by all the signals received by the unborn child, and by the time of birth, there is already a somewhat tuned network in the brain.

It’s likely that the child’s brain is millions of neural networks of different “architectures” that evolve and merge in the learning process. Unlike LLMs, here learning and usage are strictly separated in time. But most importantly—the brain, although the most energy-consuming in the body, consumes very little energy in absolute terms, especially compared to the current “candidates for replacements in hardware.”

In the last few years, there has been active development in the field of neuromorphic systems (for example, the old IBM TrueNorth processor and the actively developing Intel Loihi). In conventional AI, neurons transmit numbers (0.15, 0.88…). In neuromorphic systems, they transmit “spikes” (impulses)—as in the living brain (and the architecture is called Spiking Neural Network – SNN). A few years ago, Intel released Loihi 2. Fully programmable. Neurons on Loihi can change their connections (synapses) right during operation. Supports plasticity—the very biological mechanism when the connection between neurons is strengthened if they often “fire” together. But the main thing—it consumes very little.

In this architecture, the model can continue learning “on the fly” right during operation, without forgetting old data (Continual Learning). Besides that—extreme energy efficiency.

Loihi 2 cannot multiply matrices as modern GPUs do, so completely new software has to be written for them (and this is moving very slowly). No PyTorch or TensorFlow—for Loihi there is only the Lava framework available today. And 1 million neurons from Loihi 2 is very little for LLMs. Therefore, Intel creates systems like Hala Point—it’s an array of 1152 Loihi 2 processors. It contains up to 1.15 billion neurons. Theoretically, in terms of performance per watt, such a system can surpass traditional GPUs by 10–50 times when working with AI models.

Experimental LLMs are already being launched on Loihi 2 (for example, models with 370 million parameters). They are not yet going to replace ChatGPT in the cloud, but theoretically, they are the future for “smart” robots and gadgets that need to understand human speech while running off a small battery.

We’ll observe. It might turn out to be a dud, or it could be another major revolution.

From Camels to Bishops: The Fascinating Evolution of Chess Pieces | February 14 2026, 16:24

It all started with a question – why does the elephant ♗ have this notch? And in general, where is the elephant, and where is the bishop, and is this notch about the elephant or the bishop? Anyway, listen to what I dug up, there’s a lot of interesting stuff here.

Chess originates from India. There, this figure was initially called a camel. And their elephant was what we call a rook – which if you think about it, a rook is basically a boat – or in English, rook, which if you think about it in Persian, it means chariot.

The name “Tura”, which we often hear in colloquial speech, is a pure import from Europe. In French – tour. In Italian – torre. In Latin – turris. All of these mean the same thing: tower. When chess arrived in Europe, knights and monks didn’t really understand what a “battle chariot” was (they were out of fashion by then), but they knew very well what a siege tower was.

So, returning to the elephant and the notch.

The short answer – to distinguish it from a pawn. But there’s a long answer.

When chess came to Europe, the Indian camel was switched to the Catholic bishop, and thus the piece was named bishop. The notch supposedly symbolizes a miter – the high headgear of clergymen. That’s precisely why in English the piece is called bishop. Though to me, it’s just a mouth from the Muppet show.

Interestingly, in French, it’s le fou – the jester. In German, it’s Läufer – runner. In Greek – officer (Αξιωματικός). Why officer? I don’t know, but I dug up that in Chinese chess, xiangqi (象棋), the “elephant” piece is indicated and pronounced as xiàng (象). This character indeed means “elephant.” However, in Chinese history, there was a high state office called xiàng (相), usually translated as “chancellor,” “prime minister,” or “chief minister.” This is a different character, although the pronunciation coincides. Probably, the officer comes from here too.

The chess knight is almost a horse in all languages, only in English and a few others, it’s a knight (although, in German, for example, it’s Springer – jumper, and in Sicily – donkey).

So, in German, there is a jumper and a runner. And a little horse in German is actually a king.

I also learned that there are ready-made solutions for ANY chess endgame in which there are seven or fewer pieces on the board, regardless of the position, the composition of the remaining pieces, or possible moves. This information, known as endgame tables, currently occupies 18.4 terabytes.

from the comments: “The most interesting thing is that this week a multi-year work was completed, and there is now a ready solution for any position with 8 pieces or fewer (7 pieces was already about 12 years ago, but there’s a very big difference)”

Interactive Text Enhancer: A Tool for Embedding Clarifications | February 12 2026, 16:11

I whipped up this thing in just an hour. Do you think anyone besides me needs it?

Here’s the idea. Take any text – a Wikipedia article, for example. Highlight any segment, say something unclear. The LLM gives us an explanation, and instantly inserts a box right in the text which you can click to open the explanation. In this explanation, there might be something unclear too. We highlight it with the mouse from this explanation, and a box appears there too. This continues until everything is clear. All the boxes remain in the text, so you can always return to them. So, if the idea was unclear to me, maybe it will be to others, and then a ready link with explanations will come in very handy. The result can be shared with colleagues.

For explanations, not just the fragment is used, but also the context. For example, otherwise, the highlighted word Terrier would yield text about a dog breed, not about the search system.

Navigating the Confusion of Ergative Verbs in English | January 27 2026, 00:52

In English, ergative verbs cause me significant cognitive confusion. These are verbs that can be used in both directions: written, people change can be translated as “people change” and “people change themselves”.

For example, on the screenshot right now “illustrator will install next”. Somehow not will be installed next.

Or the sentence “she photographs well” is understood as “she is photogenic”.

“The book sold 1mln copies”, obviously not about the book’s ability to sell.

Unveiling Scientific Misnomers: A Cross-Cultural Exploration | January 14 2026, 04:46

Today I was surprised to learn that the Coriolis force is pronounced as CoriolIs force, not coriOlis force as we were taught in school. I started to investigate what else was wrong, and discovered something amazing.

It turns out what we called Gay-Lussac’s law is known as Charles’s Law in the rest of the world, and what we called Charles’s Law is known throughout the world as Gay-Lussac’s Law.

The Cartesian coordinate system here is Carthesian. Cartesius is just the Latinized name of René Descartes.

In our textbooks, the law of conservation of mass is called the Lomonosov-Lavoisier Law (what enters the chemical reaction = mass of the substances formed). In the rest of the world, it is exclusively the Law of Lavoisier (Lavoisier’s Law). Lomonosov got included here only because “whatever is taken from one body is added to another”.

Also, it turns out that if you have to explain Pythagoras’ theorem to someone in English, without a hint, it’s absolutely impossible to guess that it’s Pythagoras. Greek names are generally a mess. Thales here is pronounced as Teelis.

For some reason, in physics Roentgen is called RentgEnom, although it’s Röntgen with the emphasis on ö.

In Russia, a trapezoid is a quadrilateral with two sides parallel and two not. In the USA, our trapezoid is known as Trapezoid, and the word Trapezium here refers to a quadrilateral with no parallel sides at all. In the UK, it’s the opposite. Our trapezoid is Trapezium, and the “skewed” quadrilateral is Trapezoid.

Exploring Identity and Survival in “Avatar 3: A Journey of Relocation” | January 06 2026, 17:34

After watching Avatar 3, we decided to rewatch the first and second movies. Watched it like it was the first time, but here’s what I thought.

For the family, relocation was an urgent rescue from physical annihilation or forced participation in a war. Moving, they encountered the necessity to “learn to swim” in a new legal, linguistic, and social environment, starting from scratch and losing their former social weight. The feeling of “we are strangers here” is the central emotion. Severance of ties with friends and colleagues, only the “nuclear family” remains as the sole island of identity. Essentially, Jake’s decision to flee to save his children is the fundamental dilemma of any parent in a conflict zone: fight to the end on their own land or leave to preserve the life of the next generation.

Upon arrival, they hardly receive a visa, and permanent residency isn’t promised. But eventually, it becomes clear that it’s impossible to hide from a global conflict geographically. Sooner or later one has to participate in protecting their new “reef.”

Jake’s children and he himself have five fingers, whereas purebred Na’vi have four. Plus, the accent. This is a constant visual reminder of their origin. Even if you are fully integrated, there is always a detail that marks you as an outsider. Your children may become “one of them” faster, but they still carry the mark of “hybridity.”

By the way, in the third part, all the blues already speak English. The Na’vi language was completely displaced by them.

P. S. By the way, it’s interesting that Jake didn’t bring any of humanity’s achievements to the new culture of Pandora at all. I don’t know, the wheel, fire, medicine, some mechanical stuff. Nothing.

Exploring the Magic of Neural Networks in Letter Prediction and Visualization | December 14 2025, 23:35

I am currently experimenting with training simple neural networks – primarily to automate the existing toolkit, and some things just seem like magic.

There is a database of 32,000 names. There is a neural network filled with random numbers. I start training, with only this list of names as input. The first layer of the neural network is embeddings, and I set the number of dimensions to 2 for easy visualization. And after 200,000 iterations of training, the system clearly separates vowels from consonants, and for some reason, places the letter “q” slightly apart from other consonants. It seems that this is because the letter ‘q’ almost exclusively predicts the letter ‘u’ (Queen, Quincy, Quentin).

It also very reliably separates vowels and consonants in Russian names. In Russian names, the letters b and l are somewhat away from the other consonants, as are the soft and hard signs (well, that’s understandable).

I wonder how it works. If trained on a normal corpus of texts, the difference would be very clear. Why are vowels separated from consonants? Apparently, from the network’s mathematical perspective, ‘a’ and ‘o’ serve the same function: they “trigger” the prediction of the consonant following them, so the alternation of vowels and consonants is to blame. But damn, it’s interesting 🙂

And since the model can predict the next letters, you might try running it on Russian. On a model with 30-dimensional embeddings, it invents names like: Byaketta, Afsena, Erakey, Zasbat, Daraya, Gaiomahad, Rain, Razhul, Gzhatsiy, Reben, Vureb, Durodira, Turuzhul, Regravgava, Razsan, Gabila, Avganzh, Raksi, Khalebkokhorta, Rather. The model – for those who understand – is this: input of 6×33 characters (because we take up to 6 characters of context), encoded into embeddings of 60, goes to a layer of 100 neurons, and from there back to 33 characters. Some nonsense, but at least it’s clear how it all works at all levels.

In-Flight French: Building a Language App on the Fly | December 01 2025, 15:45

By the way, yesterday morning, while waiting at the gate for my flight to Miami, I quickly wrote a French language learning app using Gemini based on an idea I sketched out to a friend while driving to the airport, and then used this app during the flight.

The idea is that in an unfamiliar foreign language text, the user first marks unknown words and then sees their translations — but without the original text, and then returns to the text itself — but no longer seeing the translations. It’s as if the “dictionary was in the next room.” The hypothesis is that this method helps better memorize than when the translation is shown immediately upon clicking on a word, and when no effort is needed.

I am pleased that creating the app from scratch to the finished version took only about 35-40 minutes, and then I used it for some time during the flight, without the internet. Since all translations of all words/phrases were already made in advance.

I just deployed it on Render. It’s also nice that demonstrating the code in action was free and took another 10 minutes.

https://readandlearn.onrender.com/

Lost in Translation: Modernizing Opera Subtitles | November 19 2025, 02:31

This must be about my tenth staging of Le Nozze di Figaro. And I still can’t understand why no one ever bothers to make modern, well-constructed subtitles instead of something that resembles a product of prehistoric “Google translate”. Every single line is translated from Italian in such a barbaric way that it’s about to make one’s eyes bleed. And for what reason? The Italian is certainly not modern either, but if you undertake to translate, then do so in a way that the meaning can be grasped within those tens of milliseconds when your eyes dart to the screen. Now, you land on a text that takes a minute to chew over… Every woman makes me change colour… Instead of make me blush or make me pale… If you are fain to dance… And all that stuff.