Navigating Nabokov: A Companion Glossary for “Lolita” | April 08 2026, 11:24

I have finally finished the book The Reader’s Glossary – essentially a 5200-word dictionary for “Lolita” by Nabokov, but organized not alphabetically, like regular dictionaries, but in order of the occurrence of complex words, divided by chapters and indicating the context of the word or phrase. The website – readersglossary dot com (see the first comment). It is expected to be used, among other things, as a companion book while reading the original. Yes, it’s twice as thick 🙂

The dictionary turned out quite thick – 600-700 pages. It is available in four languages – Russian, English, French, and German. Moreover, the translations (RU, FR, DE) or clarifications (in ENG) are not abstract but contextual, taking into account how Nabokov himself translated the fragment from English (“Lolita” was first written in English, then translated into Russian).

On my website, there are huge fragments of these dictionaries RU, FR, DE, EN available for review (each about 1/3 of the total volume).

There is also a full-fledged interactive dictionary on the site, where you can enter a word and see its translation or explanation. The dictionary mainly contains complex words, but we know that complexity has its own definition for everyone, so all words are divided into three categories and highlighted with different frames. Probably for a well-read Anglophone, the first category (dotted) is completely useless (about 50% of the dictionary), for the less-read, maybe 20% are useless. But I decided not to cut it further, because the book is not only for Anglophones but also for those for whom English is a second language, and there those dotted frames are very handy.

Overall, I did this “for myself and friends,” just for fun, not as a commercial project. Therefore, I am quite sober in understanding that it has a super niche audience, and if even once a week someone finds it useful, it’s already nice.

Although it was something like a hobby, the book took a lot of time. To achieve what I did, I developed a dozen applications/scripts, a couple of which have their own interactive UI, in which I spent many hours over two months of work. And of course, I learned a lot in the process, which is actually the main fun of it.

So, come to the website – readersglossary dot com. Link in the comments

P.S. In Russian – only as a PDF for now. Amazon doesn’t allow selling books in Russian, only in a small number of European languages in addition to English. The French and German versions of the dictionary will be released on Amazon about a week from now.

Navigating the Lexical Complexity of Nabokov’s “Lolita” | April 02 2026, 15:56

I’ve finished the first version of a dictionary-style book on Nabokov’s “Lolita”. The chart shows how the complexity of vocabulary is distributed across the pages of the book. The lower chart averages 25 sentences, displaying the number of complex words on the vertical axis, with colors indicating their complexity/rarity (purple – the most complex, red – less complex, yellow – even less so). But I have already removed two levels, and overall, for a foreigner, all five levels are challenging. In the book, level 3 is marked with a dashed line, level 4 with a simple frame, and level 5 with a double frame. Currently, there are 5794 words, of which 541 are fifth level, 1070 are fourth, 1883 are third, 1393 are second, and 54 are first (the simplest ones). Considering that the first version ended up being 1148 pages, the dictionary will need to be significantly streamlined by removing what can be dispensed with. This mainly pertains to the first and second levels, and some from the third and fourth. The rarity of words is calculated in three ways: through LLM, and through two lists of word frequencies in the English language corpus (300K words).

Not all words are complex. For instance, in the sentence “With the ebb of lust, an ashen sense of awfulness, abetted by the realistic drabness of a gray neuralgic day, crept over me and hummed within my temples.” someone well-acquainted with English might not know the words ebb, abet, drabness, while everything else is familiar, but lower the requirements for the reader, and the dictionary might not be very useful for such cases.

Or consider the sentence:

Homo pollex of science, with all its many sub-species and forms; the modest soldier, spic and span, quietly waiting, quietly conscious of khaki’s viatric appeal; the schoolboy wishing to go two blocks; the killer wishing to go two thousand miles; the mysterious, nervous, elderly gent, with brand-new suitcase and clipped mustache; a trio of optimistic Mexicans; the college student displaying the grime of vacational outdoor work as proudly as the name of the famous college arching across the front of his sweatshirt; the desperate lady whose battery has just died on her; the clean-cut, glossy-haired, shifty-eyed, white-faced young beasts in loud shirts and coats, vigorously, almost priapically thrusting out tense thumbs to tempt lone women or sadsack salesmen with fancy cravings.

My browser even highlights four words here.

I have definitions of words in English, German, French, and Russian. I’ve encountered the issue that different words from the text are considered complex in different languages, yet they are unified for me. So, I’ll have to mark, for example, French words in the English text separately, so they are not included in the French version, since there, the reader knows, for instance, what quel mot means.

Overall, this weekend I’ll be manually removing about half, and then I can make the cover and list it on Amazon.

Exploring the Multifaceted Uses of “Oblong” in English and Russian | March 17 2026, 13:50

Sometimes in English, there are very unusual words that are very difficult to translate into Russian. Here, for example, is the word oblong. As an adjective, it translates as “elongated, oblong,” but in the book, both uses are nouns. Often oblong refers to a face – that is, close to an oval, but oblong is a broader concept that describes any figure having an elongated appearance. My mom bought an oblong tablecloth for her new table.

As a noun, it is also used, and quite frequently (though less so than as an adjective). As a noun, oblong means “a rectangular object or flat figure with unequal adjacent sides.” Rulers are considered elongated items (oblongs). Laptops, tablets, and flat-screen TVs are oblongs of different sizes. A rectangle can be defined as oblong; however, not all elongated figures are rectangles. The same face, for example. Additionally, in mathematics, an oblong number is what in Russian is called a rectangular number (the product of two consecutive numbers. For example, 12). In general, it’s utterly baffling.

The word has been alive since the 15th century, by the way. So, in my book, it appears twice, and both times as nouns. In the first case, Nabokov translated it as “corner,” and in the second – “a small oblong of smooth silver” as “a little piece.”

Exploring Multilingual Vocabulary in Nabokov’s Works with Apple Books | March 15 2026, 23:20

Man, it’s really convenient. Just sitting here reading.

The usage pattern is as follows: I hold the phone in my hands. There, in apple books, this and that book. You see an unfamiliar word – it will likely be in the word list of the chapter. The definition takes into account the translation by Nabokov himself. Then you look a couple words ahead, put the phone down, continue reading. You encounter those words, and they are still in your short-term memory, and hooray, you understand. During a break, you load the next couple of words into your brain. You have to hold the phone and flip through, each page contains 4-5 definitions.

Now, every word has definitions in English (interpretation), French, and German. Consequently, I can publish four books.

Overall, my level of English matches what my app predicts about which words will be challenging. But someday I’ll need the same for French, and it will require an assessment of the difficulty level for each word because even some basic words will be unclear to me. I’m not sure that a book with basic words will be handy. With rare ones – definitely handy.

Crafting Nabokov’s Dictionary: A Multilingual Lexical Journey | March 15 2026, 18:30

I’m reading Nabokov and decided to take a break to create a convenient app “Nabokov’s Dictionary” and am considering selling it on Amazon as a book. Essentially, it looks like this (see screenshot) – definitions of complex words in English, Russian, German, and French, in the same order they appear in the original book.

Would you buy such a book?

To accurately make their definitions, I also wrote an aligner – a program that matches sentences and paragraphs in English with their translations (Nabokovian) into Russian. And when a word’s definition is created, it uses not only the knowledge of LLM but also the Russian translation by the author. It’s worth separately discussing how the algorithm works (I invented it myself because everything I found online did not work as I needed). It first finds long sentences and matches the longest sentences with their pair through cosine similarity of embedding vectors created through the multilingual e5 model. These sentences become anchors. Then, assuming that for long sentences the error is almost excluded, the longest sentence between anchors is found, and everything repeats recursively. There are many situations where a sentence in Russian has no equivalent in English and vice versa, where a sentence is split into two, or conversely two are merged into one. The algorithm handles this as best as it can. The result is quite a good quality of alignment. To such an extent, that errors in alignment can hardly be found (but they are likely still there). Either way, it is only needed for the context for translating words, even if there are rare errors, it’s not a big deal.

Would you buy such a book?

Modern Reading: More Words, Digital Shifts, and Surprising Data Insights from 2008 | December 14 2025, 22:33

An interesting study caught my eye, dating back to 2009. According to it, the modern human indeed reads significantly more than in the past, although the format of this reading has changed. The study suggests that in 2008, an average American consumed about 100,000 words a day (approximately a quarter of “War and Peace”) – this is an approximate number of words that passed through consciousness per day (via ears or eyes), calculated based on activity chronometry. This is 140% more than in 1980.

Therefore, contrary to the myth about the degradation of reading, at least in 2008, we processed 2.4 times more textual information than our parents’ generation. Moreover, the study only considered information consumed outside of work (at home, in transit, during leisure).

The structure of reading – if in 1960, 26% of words came from paper, by 2008 this share had fallen to 9%. However, digital media (internet, email, social networks) not only compensated for this decline but also tripled the total reading time. The reason — the internet, as it is predominantly a textual environment (web surfing, email).

But it’s interesting that although the Internet accounts for 25% of consumed words, it only makes up for 2% of bytes (since video on the internet in 2008 was of low quality). Thus, they estimated the information flow from different channels and converted it into bytes 🙂 Radio accounted for 19% of the time but only generated 0.3% of bytes (as audio requires less data). Voice communication (telephone) — accounted for only 5% of words and a negligible share of bytes, but it was the only fully interactive channel before the internet era. TV remained the main source of information by time in 2008 (41% of all hours) and quantity of words (45%), however, in terms of data volume (bytes), television was only second (35%), behind computer games.

Now about games, quite interesting. The main finding from the report: Games generated (or did in 2008) 55% of all “bytes” consumed by households. Meanwhile, they only accounted for 8% of user time. This is quite a controversial topic in their report.

Those 100,500 words — that’s an assessment of actual words that a person either read or heard. This is not a metaphorical “equivalent,” but an attempt to calculate the verbal information precisely. They took the consumption time of each media and multiplied it by the average word inflow rate for that channel. Reading (books, newspapers, internet texts): 240 words per minute. Email and web surfing – 240 words per minute. Television (dialogues in shows/movies): 153 words per minute. Radio: 80 words per minute (less because of many pauses and music). Music: 41 words per minute (song lyrics).

Link in the comments

Life Imitates Art: The Real Louvre Heist After “Lupin” | October 19 2025, 14:49

Just yesterday we watched the first episode of the series “Lupin”, in which Assane Diop, inspired by the adventures of the fictional thief Arsène Lupin from Maurice Leblanc’s novels, together with two accomplices, carefully plans and steals a necklace that once belonged to Queen Marie-Antoinette from the Louvre. The episode was very cool, I hope the whole series is too.

And today I see in the news that today criminals stole jewels of “immeasurable worth” from the section where the French royal regalia are kept in the Louvre, and then fled the crime scene on scooters. The thieves entered the Apollo Gallery, where the French crown jewels are housed, using a ladder and, presumably, small chainsaws. Nine items were stolen from the collection of Napoleon’s and the Empresses’ (Josephine’s and Eugenie’s) jewels, including a necklace, a brooch, and a tiara. The entire theft took seven minutes. At least one of the stolen items was found near the museum. It appears the robbers dropped it during their escape.

What a coincidence

Haunting Tales of Hotel Room 441 | October 16 2025, 12:27

I am starting to like my hotel. I am in room 446

Items we have read report that if you do stay at this hotel, avoid the fourth floor, or at least Room 441. That is where a lady from the other side lurks at the end of the bed, kicking the feet of guests who attempt to sleep there. And those guests are attempting to sleep there because they want to have the haunted experience. Don’t ask. Ever read Stephen King’s horror short story 1408”? The Congress Plaza is said to be its inspiration and a portion of the source that has brought Mr. King’s net worth to $500 million.”

Items we have read report that if you do stay at this hotel, avoid the fourth floor, or at least Room 441. That is where a lady from the other side lurks at the end of the bed, kicking the feet of guests who attempt to sleep there. And those guests are attempting to sleep there because they want to have the hauted experience. Don’t ask. Ever read Stephen King’s horror short story 1408”? The Congress Plaza is said to be its inspiration and a portion of the source that has brought Mr, King’s net worth to $500 million.”

From Vision to Bookshelf: Launching “Recommender Algorithms” | October 13 2025, 11:54

Finally, I have released a book! It is called Recommender Algorithms — it contains more than 50 recommendation algorithms with mathematical explanations, detailed descriptions, and code examples.

It all started early in the spring in Germany, when I attended the ACM conference and made the first sketches of the book’s structure, analyzing reports on the RecSys stream. And now, six months later, the book has been published.

Why did it appear? Because there is no single, accessible source either online or in print where the recommendation algorithms of various types and purposes are thoroughly examined. There are articles focused on narrow aspects, but to collect and systematize the developments — from fundamental to the most recent — until now, it seems, no one has managed to do it for some reason. Maybe no one needed to. Suddenly, I found I needed to. I don’t know if I succeeded, but I am eager for your feedback.

Available on Amazon and Barnes and Noble. There is a Russian automatic translation (surprisingly, but very decent), but I do not know how to sell it yet.

https://www.testmysearch.com/books/recommender-algorithms.html?FB

(This is not my only book, but today — just about this one.)