CLOSE
Original image
istock

The World’s Top 20 Languages—And The Words English Has Borrowed From Them

Original image
istock

English is known as a magpie language that picks up words from almost every other language and culture it comes in contact with, from Abenaki to Zulu. And although some languages have understandably widened the English vocabulary more than others, modern English dictionaries contain more of a geographical melting pot than ever before. 

Listed here—in order by number of native speakers—are the world’s top 20 languages (according to Ethnologue, a global catalog of the 7000 languages currently in use worldwide). Alongside each entry on the list are just some of the words which English has borrowed from it. 

1. CHINESE: 1197 million native speakers (MANDARIN: 848 million)

Linguistically speaking, Chinese is a “macrolanguage” that encompasses dozens of different forms and dialects that together have just short of 1.2 billion native speakers. By far the most widely spoken variety of Chinese, however, is Mandarin, with 848 million speakers alone—or roughly 70 percent of China’s entire population. According to the Oxford English Dictionary, Chinese words have been recorded in English since the mid-16th century, with the earliest examples including the likes of tai chi (1736), ginseng (1634), yin and yang (1671), kumquat (1699) and feng shui (1797). One of the earliest of all is lychee (1588). 

2. SPANISH: 399 million

One quarter of the world’s 399 million Spanish speakers live in Mexico, although other important Hispanophone countries include Colombia (41 million), Argentina (38.8 million), and Venezuela (26.3 million); there are almost as many native Spanish speakers in the United States (34.2 million) as there are in Spain (38.4 million). In English, Spanish loanwords are characterized by terms from weaponry and the military (guerrilla, flotilla, armada, machete), animal names (chinchilla, alligator, cockroach, iguana), and terms from food and drink (potato, banana, anchovy, vanilla).

3. ENGLISH: 335 million

According to Ethnologue, the English language’s 335 million native speakers include 225 million in the United States, 55 million in the United Kingdom, 19 million in Canada, 15 million in Australia, and just short of 4 million in New Zealand. But English is one of the world’s most widespread languages: mother-tongue speakers are recorded in 101 different countries and territories worldwide, 94 of which class it as an official language. Moreover, if the number of people who use English as a second language or lingua franca were included, the global total of English speakers would easily rise to over one billion. 

4. HINDI: 260 million

The world’s 260 million native Hindi speakers are mainly found in India and Nepal, while an estimated 120 million more people in India use Hindi as a second language. As with all Indian languages, a great many Hindi loanwords found in English were adopted during the British Raj in the 19th and early 20th centuries, but long before then the likes of rupee (1612), guru (1613), pilau (1609), pukka (1619), myna (1620) and juggernaut (1638) had already begun to appear in English texts. 

5. ARABIC: 242 million

Like Chinese, Arabic is technically another macrolanguage whose 242 million native speakers—spread across 60 different countries worldwide—use a range of different forms and varieties. The first Arabic loanwords in English date from the 14th century, although many of the earliest examples are fairly rare and obsolete words like alkanet (a type of dye, 1343) and hardun (an Egyptian agama lizard, 1398). Among the more familiar Arabic contributions to English are hashish (1598), sheikh (1577), and kebab (1698).

6. PORTUGUESE: 203 million

The population of Portugal is just under 11 million, but the global Lusophone population is boosted enormously by Brazil’s 187 million native speakers. Etymologically, Portuguese and Spanish loanwords are often tricky to differentiate because of the similarities between the two languages, but according to the OED, Portuguese is responsible for the likes of marmalade (1480), pagoda (1582), commando (1791), cuspidor (1779), and piranha (1710). 

7. BENGALI: 189 million

After Hindi, Bengali is the second most widely spoken language of India with just over 82 million native speakers. But the largest native Bengali population in the world is found in Bangladesh, where 106 million people use it as their first language. The number of Bengali words adopted into English, however, is relatively small, with only 47 instances—including jute (1746), almirah (a free-standing cupboard, 1788), and jampan (a type of sedan chair, 1828)—recorded in the OED. 

8. RUSSIAN: 166 million

One hundred and thirty-seven million of Russian’s 166 million native speakers live in the Russian Federation, with smaller populations in Ukraine (8.3 million), Belarus (6.6 million), Uzbekistan (4 million) and Kazakhstan (3.8 million). The earliest Russian loanwords began to appear in English in the 16th century, among them czar or tsar (1555), rouble (1557), and beluga (1591).

9. JAPANESE: 128 million

Japan’s 128 million people comprise the language’s entire native speaker population, enough to make it the ninth most widely spoken language in the world. Japanese words have been appearing in English texts since the 16th century, with some of the earliest loanwords including katana and wacadash (both types of samurai sword, 1613), miso (1615), shogun (1615), and sake (1687). 

10. LAHNDA: 88.7 million

Lahnda is the collective name given to a group of related Punjabi languages and dialects spoken predominantly in Pakistan. Punjabi words adopted into English are rare, but nevertheless include bhangra (a local traditional dance form and music style, 1965), and gurdwara (a Sikh temple, 1909). 

11. JAVANESE: 84.3 million

Java is the most populous island on Earth, home to almost two-thirds of the entire population of Indonesia. More than half of its 139 million inhabitants speak the local Javanese language, enough to earn it a spot just outside of the global top 10 here. The words batik (1880), gamelan (1816) and lahar (a volcanic mudflow, 1929) are all of Javanese origin. 

12. GERMAN: 78.1 million

Seventy million of the world’s 78 million native German speakers live in Germany, with the remaining 8 million found in the likes of Austria, Switzerland, Belgium and Luxembourg. As English itself is classed as a Germanic language, historically the two languages share a close relationship and ultimately many of the oldest English words could be argued to have German roots. More recent direct German loanwords, however, include sauerkraut (1633), pumpernickel (1738), doppelgänger (1851), and frankfurter (1894). 

13. KOREAN: 77.2 million

Korean loanwords in English are relatively rare, with none at all recorded by the OED before the 19th century. Among the most familiar are kimchi (1898) and taekwondo (1967), while rarer examples include kono (a traditional Korean board game, 1895), and kisaeng (the Korean equivalent of a Japanese geisha girl, 1895). 

14. FRENCH: 75.9 million

The world’s 75 million native French speakers are divided among 51 countries and territories, including 7.3 million in Canada, 4 million in Belgium, and 6 million in the Democratic Republic of the Congo (home to the second largest French-speaking population in the world). Thanks largely to the Norman Conquest, roughly three out of every 10 English words are thought to have French roots, and the trend has continued ever since: English has adopted more loanwords directly from French—absinthe, blancmange, concierge, dauphin, envoi, fête, gourmand, hollandaise, impasse—than from any other living language. 

15. AND 16. TELUGU: 74 MILLION AND MARATHI: 71.8 MILLION

Telugu and Marathi are India’s third and fourth most used languages, with just over 74 and just short of 72 million native speakers, respectively. Neither is responsible for a great many English loanwords, however, and the vast majority of those that have found their way into the language tend to be fairly rare and unfamiliar, like desai (a revenue office or a petty thief, from Marathi, 1698), chawl (an Indian lodging house, from Marathi, 1891), and podu (an area of jungle cleared for farming, from Telugu 1938). By far the most well known is bandicoot, which is thought to literally mean “pig-rat” in Telugu. 

17. TURKISH: 70.9 million

Sixty-six million of the world’s 70 million Turkish speakers are in Turkey, with smaller populations found in Greece, Bulgaria, Romania, Cyprus, and Kazakhstan. Turkish words in English date back to the 16th century, with vizier (1562), tulip (1578) and caftan (1591) being among the earliest to arrive.

18. TAMIL: 68.8 million

Tamil is India’s fifth most spoken language, as well as being one of the official languages of Sri Lanka and Singapore. Catamaran (1697), pariah (1613), poppadum (1820) and patchouli (1843) are all Tamil words, as is curry (1598). 

19. VIETNAMESE: 67.8 million

The OED records just 14 Vietnamese loanwords in English, the earliest of which is the name of the Vietnamese currency, dông (1824). Among the handful of others is pho (a traditional Vietnamese soup, 1935), ao dai (a woman’s high-necked tunic, 1961), and both hao and xu (1968), the names for one-tenth and one-hundredth of a dông, respectively. 

20. URDU: 64 million

Urdu is the sixth Indian language to make the global top 20, with its worldwide total comprised of 51 million native Indian speakers, a further 10 million in Pakistan, and smaller populations in Nepal and Mauritius. Urdu words have been adopted into English since the fifteenth century, with surprisingly early examples including mogul (1577), cummerbund (1613), and bungalow (1676). Earliest of all, however, is shrab—an old Anglo-Indian nickname for an alcoholic beverage, the first record of which in English dates from 1477. 

Original image
EITAN ABRAMOVICH/AFP/Getty Images
arrow
Words
23 Slang Terms You Only Understand if You Work in Antarctica
Original image
EITAN ABRAMOVICH/AFP/Getty Images

Thanks to extreme conditions, a small research population, close quarters, and the unique experience of life there, Antarctica has developed a lingo all of its own. Yes, even freezing, remote Antarctica has slang. Here is a sample of some, er, cooler terms, which come from the many English-speaking nationalities, from Canada to New Zealand, that have stepped foot on its ice.

1. BIG EYE

In winter, Antarctica is covered in perpetual darkness; in summer, sunlight. The continent can certainly put a wrench in one’s circadian rhythms, as this slang for light-related insomnia makes plain.

2. TOASTY

Antarctica’s climate also puts a wrench in one’s mental faculties. Crew stationed there often experience a loss of words, forgetfulness, irascibility, and “brain fog” brought on by the dark, cold, and altitude. Toasty is also used for other general misdemeanors committed around the camp.

3. ICE SHOCK

Antarctica’s shell shock. As one Antarctica-based worker blogged about it, ice shock is “when you get back to the rest of the world and realize that no matter how insane Antarctica is, the real world is FAR nuttier, and that you can no longer function in it.”

4. GREENOUT

A riff on whiteout. As The Antarctic Dictionary defines it, greenout is “the overwhelming sensation induced by seeing and smelling trees and other plants spending some time in antarctic regions.”

5. THE ICE

Speaking of the ice, this is how Antarcticans refer to the whole ice-covered continent.

6. CHEECH

Not the counterpart of Chong, but a play on consonant clusters in the name of the place from which many researchers jump off to Antarctica: Christchurch, New Zealand.

7. MACTOWN

McMurdo Station, the U.S. research hub and largest Antarctic community, which can host around 1250 residents in summer.

8. CITY MICE

These are personnel who work at the main research stations.

9. COUNTRY MICE

These are crew who move among different camps on the continent.

10. ICE-HUSBAND/ICE-WIFE

When the cat's away, the mice will play. One’s ice-husband or ice-wife is like a fling for crew down in Antarctica for the season.

11. ICE-WIDOW/ICE-WIDOWER

Meanwhile, one’s spouse or significant other is stuck all alone back home as their loved one is working at the South Pole.

12. FINGY

This pejorative term for a newbie apparently derives from “f—king new guy,” or FNG.

13. BEAKER

An epithet for “scientist.” Some specialist personnel also have nicknames, like fuelie (responsible for fueling various equipment) and wastie (who deal with refuse).

14. WINTER-OVER

When crew, bravely, stay in Antarctica over the entire brutal winter.

15. TURDSICLE

It gets cold down at the southern end of the world. The average—yes, average—temperature is -52ºF. The excrement freezeth, shall we say.

16. SNOTSICLE

So too do boogers freeze in this blend of snot and icicle.

17. DEGOMBLE

“To disencumber of snow,” as The Antarctic Dictionary explains, especially before coming back inside shelter. The origin of gomble is obscure, possibly a term for little balls of snow stuck to the fur of sled dogs.

18. SKUA

Named for the predatory, scavenging skua birds found in Antarctica, a skua pile or bin is a sort of rummage bin. Crew can leave and pick over unwanted items there. Also used as a verb.

19. OFFENSIVE POTATOES

British speakers apparently did not take a liking to canned potatoes they had to eat ...

20. SAWDUST

... nor the dried cabbage.

21. FRESHIES

Shipments of these fresh fruits and vegetables are quite welcome to the cuisine-deprived Antarctica researchers and personnel.

22. POPPY

Alcohol served over Antarctica ice, which makes a pop sound as it releases the gas long pressurized into it.

23. CARROTS

Not that much of the food sounds terribly edible, if slang is any measure, but these carrots aren’t to be munched on. They refer to ice cores, ‘uprooted’ samples whose cylindrical shape resemble the vegetable.

This slang is only the tip of the, um, iceberg. For more, see Bernadette Hince’s The Antarctica Dictionary, the Cool Antarctica website, and The Allusionist podcast, which has explored linguistic life on the ice in its episode, “Getting Toasty.”

Original image
Rebecca O'Connell
arrow
Words
What's the Longest Word in the World? Here are 12 of Them, By Category
Original image
Rebecca O'Connell

Antidisestablishmentarianism, everyone’s favorite agglutinative, entered the pop culture lexicon on August 17, 1955, when Gloria Lockerman, a 12-year-old girl from Baltimore, correctly spelled it on The $64,000 Question as millions of people watched from their living rooms. At 28 letters, the word—which is defined as a 19th-century British political movement that opposes proposals for the disestablishment of the Church of England—is still regarded as the longest non-medical, non-coined, nontechnical word in the English language, yet it keeps some robust company. Here are some examples of the longest words by category.

1. METHIONYLTHREONYLTHREONYGLUTAMINYLARGINYL … ISOLEUCINE 

Note the ellipses. All told, the full chemical name for the human protein titin is 189,819 letters, and takes about three-and-a-half hours to pronounce. The problem with including chemical names is that there’s essentially no limit to how long they can be. For example, naming a single strand of DNA, with its millions and millions of repeating base pairs, could eventually tab out at well over 1 billion letters.

2. LOPADOTEMACHOSELACHOGALEOKRANIOLEIPSAN …P TERYGON

The longest word ever to appear in literature comes from Aristophanes’ play, Assemblywomen, published in 391 BC. The Greek word tallies 171 letters, but translates to 183 in English. This mouthful refers to a fictional fricassee comprised of rotted dogfish head, wrasse, wood pigeon, and the roasted head of a dabchick, among other culinary morsels. 

3. PNEUMONOULTRAMICROSCOPICSILICOVOLCANOCONIOSIS

At 45 letters, this is the longest word you’ll find in a major dictionary. An inflated version of silicosis, this is the full scientific name for a disease that causes inflammation in the lungs owing to the inhalation of very fine silica dust. Despite its inclusion in the dictionary, it’s generally considered superfluous, having been coined simply to claim the title of the longest English word.

4. PARASTRATIOSPHECOMYIA STRATIOSPHECOMYIOIDES 

The longest accepted binomial construction, at 42 letters, is a species of soldier fly native to Thailand. With a lifespan of five to eight days, it’s unlikely one has ever survived long enough to hear it pronounced correctly.

5. PSEUDOPSEUDOHYPOPARATHYROIDISM

This 30-letter thyroid disorder is the longest non-coined word to appear in a major dictionary.

6. FLOCCINAUCINIHILIPILIFICATION

By virtue of having one more letter than antidisestablishmentarianism, this is the longest non-technical English word. A mash-up of five Latin roots, it refers to the act of describing something as having little or no value. While it made the cut in the Oxford English Dictionary, Merriam-Webster volumes refuse to recognize it, chalking up its existence to little more than linguistic ephemera.

7. SUBDERMATOGLYPHIC

At 17 characters, this is the longest accepted isogram, a word in which every letter is used only once, and refers to the underlying dermal matrix that determines the pattern formed by the whorls, arches, and ridges of our fingerprints. 

8. SQUIRRELLED

Though the more commonly accepted American English version carries only one L, both Oxford and Merriam-Webster dictionaries recognize this alternate spelling and condone its one syllable pronunciation (think “world”), making it the longest non-coined monosyllabic English word at 11 letters.

9. ABSTENTIOUS

One who doesn’t indulge in excesses, especially food and drink; at 11 letters this is the longest word to use all five vowels in order exactly once.

10. ROTAVATOR 

A type of soil tiller, the longest non-coined palindromic word included in an English dictionary tallies nine letters. Detartrated, 11 letters, appears in some chemical glossaries, but is generally considered too arcane to qualify.

11. and 12. CWTCH, EUOUAE

The longest words to appear in a major dictionary comprised entirely of either vowels or consonants. A Cwtch, or crwth, is from the Welsh word for a hiding place. Euouae, a medieval musical term, is technically a mnemonic, but has been accepted as a word in itself. 

SECTIONS

More from mental floss studios