Chữ Nôm

Chữ Nôm
𡨸喃
Script type
Time period
13th century – 20th century
DirectionTop-to-bottom, columns from right to left (traditional)
Left-to-right (modern)
LanguagesVietnamese
Related scripts
Parent systems
Child systems
Nom Tay
Sister systems
Sawndip
 This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and ⟨⟩, see IPA § Brackets and transcription delimiters.

Chữ Nôm (𡨸喃, IPA: [t͡ɕɨ˦ˀ˥ nom˧˧]) is a logographic writing system formerly used to write the Vietnamese language. It uses Chinese characters (chữ Hán) to represent Sino-Vietnamese vocabulary and some native Vietnamese words, with other words represented by new characters created using a variety of methods, including phono-semantic compounds. This composite script was therefore highly complex and was accessible to less than five percent of the Vietnamese population who had mastered written Chinese.

Although formal writing in Vietnam was done in classical Chinese until the early 20th century (except for two brief interludes), chữ Nôm was widely used between the 15th and 19th centuries by the Vietnamese cultured elite for popular works in the vernacular, many in verse. One of the best-known pieces of Vietnamese literature, The Tale of Kiều, was written in chữ Nôm by Nguyễn Du.

The Vietnamese alphabet created by Portuguese Jesuit missionaries, with the earliest known usage occurring in the 17th century, replaced chữ Nôm as the preferred way to record Vietnamese literature from the 1920s. While Chinese characters are still used for decorative, historic and ceremonial value, chữ Nôm has fallen out of mainstream use in modern Vietnam. In the 21st century, chữ Nôm is being used in Vietnam for historical and liturgical purposes. The Institute of Hán-Nôm Studies at Hanoi is the main research centre for pre-modern texts from Vietnam, both Chinese-language texts written in Chinese characters (chữ Hán) and Vietnamese-language texts in chữ Nôm.

Etymology

The Vietnamese word chữ 'character' is derived from the Middle Chinese word dziH , meaning '[Chinese] character'. The word Nôm 'Southern' is derived from the Middle Chinese word nom , meaning 'south'. It could also be based on the dialectal pronunciation from the South Central dialects (most notably in the name of province of Quảng Nam, known locally as Quảng Nôm).

There are many ways to write the name chữ Nôm in chữ Nôm characters. The word chữ may be written as , 𫳘(⿰字宁), 𡨸, 𫿰(⿰字文), 𡦂(⿰字字), 𲂯(⿰貝字), 𱚂(⿱字渚), or , while Nôm is written as .

Terminology

Quốc ngữ (國語) was used historically to refer to chữ Nôm. Such as in the book, Đại Nam quốc ngữ (大南國語), a Literary Chinese – Vietnamese (chữ Nôm) dictionary.

Chữ Nôm is the logographic writing system of the Vietnamese language. It is based on the Chinese writing system but adds a large number of new characters to make it fit the Vietnamese language. Common historical terms for chữ Nôm were Quốc Âm (國音, 'national sound') and Quốc ngữ (國語, 'national language').

In Vietnamese, Chinese characters are called chữ Hán (𡨸 'Han characters'), chữ Nho (𡨸儒 'Confucian characters', due to the connection with Confucianism) and uncommonly as Hán tự (漢字 'Han characters'). Hán văn (漢文) refers literature written in Literary Chinese.

The term Hán Nôm ( 'Han and chữ Nôm characters') in Vietnamese designates the whole body of premodern written materials from Vietnam, either written in Chinese (chữ Hán) or in Vietnamese (chữ Nôm). Hán and Nôm could also be found in the same document side by side, for example, in the case of translations of books on Chinese medicine. The Buddhist history Cổ Châu Pháp Vân phật bản hạnh ngữ lục (1752) gives the story of early Buddhism in Vietnam both in Hán script and in a parallel Nôm translation. The Jesuit Girolamo Maiorica (1605–1656) had also used parallel Hán and Nôm texts.

The term chữ Quốc ngữ (𡨸 'national language script') refers to the Vietnamese alphabet in current use, but was used to refer to chữ Nôm before the Vietnamese alphabet was widely used.

History

A page from Tự Đức thánh chế tự học giải nghĩa ca (嗣德聖製字學解義歌), a 19th-century primer for teaching Vietnamese children Chinese characters. The work is attributed to Emperor Tự Đức, the fourth emperor of the Nguyễn dynasty. In this primer, chữ Nôm is used to gloss the Chinese characters, for example, 𡗶 is used to gloss .

Chinese characters were introduced to Vietnam after the Han dynasty conquered Nanyue in 111 BC. Independence was achieved after the Battle of Bạch Đằng in 938, but Literary Chinese was adopted for official purposes in 1010. For most of the period up to the early 20th century, formal writing was indistinguishable from contemporaneous classical Chinese works produced in China, Korea, and Japan.

Vietnamese scholars were thus intimately familiar with Chinese writing. In order to record their native language, they applied the structural principles of Chinese characters to develop chữ Nôm. The new script was mostly used to record folk songs and for other popular literature. Vietnamese written in chữ Nôm briefly replaced Chinese for official purposes under the Hồ dynasty (1400–1407) and under the Tây Sơn (1778–1802), but in both cases this was swiftly reversed.

Early development

The use of Chinese characters to transcribe the Vietnamese language can be traced to an inscription with the two characters "", as part of the posthumous title of Phùng Hưng, a national hero who succeeded in briefly expelling the Chinese in the late 8th century. The two characters have literal Chinese meanings 'cloth' and 'cover', which make no sense in this context. They have thus been interpreted as a phonetic transcription, via their Middle Chinese pronunciations buH kajH, of a Vietnamese phrase, either vua cái 'great king', or bố cái 'father and mother' (of the people).

After Vietnam established its independence from China in the 10th century, Đinh Bộ Lĩnh (r. 968–979), the founder of the Đinh dynasty, named the country Đại Cồ Việt . The first and third Chinese characters mean 'great' and 'Viet'. The second character was often used to transcribe non-Chinese terms and names phonetically. In this context, cồ is an obsolete Vietnamese word for 'big'.

The oldest surviving Nom inscription, dating from 1210, is a list naming 21 people and villages on a stele at the Tự Già Báo Ân pagoda in Tháp Miếu village (Mê Linh District, Hanoi). Another stele at Hộ Thành Sơn in Ninh Bình Province (1343) lists 20 villages.

Trần Nhân Tông (r. 1278–1293) ordered that Nôm be used to communicate his proclamations to the people. The first literary writing in Vietnamese is said to have been an incantation in verse composed in 1282 by the Minister of Justice Nguyễn Thuyên and thrown into the Red River to expel a menacing crocodile. Four poems written in Nom from the Tran dynasty, two by Trần Nhân Tông and one each by Huyền Quang and Mạc Đĩnh Chi, were collected and published in 1805.

The fourth page of Phật thuyết đại báo phụ mẫu ân trọng kinh (佛說大報父母恩重經) which shows text in Literary Chinese alongside an earlier form of chữ Nôm representing Old Vietnamese pronunciation. Some pairs of characters are used to represent the consonant clusters that were present in Old Vietnamese.

The Nôm text Phật thuyết Đại báo phụ mẫu ân trọng kinh ('Sūtra explained by the Buddha on the Great Repayment of the Heavy Debt to Parents') was printed around 1730, but conspicuously avoids the character lợi, suggesting that it was written (or copied) during the reign of Lê Lợi (1428–1433). Based on archaic features of the text compared with the Tran dynasty poems, including an exceptional number of words with initial consonant clusters written with pairs of characters, some scholars suggest that it is a copy of an earlier original, perhaps as early as the 12th century.

Hồ dynasty (1400–07) and Ming conquest (1407–27)

During the seven years of the Hồ dynasty (1400–07) Classical Chinese was discouraged in favor of vernacular Vietnamese written in Nôm, which became the official script. The emperor Hồ Quý Ly even ordered the translation of the Book of Documents into Nôm and pushed for reinterpretation of Confucian thoughts in his book Minh đạo. These efforts were reversed with the fall of the Hồ and Chinese conquest of 1407, lasting twenty years, during which use of the vernacular language and demotic script were suppressed.

During the Ming dynasty occupation of Vietnam, chữ Nôm printing blocks, texts and inscriptions were thoroughly destroyed; as a result the earliest surviving texts of chữ Nôm post-date the occupation.

15th to 19th century

A page from the bilingual dictionary Nhật dụng thường đàm (日用常談; 1851). Characters representing words in Hán (Chinese) are explained in Nôm (Vietnamese).

Among the earlier works in Nôm of this era are the writings of Nguyễn Trãi (1380–1442). The corpus of Nôm writings grew over time as did more scholarly compilations of the script itself. Trịnh Thị Ngọc Trúc [vi], consort of King Lê Thần Tông, is generally given credit for Chỉ nam ngọc âm giải nghĩa [vi] (指南玉音解義; 'guide to Southern Jade sounds: explanations and meanings'), a 24,000-character bilingual Hán-to-Nôm dictionary compiled between the 15th and 18th centuries, most likely in 1641 or 1761.

While almost all official writings and documents continued to be written in classical Chinese until the early 20th century, Nôm was the preferred script for literary compositions of the cultural elites. Nôm reached its golden period with the Nguyễn dynasty in the 19th century as it became a vehicle for diverse genres, from novels to theatrical pieces, and instructional manuals. Although it was prohibited during the reign of Minh Mạng (1820–1840), apogees of Vietnamese literature emerged with Nguyễn Du's The Tale of Kiều and Hồ Xuân Hương's poetry. Although literacy in premodern Vietnam was limited to just 3 to 5 percent of the population, nearly every village had someone who could read Nôm aloud for the benefit of other villagers. Thus these Nôm works circulated orally in the villages, making it accessible even to the illiterates.

Chữ Nôm was the dominant script in Vietnamese Catholic literature until the late 19th century. In 1838, Jean-Louis Taberd compiled a Nôm dictionary, helping with the standardization of the script.

The reformist Catholic scholar Nguyễn Trường Tộ presented the Emperor Tự Đức with a series of unsuccessful petitions (written in classical Chinese, like all court documents) proposing reforms in several areas of government and society. His petition Tế cấp bát điều (濟急八條 'Eight urgent matters', 1867), includes proposals on education, including a section entitled Xin khoan dung quốc âm ('Please tolerate the national voice'). He proposed to replace classical Chinese with Vietnamese written using a script based on Chinese characters that he called Quốc âm Hán tự (國音漢字 'Han characters with national pronunciations'), though he described this as a new creation, and did not mention chữ Nôm.

French Indochina and the Latin alphabet

From the latter half of the 19th century onwards, the French colonial authorities discouraged or simply banned the use of classical Chinese, and promoted the use of the Vietnamese alphabet, which they viewed as a stepping stone toward learning French. Language reform movements in other Asian nations stimulated Vietnamese interest in the subject. Following the Russo-Japanese War of 1905, Japan was increasingly cited as a model for modernization. The Confucian education system was compared unfavourably to the Japanese system of public education. According to a polemic by writer Phan Châu Trinh, "so-called Confucian scholars" lacked knowledge of the modern world, as well as real understanding of Han literature. Their degrees showed only that they had learned how to write characters, he claimed.

The popularity of Hanoi's short-lived Tonkin Free School suggested that broad reform was possible. In 1910, the colonial school system adopted a "Franco-Vietnamese curriculum", which emphasized French and alphabetic Vietnamese. The teaching of Chinese characters was discontinued in 1917. On December 28, 1918, Emperor Khải Định declared that the traditional writing system no longer had official status. The traditional Civil Service Examination, which emphasized the command of classical Chinese, was dismantled in 1915 in Tonkin and was given for the last time at the imperial capital of Huế on January 4, 1919. The examination system, and the education system based on it, had been in effect for almost 900 years.

The decline of the Chinese script also led to the decline of chữ Nôm given that Nôm and Chinese characters are so intimately connected. After the First World War, chữ Nôm gradually died out as the Vietnamese alphabet grew more and popular. In an article published in 1935 (based on a lecture given in 1925), Georges Cordier estimated that 70% of literate persons knew the alphabet, 20% knew chữ Nôm and 10% knew Chinese characters. However, estimates of the rate of literacy in the late 1930s range from 5% to 20%. By 1953, literacy (using the alphabet) had risen to 70%.

The Gin people, descendants of 16th-century migrants from Vietnam to islands off Dongxing in southern China, now speak a form of Yue Chinese and Vietnamese, but their priests use songbooks and scriptures written in chữ Nôm in their ceremonies.

Texts

A page from The Tale of Kieu by Nguyễn Du. This novel was first published in 1820 and is the best-known work in Nôm. The edition shown was printed in the late 19th century.

Types of texts

  • Giải âm (解音) - a category of chữ Nôm texts that translates the "sounds" (word-for-word) of the original Literary Chinese text. Examples include Tam thiên tự giải âm (三千字解音), etc. Often these translations attempt to match the word order as the original Literary Chinese text with no regard for Vietnamese syntax). There is significant diversity within works called giải âm (even within a single text), with some coming fairly close to the modern concept of translation, while some are closer to annotations with "word-matching" between Literary Chinese and Vietnamese, with little to no regard to the structure or syntax of Vietnamese.

Here is a line in Tam tự kinh lục bát diễn âm (三字經六八演音), a Vietnamese translation of the Three Character Classic. It features the original text on the top of the page and the Vietnamese translation on the bottom.

人不𭓇不知理 (Nhân bất học bất tri lý)

𠊚空𭓇別𨤰夷麻推 (Người không học biết nhẽ gì mà suy)

Without learning, one does not understand reason.

  • Diễn âm (演音) - synonym of giải âm (解音).
  • Giải nghĩa (解義) - a category of chữ Nôm texts that translates the "meaning", often having no regard for Literary Chinese syntax. Examples include Chỉ nam ngọc âm giải nghĩa (指南玉音解義). Some giải nghĩa or diễn nghĩa works focus on extracting as much meaning as possible from the source texts, with little to no regard to the structure and syntax of Literary Chinese.
  • Diễn nghĩa (演義) - synonym of giải nghĩa (解義).
The second page of Tam tự kinh lục bát diễn âm (三字經六八演音) with the original Literary Chinese text on the top, and Vietnamese translation on the bottom. The Vietnamese text is written in chữ Nôm and lục bát (六八) verse form.

Characters

Vietnamese is a tonal language, like Chinese, and has nearly 5,000 distinct syllables. In chữ Nôm, each monosyllabic word of Vietnamese was represented by a character, either borrowed from Chinese or locally created. The resulting system was even more difficult to use than the Chinese script.

As an analytic language, Vietnamese was a better fit for a character-based script than Japanese and Korean, with their agglutinative morphology. Partly for this reason, there was no development of a phonetic system that could be taught to the general public, like Japanese kana syllabary or the Korean hangul alphabet. Moreover, most Vietnamese literati viewed Chinese as the proper medium of civilized writing, and had no interest in turning Nôm into a form of writing suitable for mass communication.

Variant characters

Chữ Nôm has never been standardized. As a result, a Vietnamese word could be represented by several Nôm characters. For example, the very word chữ ('character', 'script'), a Chinese loanword, can be written as either (Chinese character), 𡦂 (Vietnamese-only compound-semantic character) or 𡨸 (Vietnamese-only semantic-phonetic character). For another example, the word giữa ('middle'; 'in between') can be written either as 𡨌 (⿰守中) or 𫡉 (⿰字中). Both characters were invented for Vietnamese and have a semantic-phonetic structure, the difference being the phonetic indicator ( vs. ).

Another example of a Vietnamese word that is represented by several Nôm characters is the word for moon, trăng. It can be represented by a Chinese character that is phonetically similar to trăng, (lăng), a chữ Nôm character, 𢁋 (⿱巴陵) which is composed of two phonetic components (ba) and (lăng) for the Middle Vietnamese blăng, or a chữ Nôm character, 𦝄 (⿰月夌) composed of a phonetic component (lăng) and a semantic component meaning ('moon').

Borrowed characters

Characters for cân (top) and khăn (bottom), meaning turban/towel, in Tự Đức thánh chế tự học giải nghĩa ca. The character for khăn has a diacritic to indicate different pronunciation.

Unmodified Chinese characters were used in chữ Nôm in three different ways.

  • A large proportion of Vietnamese vocabulary had been borrowed from Chinese from the Tang period. Such Sino-Vietnamese vocabulary could be written with the original Chinese character for each word, for example:
    • dịch ('service', 'corvée'), from Early Middle Chinese (EMC) /jwiajk/
    • bản ('root', 'foundation'), from EMC /pənˀ/
    • đầu ('head'), from EMC /dəw/
  • One way to represent a native Vietnamese word was to use a Chinese character for a Chinese word with a similar meaning. For example, may also represent vốn ('capital, funds'). In this case, the word vốn is actually an earlier Chinese loan that has become accepted as Vietnamese; William Hannas claims that all such readings are similar early loans.
  • Alternatively, a native Vietnamese word could be written using a Chinese character for a Chinese word with a similar sound, regardless of the meaning of the Chinese word. For example, (Early Middle Chinese /mət/) may represent the Vietnamese word một ('one').

The first two categories are similar to the on and kun readings of Japanese kanji respectively. The third is similar to ateji, in which characters are used only for their sound value, or the Man'yōgana script that became the origin of hiragana and katakana.

When a character would have two readings, a diacritic may be added to the character to indicate the "indigenous" reading. The two most common alternate reading diacritical marks are (𖿰), (a variant form of ) and nháy (𖿱). Thus when is meant to be read as vốn, it is written as 𖿱, with a diacritic at the upper right corner.

Here is the character tự 自 is written with the nháy mark (𖿱) showing to use its alternative reading từ or vice versa. (Depending on the context, the alternative reading could also be tợ instead)
An example of what tháu đấm would be with the character quốc () which was written as with two dots on its left and right side.

Other alternate reading diacritical marks include tháu đấm (草𢶸) where a character is represented by a simplified variant with two points on either side of the character.

Usage of diacritical mark nháy (𖿱) can be seen here. The character yết (揭) with the mark ' 𖿱 ' changes the reading to xiết 𖿱 (Middle left). For lâm (林) with the mark ' 𖿱 ' changes the reading to chấm 𖿱 (Middle right).

Locally invented characters

The Nôm character for người (𠊛), a term for people or humans (in general). The radical on the left suggests that the pronunciation of the character is linked to that of ngại. The radical on the right suggests that the meaning of the character is linked to people. (The character after is the word Việt 越 meaning "Vietnamese"). The two characters mean "Vietnamese people".

In contrast to the few hundred Japanese kokuji (国字) and handful of Korean gukja (국자, 國字), which are mostly rarely used characters for indigenous natural phenomena, Vietnamese scribes created thousands of new characters, used throughout the language.

As in the Chinese writing system, the most common kind of invented character in Nôm is the phono-semantic compound, made by combining two characters or components, one suggesting the word's meaning and the other its approximate sound. For example,

  • 𠀧 (ba 'three') is composed of the phonetic part (Sino-Vietnamese reading: ba) and the semantic part 'three'. 'Father' is also ba, but written as (⿱), while 'turtle' is con ba ba 𡥵.
  • (mẹ 'mother') has 'woman' as semantic component and (Sino-Vietnamese reading: mỹ) as phonetic component.

A smaller group consists of semantic compound characters, which are composed of two Chinese characters representing words of similar meaning. For example, 𡗶 (giời or trời 'sky', 'heaven') is composed of ('sky') and ('upper').

A few characters were obtained by modifying Chinese characters related either semantically or phonetically to the word to be represented. For example,

  • the Nôm character 𧘇 (ấy 'that', 'those') is a simplified form of the Chinese character (Sino-Vietnamese reading: ý).
  • the Nôm character 𫜵 (làm 'work', 'labour') is a simplified form of the Chinese character (Sino-Vietnamese reading: lạm) ( > 𪵯 > 𫜵).
  • the Nôm character 𠬠 (một 'one') comes from the right part of the Chinese character (Sino-Vietnamese reading: một).

Example

As an example of the way chữ Nôm was used to record Vietnamese, the first two lines of the Tale of Kiều (1871 edition), written in the traditional six-eight form of Vietnamese verse, consist of 14 characters:

𤾓

Trăm

hundred

𢆥

năm

year

𥪞

trong

in

𡎝

cõi

world

𠊛

người

person

ta,

our

𤾓 𢆥 𥪞 𡎝 𠊛 些

Trăm năm trong cõi người ta,

hundred year in world person our

A hundred years—in this life span on earth,

𡨸

Chữ

word

tài

talent

𡨸

chữ

word

mệnh

destiny

khéo

clever

𱺵

to be

ghét

hate

nhau.

each other

𡨸 才 𡨸 命 窖 𱺵 恄 饒

Chữ tài chữ mệnh khéo là ghét nhau.

word talent word destiny clever {to be} hate {each other}

talent and destiny are apt to feud.

Derivations of Nôm characters in the first two lines
character word gloss derivation
𤾓 (⿱百林) trăm hundred compound of 'hundred' and lâm
𢆥 (⿰南年) năm year compound of nam and 'year'
𥪞 (⿺竜內) trong in compound of long and 'inside'
𡎝 (⿰土癸) cõi world compound of 'earth' and quý
𠊛 (⿰㝵人) người person compound of ngại and 'person'
ta our character of homophone Sino-Vietnamese ta 'little, few; rather, somewhat'
𡨸 (⿰宁字) chữ word compound of trữ and 'character; word'
tài talent Sino-Vietnamese word
𡨸 (⿰宁字) chữ word compound of trữ and 'character; word'
mệnh destiny Sino-Vietnamese word
khéo clever variant character of the near-homophone Sino-Vietnamese khiếu 'hole', Sino-Vietnamese reading of is giáo
𱺵 (⿱罒𪜀) to be simplified form of 'to be', using the character of near-homophone Sino-Vietnamese la 'net for catching birds'
ghét hate compound of 'heart' classifier and cát
nhau each other character of near-homophone Sino-Vietnamese nhiêu 'bountiful, abundant, plentiful'
Chữ Hán characters compared to chữ Nôm characters.

Computer encoding

In 1993, the Vietnamese government released an 8-bit coding standard for alphabetic Vietnamese (TCVN 5712:1993, or VSCII), as well as a 16-bit standard for Nôm (TCVN 5773:1993). This group of glyphs is referred to as "V0." In 1994, the Ideographic Rapporteur Group agreed to include Nôm characters as part of Unicode. A revised standard, TCVN 6909:2001, defines 9,299 glyphs. About half of these glyphs are specific to Vietnam. Nôm characters not already encoded were added to CJK Unified Ideographs Extension B. (These characters have five-digit hexadecimal code points. The characters that were encoded earlier have four-digit hex.)

Code Characters Unicode block Standard Date V Source Sources
V0 2,246 Basic Block (593), A (138), B (1,515) TCVN 5773:1993 2001 V0-3021 to V0-4927 5
V1 3,311 Basic Block (3,110), C (1) TCVN 6056:1995 1999 V1-4A21 to V1-6D35 2, 5
V2 3,205 Basic Block (763), A (151), B (2,291) VHN 01:1998 2001 V2-6E21 to V2-9171 2, 5
V3 535 Basic Block (91), A (19), B (425) VHN 02:1998 2001 V3-3021 to V3-3644 Manuscripts
V4 785 (encoded) Extension C Defined as sources 1, 3, and 6 2009 V4-4021 to V4-4B2F 1, 3, 6
V04 1,028 Extension E Unencoded V4 and V6 characters Projected V04-4022 to V04-583E V4: 1, 3, 6;
V6: 4, manuscripts
V5 ~900 Proposed in 2001, but already coded 2001 None 2, 5
Sources: Nguyễn Quang Hồng, "Unibook Character Browser", Unicode, Inc., "Code Charts – CJK Ext. E" (N4358-A).

Characters were extracted from the following sources:

  1. Hoàng Triều Ân, Tự điển chữ Nôm Tày [Nôm of the Tay People], 2003.
  2. Institute of Linguistics, Bảng tra chữ Nôm [Nôm Index], Hanoi, 1976.
  3. Nguyễn Quang Hồng, editor, Tự điển chữ Nôm [Nôm Dictionary], 2006.
  4. Father Trần Văn Kiệm, Giúp đọc Nôm và Hán Việt [Help with Nôm and Sino-Vietnamese], 2004.
  5. Vũ Văn Kính & Nguyễn Quang Xỷ, Tự điển chữ Nôm [Nôm Dictionary], Saigon, 1971.
  6. Vũ Văn Kính, Bảng tra chữ Nôm miền Nam [Table of Nôm in the South], 1994.
  7. Vũ Văn Kính, Bảng tra chữ Nôm sau thế kỷ XVII [Table of Nôm After the 17th Century], 1994.
  8. Vũ Văn Kính, Đại tự điển chữ Nôm [Great Nôm Dictionary], 1999.
  9. Nguyễn Văn Huyên, Góp phần nghiên cứu văn hoá Việt Nam [Contributions to the Study of Vietnamese Culture], 1995.

The V2, V3, and V4 proposals were developed by a group at the Han-Nom Research Institute led by Nguyễn Quang Hồng. V4, developed in 2001, includes over 400 ideograms formerly used by the Tày people of northern Vietnam. This allows the Tày language to get its own registration code. V5 is a set of about 900 characters proposed in 2001. As these characters were already part of Unicode, the IRG concluded that they could not be edited and no Vietnamese code was added. (This is despite the fact that national codes were added retroactively for version 3.0 in 1999.) The Nôm Na Group, led by Ngô Thanh Nhàn, published a set of nearly 20,000 Nôm characters in 2005. This set includes both the characters proposed earlier and a large group of additional characters referred to as "V6". These are mainly Han characters from Trần Văn Kiệm's dictionary which were already assigned code points. Character readings were determined manually by Hồng's group, while Nhàn's group developed software for this purpose. The work of the two groups was integrated and published in 2008 as the Hán Nôm Coded Character Repertoire.

Character Composition Nôm reading Sino-Vietnamese reading Meaning Code point V Source Other sources
ba (slightly formal) and U+5427 V0-3122 G0,J,KP,K,T
𠂉 thương thương wound, injury, to love non-romantically U+50B7 V1-4C22 G1,J,KP,K,T
𠊛 người N/A people U+2029B V2-6E4F None
suông song plain, bland U+391D V3-313D G3,KP,K,T
𫋙 càng N/A claw, pincer U+2B2D9 V4-536F None
𫡯 chàu N/A wealth U+2B86F V4-405E None
Key: G0 = China (GB 2312); G1 = China (GB 12345); G3 = China (GB 7589); GHZ = Hanyu Da Zidian; J = Japan; KP= North Korea; K = South Korea; T = Taiwan.
Sources: Unihan Database, Vietnamese Nôm Preservation Foundation, "Code Charts – CJK Ext. E" (N4358-A). The Han-Viet readings are from Hán Việt Từ Điển.

The characters that do not exist in Chinese have Sino-Vietnamese readings that are based on the characters given in parentheses. The common character for càng () contains the radical (insects). This radical is added redundantly to create 𫋙, a rare variation shown in the chart above. The character 𫡯 (chàu) is specific to the Tày people. It has been part of the Unicode standard only since version 8.0 of June 2015, so there is still very little font and input method support for it. It is a variation of , the corresponding character in Vietnamese.

See also


This page was last updated at 2024-03-28 03:17 UTC. Update now. View original page.

All our content comes from Wikipedia and under the Creative Commons Attribution-ShareAlike License.


Top

If mathematical, chemical, physical and other formulas are not displayed correctly on this page, please useFirefox or Safari