‘THE ENGLISH ALPHABET”[ARCHAIC LETTERS]

-as of [11 SEPTEMBER 2025]-

‘archaic] letters’

(‘old/middle english’ had a number of ‘non-latin letters’ that have since dropped out of use)

(these either took the names of the equivalent runes, since there were no latin names to adopt, or (thorn, wyn) were runes themselves)

‘edh’ or ‘eth’

/ˈɛð/

(used for the consonants /ð/ + /θ/)

ethel

/ˈɛðəl/

(used for the vowel /œ/, which disappeared from the language quite early)

thorn

/ˈθɔːrn/

(used for the consonants /ð/ + /θ/)

*’wyn’ or ‘wynn’*

/ˈwɪn/

(used for the consonant /w/)
(as the letter ‘w’ had not yet been invented)

yogh

/ˈjɒɡ/ or /ˈjɒx/

(used for various sounds derived from /ɡ/)
(such as /j/ + /x/)

Ç ç

*wiki-link*

en.wikipedia.org /wiki/%C3%87
Ç – Wikipedia
7-9 minutes
This article is about the Latin script letter. For the voiceless palatal fricative represented by ç in the IPA, see Voiceless palatal fricative.

Ç
Ç ç
(See below)
Writing cursive forms of Ç
Usage
Writing system Latin script
Type Alphabetic and Logographic
Language of origin Old Spanish language
Phonetic usage [s]
[t͡ʃ]
[d͡ʒ]
[t͡s]
[d͡z]
[ç]
[ɽ]
[ǂ]
Unicode value U+00C7, U+00E7
History
Development
Z4

Proto-Sinaitic Zayin
Protozayn.svg
Phoenician Zayin
PhoenicianZ-01.svg
Ζ ζ
𐌆
Z z
Ꝣ ꝣ
Ç ç
Time period ~900 to present
Descendants None
Sisters None
Transliteration equivalents ch, c, s, ts
Variations (See below)
Other
Other letters commonly used with c, ch, s, ts
This article contains IPA phonetic symbols. Without proper rendering support, you may see question marks, boxes, or other symbols instead of Unicode characters. For an introductory guide on IPA symbols, see Help:IPA.
Ç or ç (c-cedilla) is a Latin script letter, used in the Albanian, Azerbaijani, Manx, Tatar, Turkish, Turkmen, Kurdish, Zazaki, and Romance alphabets. Romance languages that use this letter include Catalan, French, Friulian, Ligurian, Occitan, and Portuguese as a variant of the letter C. It is also occasionally used in Crimean Tatar and in Tajik (when written in the Latin script) to represent the /d͡ʒ/ sound. It is often retained in the spelling of loanwords from any of these languages in English, Basque, Dutch, Spanish and other Latin script spelled languages.

It was first used for the sound of the voiceless alveolar affricate /t͡s/ in Old Spanish and stems from the Visigothic form of the letter z (Ꝣ). The phoneme originated in Vulgar Latin from the palatalization of the plosives /t/ and /k/ in some conditions. Later, /t͡s/ changed into /s/ in many Romance languages and dialects. Spanish has not used the symbol since an orthographic reform in the 18th century (which replaced ç with the now-devoiced z), but it was adopted for writing other languages.

In the International Phonetic Alphabet, /ç/ represents the voiceless palatal fricative.

Usage as a letter variant in various languages[edit]

Evolution from Visigoth Z to modern Ç.

In many languages, ⟨ç⟩ represents the “soft” sound /s/ where a ⟨c⟩ would normally represent the “hard” sound /k/. These include:

Catalan. Known as ce trencada (‘broken C’) in this language, where it can be used before ⟨a⟩, ⟨o⟩, ⟨u⟩ or at the end of a word. Some examples of words with ⟨ç⟩ are amenaça (‘menace’), torçat (‘twisted’), xoriço (‘chorizo’), forçut (‘strong’), dolç (‘sweet’) and caça (‘hunting’). A well-known word with this character is Barça, a common Catalan clipping of Futbol Club Barcelona.
French (cé cédille): français (‘French’), garçon (‘boy’), façade (‘frontage’), grinçant (‘squeaking’), leçon (‘lesson’), reçu (‘received’ [past participle]). French does not use the character at the end of a word but it can occur at the beginning of a word (e.g., ça, ‘that’).[1]
Occitan (ce cedilha): torçut (‘twisted’), çò (‘this’), ça que la (‘nevertheless’), braç (‘arm’), brèç (‘cradle’), voraç (‘voracious’). It can occur at the beginning of a word.
Portuguese (cê-cedilha or cê cedilhado): it is used before ⟨a⟩, ⟨o⟩, ⟨u⟩: taça (‘cup’), braço (‘arm’), açúcar (‘sugar’). Modern Portuguese does not use the character at the beginning or at the end of a word (the nickname for Conceição is São, not Ção). According to a Portuguese grammar written in 1550, the letter ç had the sound of /dz/ around that time. Another grammar written around 1700 would say that the letter ç sounds like /s/, which shows a phonetic evolution that is still valid today.
Manx aaçhent (‘to flash back’), atçhim (‘terror’), çhaggil (‘Gather’) the digraph çh makes a /tʃ/ sound and so does tçh
In other languages, it represents the voiceless postalveolar affricate /t͡ʃ/ (like ⟨ch⟩ in English chalk):

Friulian (c cun cedilie) before ⟨a⟩, ⟨o⟩, ⟨u⟩ or at the end of a word.
Turkish: çelik (‘steel’), çilek (‘strawberry’), and çamur (‘mud’).
In Manx, it is used in the digraph ⟨çh⟩, which also represents /t͡ʃ/, to differentiate it from normal ⟨ch⟩, which represents /x/.

In loanwords only[edit]
In Basque, ⟨ç⟩ (known as ze hautsia) is used in the loanword curaçao.
In Dutch, it can be found in some words from French and Portuguese, such as façade, reçu, Provençaals and Curaçao.
In English, ⟨ç⟩ is used in loanwords such as façade and limaçon (although the cedilla mark is often dropped: ⟨facade⟩, ⟨limacon⟩).
In modern Spanish it can appear in loanwords, especially in Catalan proper nouns.
Usage as a separate letter in various languages[edit]
It represents the voiceless postalveolar affricate /t͡ʃ/ in the following languages:

the 4th letter of the Albanian alphabet.
the 4th letter of the Azerbaijani alphabet.
the 5th letter of the Tatar alphabet (based on Zamanälif).
the 4th letter of the Turkish alphabet.
the 3rd letter of the Turkmen alphabet.
the 4th letter of the Zazaki alphabet.
In Kazakh, as the 4th letter of the Kazakh alphabets based on 2020 amendment of its alphabets, it however represents /tɕ/, this is a little different by other Turkic languages.

It previously represented a voiceless palatal click /ǂ/ in Juǀʼhoansi and Naro, though the former has replaced it with ⟨ǂ⟩ and the latter with ⟨tc⟩.

The similarly shaped letter the (Ҫ ҫ) is used in the Cyrillic alphabets of Bashkir and Chuvash to represent /θ/ and /ɕ/, respectively.

It also represents the retroflex flap /ɽ/ in the Rohingya Latin alphabet.

Janalif uses this letter to represent the voiced postalveolar affricate /d͡ʒ/

Classical Malay uses ç to represent /dʒ/ and /ɲ/.

Computer[edit]
Character Ç ç Ꝣ ꝣ
Unicode name LATIN CAPITAL LETTER C WITH CEDILLA LATIN SMALL LETTER C WITH CEDILLA LATIN CAPITAL LETTER VISIGOTHIC Z LATIN SMALL LETTER VISIGOTHIC Z
Encodings decimal hex decimal hex decimal hex decimal hex
Unicode 199 U+00C7 231 U+00E7 42850 U+A762 42851 U+A763
UTF-8 195 135 C3 87 195 167 C3 A7 234 157 162 EA 9D A2 234 157 163 EA 9D A3
Numeric character reference Ç Ç ç ç Ꝣ Ꝣ ꝣ ꝣ
Named character reference Ç ç
Input[edit]
On Albanian, French, Portuguese, Spanish, Turkish and Italian keyboards, Ç is directly available as a separate key; however, on most other keyboards, including the US and British keyboard, a combination of keys must be used:

In the US-International keyboard layout, these are ‘ followed by either C or ⇧ Shift+C. Alternatively one may press AltGr+, or AltGr+⇧ Shift+,.
In classic Mac OS and macOS, these are ⌥ Opt+C and ⌥ Opt+⇧ Shift+C for lower- and uppercase, respectively.
In the X Window System and many Unix consoles, one presses sequentially Compose, , and either C or ⇧ Shift+C. Alternatively, one may press AltGr+= and then either C or ⇧ Shift+C.
In Microsoft Windows, these are Alt+0231 or Alt+135 for lowercase and Alt+0199 or Alt+128 for uppercase.
In Microsoft Word, these are Ctrl+, and then either C or ⇧ Shift+C.
The HTML character entity references are ç and Ç for lower- and uppercase, respectively.
In TeX and LaTeX, \c is used for adding the cedilla accent to a letter, so \c{c} produces “ç”.
See also[edit]
Ҹ
References[edit]

Look up Ç or ç in Wiktionary, the free dictionary.

*cedilla*

*wiki-link*

en.wikipedia.org /wiki/Cedilla
Cedilla – Wikipedia
12-15 minutes
“Cedille” redirects here. For the record label, see Cedille Records.

Not to be confused with Sedilia.

Cedilla
Diacritics in Latin & Greek
accent
acute´
double acute˝
grave`
double grave ̏
circumflexˆ
caron, háčekˇ
breve˘
inverted breve ̑
cedilla¸
diaeresis, umlaut¨
dot·
palatal hook ̡
retroflex hook ̢
hook above, dấu hỏi ̉
horn ̛
iota subscript ͅ
macronˉ
ogonek, nosinė˛
perispomene ͂
overring˚
underring˳
rough breathing῾
smooth breathing᾿
Marks sometimes used as diacritics
apostrophe’
bar◌̸
colon:
comma,
full stop/period.
hyphen˗
prime′
tilde~
Diacritical marks in other scripts
Arabic diacritics
Early Cyrillic diacritics
kamora ҄
pokrytie ҇
titlo ҃
Gurmukhī diacritics
Hebrew diacritics
Indic diacritics
anusvaraं ং ം
chandrabinduँ ఁ
nukta़
virama् ് ్ ් ್
visargaः ঃ
IPA diacritics
Japanese diacritics
dakuten ﾞ
handakuten ﾟ
Khmer diacritics
Syriac diacritics
Thai diacritics
Related
Dotted circle◌
Punctuation marks
Logic symbols
This template:

view
talk
edit
A̧ a̧
B̧ b̧
Ç ç
Ḉ ḉ
Ç̇ ç̇
Ḑ ḑ
Ȩ ȩ
Ȩ̇ ȩ̇
Ḝ ḝ
Ə̧ ə̧
Ɛ̧ ɛ̧
Ģ ģ
Ḩ ḩ
I̧ i̧
Ɨ̧ ɨ̧
Ķ ķ
Ļ ļ
M̧ m̧
Ņ ņ
O̧ o̧
Ɔ̧ ɔ̧
Q̧ q̧
Ŗ ŗ
Ş ş
ſ̧ ß̧
Ţ ţ
U̧ u̧
X̧ x̧
Z̧ z̧
A cedilla ( si-DIL-ə; from Spanish), also known as cedilha (from Portuguese) or cédille (from French), is a hook or tail ( ¸ ) added under certain letters as a diacritical mark to modify their pronunciation. In Catalan, French, and Portuguese, it is used only under the c, and the entire letter is called, respectively, c trencada (i.e. “broken C”), c cédille, and c cedilhado (or c cedilha, colloquially). It is used to mark vowel nasalization in many languages of sub-Saharan Africa, including Vute from Cameroon.

Origin[edit]

The tail originated in Spain as the bottom half of a miniature cursive z. The word “cedilla” is the diminutive of the Old Spanish name for this letter, ceda (zeta).[1] Modern Spanish and Galician no longer use this diacritic, although it is used in Portuguese,[2] Catalan, Occitan, and French, which gives English the alternative spellings of cedille, from French “cédille”, and the Portuguese form cedilha. An obsolete spelling of cedilla is cerilla.[2] The earliest use in English cited by the Oxford English Dictionary[2] is a 1599 Spanish-English dictionary and grammar.[3] Chambers’ Cyclopædia[4] is cited for the printer-trade variant ceceril in use in 1738.[2] The main use in English is not universal and applies to loan words from French and Portuguese such as “façade”, “limaçon” and “cachaça” (often typed “facade”, “limacon” and “cachaca” because of lack of ç keys on Anglophone keyboards).

With the advent of modernism,[vague] the calligraphic nature of the cedilla was thought somewhat jarring on sans-serif typefaces, and so some designers instead substituted a comma design, which could be made bolder and more compatible with the style of the text.[a] This can add to confusion as the use of commas as opposed to cedillas varies by language.

C[edit]
Main article: Ç

A conventional “ç” and ‘modernist’ cedilla “c̦” (right), intended for French and Swiss use.

The most frequent character with cedilla is “ç” (“c” with cedilla, as in façade). It was first used for the sound of the voiceless alveolar affricate /ts/ in old Spanish and stems from the Visigothic form of the letter “z” (ꝣ), whose upper loop was lengthened and reinterpreted as a “c”, whereas its lower loop became the diminished appendage, the cedilla.

It represents the “soft” sound /s/, the voiceless alveolar sibilant, where a “c” would normally represent the “hard” sound /k/ (before “a”, “o”, “u”, or at the end of a word) in English and in certain Romance languages such as Catalan, Galician, French (where ç appears in the name of the language itself, français), Ligurian, Occitan, and Portuguese. In Occitan, Friulian and Catalan ç can also be found at the beginning of a word (Çubran, ço) or at the end (braç).

It represents the voiceless postalveolar affricate /tʃ/ (as in English “church”) in Albanian, Azerbaijani, Crimean Tatar, Friulian, Kurdish, Tatar, Turkish (as in çiçek, çam, çekirdek, Çorum), and Turkmen. It is also sometimes used this way in Manx, to distinguish it from the velar fricative.

In the International Phonetic Alphabet, ⟨ç⟩ represents the voiceless palatal fricative.

S[edit]
Main article: Ş

The character “ş” represents the voiceless postalveolar fricative /ʃ/ (as in “show”) in several languages, including many belonging to the Turkic languages, and included as a separate letter in their alphabets:

Turkish
For example, it is used in Turkish words and names like Eskişehir, Şımarık, Hasan Şaş, Rüştü Reçber etc.
Azerbaijani
Crimean Tatar
Gagauz
Tatar
Turkmen
Romanian (substitution use when S-comma [Ș] was missing from pre-3.0 Unicode standards, and older standards, still frequent, but an error)
Kurdish
In HTML character entity references Ş and ş can be used.
Latvian[edit]
Comparatively, some consider the diacritics on the palatalized Latvian consonants “ģ”, “ķ”, “ļ”, “ņ”, and formerly “ŗ” to be cedillas. Although their Adobe glyph names are commas, their names in the Unicode Standard are “g”, “k”, “l”, “n”, and “r” with a cedilla. The letters were introduced to the Unicode standard before 1992, and their names cannot be altered. The uppercase equivalent “Ģ” sometimes has a regular cedilla.

Marshallese[edit]
Four letters in Marshallese have cedillas: <ļ m̧ ņ o̧>. In standard printed text they are always cedillas, and their omission or the substitution of comma below and dot below diacritics are nonstandard.[citation needed]

As of 2011, many font rendering engines do not display any of these properly, for two reasons:

“ļ” and “ņ” usually do not display properly at all, because of the use of the cedilla in Latvian. Unicode has precombined glyphs for these letters, but most quality fonts display them with comma below diacritics to accommodate the expectations of Latvian orthography. This is considered nonstandard in Marshallese. The use of a zero-width non-joiner between the letter and the diacritic can alleviate this problem: “l‌̧” and “n‌̧” may display properly, but may not; see below.
“m̧” and “o̧” do not currently exist in Unicode as precombined glyphs, and must be encoded as the plain Latin letters “m” and “o” with the combining cedilla diacritic. Most Unicode fonts issued with Windows do not display combining diacritics properly, showing them too far to the right of the letter, as with Tahoma (“m̧” and “o̧”) and Times New Roman (“m̧” and “o̧”). This mostly affects “m̧”, and may or may not affect “o̧”. But some common Unicode fonts like Arial Unicode MS (“m̧” and “o̧”), Cambria (“m̧” and “o̧”) and Lucida Sans Unicode (“m̧” and “o̧”) do not have this problem. When “m̧” is properly displayed, the cedilla is either underneath the center of the letter, or is underneath the right-most leg of the letter, but is always directly underneath the letter wherever it is positioned.
Because of these font display issues, it is not uncommon to find nonstandard ad hoc substitutes for these letters. The online version of the Marshallese-English Dictionary (the only complete Marshallese dictionary in existence) displays the letters with dot below diacritics, all of which do exist as precombined glyphs in Unicode: “ḷ”, “ṃ”, “ṇ” and “ọ”. The first three exist in the International Alphabet of Sanskrit Transliteration, and “ọ” exists in the Vietnamese alphabet, and both of these systems are supported by the most recent versions of common fonts like Arial, Courier New, Tahoma and Times New Roman. This sidesteps most of the Marshallese text display issues associated with the cedilla, but is still inappropriate for polished standard text.

Other diacritics[edit]
Languages such as Romanian add a comma (virgula) to some letters, such as ș, which looks like a cedilla, but is more precisely a diacritical comma. This is particularly confusing with letters which can take either diacritic: for example, the consonant /ʃ/ is written as “ş” in Turkish but “ș” in Romanian, and Romanian writers will sometimes use the former instead of the latter because of insufficient font or character-set support.

The Polish letters “ą” and “ę” and Lithuanian letters “ą”, “ę”, “į”, and “ų” are not made with the cedilla either, but with the unrelated ogonek diacritic.

French[edit]
In 1868, Ambroise Firmin-Didot suggested in his book Observations sur l’orthographe, ou ortografie, française (Observations on French Spelling) that French phonetics could be better regularized by adding a cedilla beneath the letter “t” in some words. For example, the suffix -tion this letter is usually not pronounced as (or close to) /t/ in French, but as /sjɔ̃/. It has to be distinctly learned that in words such as diplomatie (but not diplomatique) it is pronounced /s/. A similar effect occurs with other prefixes or within words. Firmin-Didot surmised that a new character could be added to French orthography. A similar letter, the t-comma, does exist in Romanian, but it has a comma accent, not a cedilla.

Romanian[edit]
The Unicode characters for Ţ (T with cedilla) and Ş (S with cedilla) were wrongly implemented in Windows-1250, the code page for Romanian. In Windows 7, Microsoft corrected the error by replacing T-cedilla with T-comma (Ț) and S-cedilla with S-comma (Ș).

Vute[edit]
Vute, a Mambiloid language from Cameroon, uses cedilla for the nasalization of all vowel qualities (cf. the ogonek used in Polish and Nahuatl for the same purpose). This includes unconventional roman letters that are formalized from the IPA into the official writing system. These include .

Gagauz[edit]
Gagauz uses Ţ (T with cedilla), one of the few languages to do so, and Ş (S with cedilla). Besides being present in some Gagauz orthographies, T with Cedilla exists as part of the General Alphabet of Cameroon Languages, in the Kabyle dialect of the Berber language, and possibly elsewhere.

Encodings[edit]
Unicode provides precomposed characters for some Latin letters with cedillas. Others can be formed using the cedilla combining character.

Unicode and HTML Codes for Cedillas

Description Letter Unicode HTML

Cedilla (spacing) ¸ U+00B8 ¸ or ¸

Combining cedilla ◌̧ U+0327 ̧

C with cedilla Ç

ç U+00C7

U+00E7 Ç or Ç

ç or ç

C with cedilla and acute accent Ḉ

ḉ U+1E08

U+1E09 Ḉ

ḉ

Combining small c with cedilla

(medieval superscript diacritic)[10] ◌ᷗ U+1DD7 ᷗ

D with cedilla Ḑ

ḑ U+1E10

U+1E11 Ḑ

ḑ

E with cedilla Ȩ

ȩ U+0228

U+0229 Ȩ

E with cedilla and breve Ḝ

ḝ U+1E1C

U+1E1D Ḝ

ḝ

G with cedilla Ģ

ģ U+0122

U+0123 Ģ

H with cedilla Ḩ

ḩ U+1E28

U+1E29 Ḩ

ḩ

K with cedilla Ķ

ķ U+0136

U+0137 Ķ

L with cedilla Ļ

ļ U+013B

U+013C Ļ

N with cedilla Ņ

ņ U+0145

U+0146 Ņ

R with cedilla Ŗ

ŗ U+0156

U+0157 Ŗ

S with cedilla Ş

ş U+015E

U+015F Ş

T with cedilla Ţ

ţ U+0162

U+0163 Ţ

References[edit]

^ For cedilla being the diminutive of ceda, see definition of cedilla, Diccionario de la lengua española, 22nd edition, Real Academia Española (in Spanish), which can be seen in context by accessing the site of the Real Academia and searching for cedilla. (This was accessed 27 July 2006.)

^ Jump up to: a b c d “cedilla”. Oxford English Dictionary (3rd ed.). Oxford University Press. September 2005. (Subscription or UK public library membership required.)

^ Minsheu, John (1599) Percyvall’s (R.) Dictionarie in Spanish and English (as enlarged by J. Minsheu) Edm. Bollifant, London, OCLC 3497853

^ Chambers, Ephraim (1738) Cyclopædia; or, an universal dictionary of arts and sciences (2nd ed.) OCLC 221356381

^ Jacquerye, Denis Moyogo. “Cedilla Comma” (PDF). Unicode Consortium. Retrieved 3 July 2015.

^ “Neue Haas Grotesk”. The Font Bureau, Inc. p. Introduction.

^ “Neue Haas Grotesk – Font News”. Linotype.com. Retrieved 2013-09-21.

^ “Schwartzco Inc”. Christianschwartz.com. Retrieved 2013-09-21.

^ “Akzidenz Grotesk Buch”. Berthold/Monotype. Archived from the original on 4 July 2015. Retrieved 3 July 2015.

^ “N3027: Proposal to add medievalist characters to the UCS” (PDF). ISO/IEC JTC1/SC2/WG2. 2006-01-30.

External links[edit]

ScriptSource — Positioning the traditional cedilla

Diacritics Project—All you need to design a font with correct accents

Keyboard Help—

Learn how to make world language accent marks and other diacriticals on a computer

👈👈👈☜*“THE ENGLISH ALPHABET”* ☞ 👉👉👉

💕💝💖💓🖤💙🖤💙🖤💙🖤❤️💚💛🧡❣️💞💔💘❣️🧡💛💚❤️🖤💜🖤💙🖤💙🖤💗💖💝💘

*🌈✨ *TABLE OF CONTENTS* ✨🌷*

🔥🔥🔥🔥🔥🔥*we won the war* 🔥🔥🔥🔥🔥🔥