the documented Franco writing system.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

13 KiB

title date description tags
the Arabic writing system problem 2019-01-01 a deep dive into the problems that face the traditional Arabic writing system, and how we can change it. [research writing systems Arabic linguistics]


the nature of the language

humans like any other animal have a way to send signals to each other by the means of sound. our brain utilizes our ears and mouth to form a communication channel by encoding entities in nature to a conventional verbal sounds. this whole thing we call it now "the language" - in the English language, or more precisely "the spoken language".

the nature of writing systems

between 3400 and 3300 BC in sumar humans invented the most important invention in the history (its actually the invention that defined the beginning of history), the writing system. this is yet another communication channel in which our brain this time utilizes our eyes and hands to form it.

the writing system is very different than the spoken language in its nature. our brain treats those two communication channels differently. we often falsely (but understandably) include the writing system in the definition of the language. it's known that learning speaking and listening in a new language is a different skill than writing and reading it, and require different training. due to this natural distinction, the writing systems of languages evolved independently. actually so many languages didn't have any writing system until relatively recently (like Arabic for instance).

what i want to emphasis from this section is one thing, the writing system is not the language itself, and they are mostly independent from each other, some languages have multiple writing systems (e.g. Hindi and Urdu). some writing systems are used in multiple languages, as the Arabic writing system used for writing languages other than Arabic like Persian, Kurdish, and Urdu (Arabic was also used to write Spanish until the 16th century).

categories of writing system

phonetic based

in phonetic based systems every letter represent a speech sound, like alphabetical languages (e.g. all Latin languages) or abjadic languages (e.g. Arabic and Hebrew).

syllabaries or logographies based systems

in those systems each character represents a word or part of a word (e.g. Chinese and Egyptian hieroglyphs)


some writing systems has both types. for instance Japanese has letters that represents sounds and also uses some Chinese characters in its writing system.

character set and encoding

before we continue we have to agree on the meaning of some words that will come up often in this discussion.

character set

a character set is the collection of symbols that's used in a writing system. for example, a to z is character set of 26 symbols. while a to z

  • A to Z is another character set with 52 symbols. some writing systems us character sets with thousands of symbols like Chinese. different writing systems can share one character set or borrow some symbols from another writing system's character set.


long time after humans invented writing systems. it was the time for inventing the second most important invention in history, computers. how computers work demands another layer of encoding, encoding the symbols in a character set to ones and zeros. this layer forms a communication channel between we - humans - and computers.


ASCII is character set and encoding that consists of 128 symbols {a-z + A-Z, 0-9, punctuations}, i.e. everything you can see on English keyboard. it's the most simple, efficient and supported character set known till now.

the Arabic character set

the character set of Arabic can consists of (at least) the following symbols: ! " # $ ٪ & ' () ٭ + ، - . / ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ : ؛ <=> ؟ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع غ [] ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ ِ ّ ْ ٱ ٰ ٬ ” “ Arabic charset shares some symbols with ASCII which is the source of many problems regarding reordering in mixed text. the Arabic charset has another point of complexity, which is how symbols are displayed on screen. for example the character غ is never displayed as you see it standing alone in Arabic text, basically it changes its shape according to where it appeared in the word (initial, medial, final) and also according to the letter before it and after it. Some letters would have more than eight glyphs (shapes) in some writing styles. and almost every character in Arabic behaves this way. the total number of glyphs in traditional printing exceeds four hundred in many typefaces.

the problem

mixed text

since writing mixed English and Arabic is inevitable for Arabic speakers in the world we live in now, the inherently problematic bidirectional text scenario arises more often bringing with it all kinds of headaches.

every text is stored electronically as a string of characters, writing RTL (Right To Left) or LTR doesn't change the order those characters are stored internally. the software displaying the string is responsible for correctly computing the relevant position of the next character on the screen based on whether or not this characters and the character preceding it belongs to an RTL character set.

if you are an Arabic speaker you know the frustration of reading or editing bidirectional text on nearly any software.

some of the ways things can go wrong are:

  • numbers inside RTL (specially if there is a number at the end or the beginning of a line or there is a number at a direction switch)
  • highlighting text with mixed directions
  • wrapping text on multiple lines
  • adding a comma on a direction switch
  • enclose text in parenthesis (or any mirrored characters like "<,[,{")
  • specifying the base direction

you can see this for an introduction to the bidi algorithms and its limitations.

another problem is keyboard mappings, if you are a keyboard oriented person as most power users, changing keyboard layout while editing will disable all your keyboard shortcuts making editing more and more and more difficult.


there are two solutions people are migrating to.

all English

this is great solution and it shortens the way to the inevitable end of one universal language. but this will not happen in our life time or even in our childern's life time. until then arabic will be used and trying to avoid it in all written text is basically impractical.

Writing Arabic Using Latin Characters

this solution is traced back to the time email and cell phones didn't support Arabic. users back then used to send Arabic written in English alphabet as an ad hoc solution. this method of writing used to have many names: Araby, Arabizi, Mu'arrab, and Franco-Arabic. i will refer to it simply as Franco as we call it in Egypt.

but way before that there was proposals to replace the Arabic letters with Latin ones. the first recorded one was by Wilhelm Spitta in 1880 in his book "Vulgardialectes von Agypten Grammatik des Arabischen [The Rules of Slang Arabic in Egypt]" in which he suggested using Latin characters to write the Egyptian slang. There were many other supporters for this proposal during the following forty years. Among them was Abdul Aziz Fahmi who proposed a full practical scheme in response to a competition organized by the Academy of the Arabic Language in Cairo. Other supporters of Fahmi’s proposal, including Sa’id ‘Aqil and Anis Freha developed their own proposals. They found additional support in Turkey, where Kamal Ataturk ordered the adoption of English characters to write Turkish instead of the Arabic characters in 1928.

those academic proposals didn't have any influence over the design and adoption of Franco, but rather it's developed organically over time by society, it got a lot of adoption among new generations to the point where google added Franco as one of its supported input methods in 2012 (but dropped it later due to Anti-Franco campaigns). it has been, in formal and informal daily use communication, entertainment, social media, signs and flyers, or even in students' writing books. i must say also that Franco wasn't a way of romanization of Arabic (which is a way of writing Arabic names and titles in Latin alphabet to be pronounced by English speakers). Franco meant to be a system for Arabic speakers to communicate with each other in Arabic.

characteristics of common Franco.

  1. Franco only uses the ASCII charset(i.e. does not use Latin symbols as é or á).
  2. the Franco system is contextual.
  3. The traditional Arabic vocalization marks are substituted by vowels. The "fatha" by a; the "kasra" by i or e; and the "damma" by u, ou, or o.
  4. The Franco system differs for every Arabic country, depending on the local dialect. for example, in countries which french is its second languages, "ch" is used to write the Arabic letter "sheen" instead of "sh".
  5. numbers are used to represent Arabic characters that can't be written using the ASCII. numbers are chosen based how similar the number is to the letter represented. for example, the letter ح represented by 7; the letter ء represented by 2 (the same shape mirrored); the letter ع represented by 3.

war against Franco

Franco has always been fought and severely criticized by Arab nationalists and Muslim enthusiasts, who considered it as a direct attack on the Arabic identity. They also saw it as a threat to the Quran, which is written in classical Arabic using a writing system that has lasted for more than fourteen centuries. Islamic and Arabic scholars started a series of Anti-Franco campaigns around the Arab world. Anti-Franco was the main theme of World Arabic Language Day 2014 . Google as well stopped from supporting Franco and removed it from their language tools. Arabic experts reactions to Franco are typically quite negative, even when they themselves employ it sometimes. They emphasize the use of the traditional writing system and marginalize the efforts of students to use alternative forms of writing Arabic.

most Anti-Franco arguments has political, religious and racial components new generations are free of. they also come from a misunderstanding of the difference between a language and its writing system. i see the adoption of Franco is a healthy thing for the Arabic language. i love Arabic; i think in Arabic, and i will continue to think in Arabic to the end of my life. but Franco for me is a way to express myself in Arabic without all the hassle traditional writing brings in.

rational criticism

rational people who are not used to read or write Franco will criticize Franco as it's hard to read, of course using a new writing system wouldn't come without a learning curve. its not fair to compare the ease of Franco to a writing system people taught in school for years. suppose Franco taught in schools and someone introduced you to the traditional system, would you find it easier?.

i didn't see any criticism outside that it is hard to read. if you have another point against Franco please send it to me and i will include it in this section.

the future of Franco

until now i think Franco refuses to die despite everything, Arabs are still using it heavily in chats, but the public domain doesn't reflect this internal adoption. we need new studies to measure Franco use in different geographic areas. i came across some studies done in Saudi Arabia with ranging results from medium to high usage among teens. i claim that if similar studies made in countries like Egypt, Morocco, Lebanon or Jordan the studies will have significantly different results towards very high usage.

like any organically grown idea, most of the design choices in the common Franco we write today has some form of a hidden wisdom inside of it, but has some weak aspects as well. if we are serious about developing a writing system, there must be an intelligent driving force that maintains some standard or a guideline. i didn't find one yet, but i am working on one here. i am planning also to visit a linguistic college and see how people there think about this and maybe try to convince some students to maintain the project further in a scientific manner. if you are interested or want to contribute any thing in any form (even a criticism would help in some decisions) please contact me.