Tag Archives: dictionaries

Writing Systems

One of the side benefits of the consultant conference I attended in September was that it was in Thailand, where some of our colleagues are based. Specifically, those who work on WeSay, a program I have put to good use in DRCongo, and hope to continue using in Cameroon. The same group also works on non-roman scripts (Writing Systems Technology). Anyway, I got to greet them face to face for the first time, having corresponded with some of them for more than a decade.

One of the highlights of that visit was seeing different font and printing issues they had faced. As I work in writing systems development, this was particularly interesting to me. They work on a different end of the puzzle, though: I work on figuring out what are the consonants, vowels, and tones in a language, and how to represent them well in writing (#FluentReadingMakesPowerfulBibles). This team works on how to take a developed writing system and represent it well in a (unicode!) font for printing.

They have examples of printed Bibles with roman scripts (like our letters, though with additional signs for additional vowels, tone, or other important language features):

Even though this script is based on the same alphabet as English, you can see that there are some potential problems for typesetting it. For instance, in many places there are two distinct marks (diacritics) above a vowel, what we call “stacked diacritics”. For instance, in John 13:26 (the first verse on the page) Jesus’ second word “tônuô̌” has two different marks on the final vowel. I assume the one marks a vowel difference (i.e., “o” and “ô” are not the same vowel —like English “bet” and “bit” aren’t the same vowel), and the other marks a difference of tone (i.e., “o” is not the same tone as “ǒ” —like English “convict” (n) and “convict” (v) don’t have the same stress).

When you put these two together, and you have a potential conflict. In my browser anyway (and maybe in yours) the word “tônuô̌” has two diacritics (from ô and the ǒ) typed more or less on top of each other, rather than the diacritic on ǒ being placed above that on the ô, as it should be (and as in the printed Bible). If you scan through that page, you can see lots of variations of these stacked diacritics, meaning its is important to get this correct for printing.

But they also work with non-roman scripts:

These scripts have their own issues, though I haven’t gotten into them myself, since most African languages I have worked with use roman scripts.

One interesting issue they had to deal with was multiple scripts for the same language. In this example, there was a script of high cultural value, which was not really understood by much of the population. So they wanted to print the valuable script in parallel with the script that people could actually understand:

On some occasions, they have dealt with two (or three) scripts for the same language because it is spoken in multiple countries, and each country as it’s own mandated writing system. Converting checked scripture between these poses its own set of problems.

Here is yet another script:

I don’t remember the details of this one, though the tabs a the top of the page each indicate a problem that needed to be fixed in terms of the printing of the writing system. How one writes is typically very important, both to individuals and people groups, so we want to be able to move past making mistakes if possible —though imagine checking the typesetting on this Bible!

Often the two books that help clarify and preserve a writing system are the Bible and a dictionary, which is why we want to get the work done on the writing system early, before we do a lot of printing of scripture portions. And it is again why I pair my dictionary work with writing system development, so people can see lots of words in writing, and give us feedback on problems with the writing system, whether they are technical or aesthetic.

This hard, technical work in advance is worth it because we want the scriptures to be used, and to speak with power into the hearts of the people who read and hear them read. For this to happen, the writing system needs to be acceptable and valued so people will use it, but also accurately reflect the language, so people can use it fluently. #FluentReadingMakesPowerfulBibles

Dictionaries

I just finished drafting the line “good progress on dictionaries for each of the languages” for the newsletter we’re hoping to put out in the next couple days, and I realized I’m not sure that it is clear to all why that is a good thing.  So here’s a bit of a rationale.

From what I understand of the history of dictionaries in English, one of the main reasons people do them is to help standardize the writing system.  Have you ever asked how to spell a word, and been told to look it up in a dictionary?  Perhaps that doesn’t happen so much anymore, but in any case, dictionaries can be an authoritative source for spelling information. I have even understood that one of Webster’s goals was not only the standardization of American English spelling, but also the creation of a distinctly American English. Have you used the words colour, litre, practise, paralyse or programme? If so, you’re probably British (or learned your spelling from a Brit). Nothing against the Brits; it’s just that spelling is one way of saying “this is who we are”. While I’m hoping that the communities we work with in the DRC won’t spend much time distinguishing one dialect of their language from another, I do hope they will spend time clarifying their identity in their writing system.

I want this for two reasons.  First of all, a lot of Bible Translation is about identity.  If we are going to help someone translate what will be seen as a foreigners’ Bible, then we might as well stop today, since these people already have second and third language Bibles. Additionally, this is not the promise of the scriptures. Rather, it promises “…a great multitude that no one could number, from every nation, from all tribes and peoples and languages… crying out with a loud voice, ‘Salvation belongs to our God…'” (Rev 7:9-10 ESV). I don’t see this promise as just lots of different Christians, but people from each of the 6,000+ languages of the world seeing God as their own God, and worshiping Him as such. We don’t want to just translate and print books; we want to see the impact in individuals and communities of the Holy Spirit working through the written Word of God. This impact is hampered if you’re reading a Bible that continues to tell you that God is talking to someone else, not you.

The second reason I’m in favor of standardization has to do with fluency.  I’ve seen a lot of non-fluent reading of the scriptures, and I personally find it painful. And I imagine it must be difficult to have the kind of impact I mention above if the reader stumbles often in the reading, and/or has to read something multiple times to get the meaning. As a result, one major motivation for my work is to see people reading fluently. I want them to read without stumbling, and to get the sense the first time they read. I want pastors to be able to read the scriptures in the middle of a sermon without creating a major break in the thought flow. And I want to do everything I can to remove any barriers to fluency which arise as a result of the writing system. That normally starts with getting the consonants, vowels and tone correct, but it also includes people knowing how words are spelled, and identifying the correct word and its pronunciation quickly as they read.

There are two other arguments for dictionaries as part of language development, one of which is sociological, the other linguistic. Related to the identity question above, many peoples I’ve interacted with don’t see their language as valuable, and this opinion is often shared by outsiders. I once heard a “real language” as “you know, one with books”. I think I know what that person meant, but if it takes a book to give a language respect, then I want to be a part of giving them their first book. And people get this. Seeing someone look at the first booklet in their language (as little as 15 pages, with lots of pictures!) is an amazing sight. They immediately get that someone does finally care about them.

The linguistic argument is that in order to do good dictionary work, you need to do a lot of other things which you already should be doing anyway: collecting and analyzing texts, checking pronunciations, helping the community decide how words should be spelled, including diacritics, spacing, and punctuation. All of this analysis helps build not only the dictionary, but our understanding of the language more generally, perhaps more particularly how the sound and writing systems will interact. I hope it is clear why one would want to do this before publishing much in the language; anything you publish without really understanding how the writing system will work may need substantial revision, and anything you publish creates a precedent that you will have to fight against in making later changes (and if you don’t see how precedent can trump sound reasoning for spelling changes, just look at English).

So our ultimate goal is life transformation through the Bible, but to get there, we want to see that the community is well placed to have and use a Bible that is theirs, that can and will be read fluently and with power, and dictionary work helps further all of those goals.