All English vowel sounds in one sentence
Contrary to popular belief, English has more than five or ten vowel sounds. The actual number is disputed because of disagreements over when two sounds are sufficiently distinct to be classified as separate sounds. I've heard some people say 15, some 17, some over 20.
I ran across a podcast episode recently that mentioned a sentence that demonstrates a different English vowel sound in each word:
Who would know naught of art must learn, act, and then take his ease [1].
The hosts noted that to get all the vowels in, you need to read the sentence with non-rhotic pronunciation, i.e. suppressing the r in art.
I'll run this sentence through some software that returns the phonetic spelling of each word in IPA symbols to see the distinct vowel sounds that way. First I'll use Python, then Mathematica.
PythonLet's run this through some Python code that converts English words to IPA notation so we can look at the vowels.
import eng_to_ipa as ipa text = "Who would know naught of art must learn, act, and then take his ease." print(ipa.convert(text))
This gives us
hu wd no nt v rt mst lrn, aekt, nd n tek hz iz
Which includes the following vowel symbols:
- u
- o
- ae
- e
- i
This has some duplicates: 5, 7, 8, and 10 are all schwa symbols.
By default the eng_to_ipa gives one way to write each word in IPA notation. There is an optional argument, retrieve_all that defaults to False but may return more alternatives when set to True. However, in our example the only difference is that the second alternative writes and as aend rather than nd.
It looks like the eng_to_ipa module doesn't transcribe vowels with sufficient resolution to distinguish some of the sounds in the model sequence. For example, it doesn't seem to distinguish the stressed sound from the unstressed .
MathematicaHere's Mathematica code to split the model sentence into words and show the IPA pronunciation of each word.
text = "who would know naught of art must \ learn, act, and then take his ease" ipa[w_] := WordData[w, "PhoneticForm"] Map[ipa, TextWords[text]]
This returns
{"hu", "wd", "no", "nt", "v", "rt", "mst", "ln", "aekt", "aend", "n", "tek", "hz", "iz"}
By the way, I had to write the first word as who" because WordData won't do it for me. If you ask for
ipa["Who"]
Mathematica will return
Missing["NotAvailable"]
though it works as expected if you send it who" rather than Who."
Let's remove the stress marks and join the words together so we can compare the Python and Mathematica output. The top line is from Python and the bottom is from Mathematica.
hu wd no nt v rt mst lrn aekt aend n tek hz iz hu wd no nt v rt mst ln aekt aend n tek hz iz
There are a few differences, summarized in the table below. Since the symbols are a little difficult to tell apart, I've included their Unicode code points.
|-------+------------+-------------| | Word | Python | Mathematica | |-------+------------+-------------| | of | (U+0259) | (U+028C) | | must | (U+0259) | (U+028C) | | art | (U+0251) | (U+0252) | | learn | (U+0259) | (U+025D) | |-------+------------+-------------|
Mathematica makes some distinctions that Python missed.
Update: See the first comment below for variations on how the model sentence can be pronounced and how to get more distinct vowel sounds out of it.
More linguistics posts- Estimating vocabulary size with Heaps law
- Writing down an unwritten language
- Chinese character frequency and entropy
[1] After writing this post I saw the sentence in writing, and the fourth word is aught" rather than naught." This doesn't change the vowel since the two words rhyme, but Mathematica doesn't recognize the word aught."
The post All English vowel sounds in one sentence first appeared on John D. Cook.