All English vowel sounds in one sentence

John

from John D. Cook on 2020-08-20 16:24 (#5751D)

Contrary to popular belief, English has more than five or ten vowel sounds. The actual number is disputed because of disagreements over when two sounds are sufficiently distinct to be classified as separate sounds. I've heard some people say 15, some 17, some over 20.

I ran across a podcast episode recently that mentioned a sentence that demonstrates a different English vowel sound in each word:

Who would know naught of art must learn, act, and then take his ease [1].

The hosts noted that to get all the vowels in, you need to read the sentence with non-rhotic pronunciation, i.e. suppressing the r in art.

I'll run this sentence through some software that returns the phonetic spelling of each word in IPA symbols to see the distinct vowel sounds that way. First I'll use Python, then Mathematica.

Python

Let's run this through some Python code that converts English words to IPA notation so we can look at the vowels.

 import eng_to_ipa as ipa text = "Who would know naught of art must learn, act, and then take his ease." print(ipa.convert(text))

This gives us

hu wd no nt v rt mst lrn, aekt, nd n tek hz iz

Which includes the following vowel symbols:

This has some duplicates: 5, 7, 8, and 10 are all schwa symbols.

By default the eng_to_ipa gives one way to write each word in IPA notation. There is an optional argument, retrieve_all that defaults to False but may return more alternatives when set to True. However, in our example the only difference is that the second alternative writes and as aend rather than nd.

It looks like the eng_to_ipa module doesn't transcribe vowels with sufficient resolution to distinguish some of the sounds in the model sequence. For example, it doesn't seem to distinguish the stressed sound from the unstressed .

Mathematica

Here's Mathematica code to split the model sentence into words and show the IPA pronunciation of each word.

 text = "who would know naught of art must \ learn, act, and then take his ease" ipa[w_] := WordData[w, "PhoneticForm"] Map[ipa, TextWords[text]]

This returns

 {"hu", "wd", "no", "nt", "v", "rt", "mst", "ln", "aekt", "aend", "n", "tek", "hz", "iz"}

By the way, I had to write the first word as who" because WordData won't do it for me. If you ask for

 ipa["Who"]

Mathematica will return

 Missing["NotAvailable"]

though it works as expected if you send it who" rather than Who."

Let's remove the stress marks and join the words together so we can compare the Python and Mathematica output. The top line is from Python and the bottom is from Mathematica.

 hu wd no nt v rt mst lrn aekt aend n tek hz iz hu wd no nt v rt mst ln aekt aend n tek hz iz

There are a few differences, summarized in the table below. Since the symbols are a little difficult to tell apart, I've included their Unicode code points.

 |-------+------------+-------------| | Word | Python | Mathematica | |-------+------------+-------------| | of |  (U+0259) |  (U+028C) | | must |  (U+0259) |  (U+028C) | | art |  (U+0251) |  (U+0252) | | learn |  (U+0259) |  (U+025D) | |-------+------------+-------------|

Mathematica makes some distinctions that Python missed.

Update: See the first comment below for variations on how the model sentence can be pronounced and how to get more distinct vowel sounds out of it.

Source	RSS or Atom Feed
Feed Location	http://feeds.feedburner.com/TheEndeavour?format=xml
Feed Title	John D. Cook
Feed Link	https://www.johndcook.com/blog