3 Brackets and Trees

Learning Objectives

By the end of this chapter, you should be able to,

  • understand and utilize three different ways to represent syntactic information
  • appreciate the relationship between constituency and representation (phrase structure rules, brackets, and trees)
  • understand the relationship between meaning and structure, and what ambiguity tells us about that relationship

Starting Small

The constituency tests that we performed in the last chapter allowed us to see that every sentence is made up of smaller units of the sentence. Importantly, each constituent forms its own “semantic” unit as well. This is sometimes difficult to understand, and even harder to demonstrate, so we’ll start first by looking at a somewhat “smaller” problem, and work up to full sentences.

English (like all Germanic languages) has a productive process through which new words can be formed that involves putting two words together. The resulting words are called compounds.

Compounding in English
rice pot rice-pot
swan boat swan-boat
phone case phone-case

This process is, theoretically, infinitely recursive, meaning that we can continue to make new words from existing words.

More compounding in English
rice-pot rack rice-pot-rack
swan-boat jacket swan-boat-jacket
phone-case store phone-case-store

We can think of each piece of a compound as a constituent. Rice-pot has two pieces of meaning,  rice and pot. When we put them together, we make something whose meaning is the combination of both. This idea is represented in terms of bracketing structure.

\ex. \a. \textit{rice-pot} : [ [ rice ] [ pot ] ] \b.\textit{swan-boat} : [ [ swan ] [ boat ] ]

Inside each pair of brackets is one meaning “unit.” So rice-pot involves three pairs of brackets: one around rice, one around pot and one around the compound rice-pot. Same with swan-boat.

This way of representation allows us to iteratively represent more complex compounds, as in (2).

\setcounter{ExNo}{1}  \ex. \a. \textit{rice-pot-rack} : [ [ [ rice ] [ pot ] ] [ rack ] ] \b. \textit{swan-boat-jacket} : [ [ [ swan ] [ boat ] ] [ jacket ] ]

When we build larger compounds, we are creating a meaningful unit, and then adding to it. So first I create the word rice-pot, and it has a meaning like, “a pot for rice.” Then I create the word rice-pot-rack, which has the meaning “a rack for a pot for rice.” In other words, the meaning of the entire word depends on the meaning of its parts.

Of course, the more complex our compounds become, the more difficult it is to read the bracketing structure. So another way to express the exact same information is by using a tree, as in (3).

Rendered by QuickLaTeX.com

Each node in the tree corresponds to one pair of brackets. Thus, trees and brackets provide the exact same amount of information, it’s just that trees do it in a visually more appealing way. But it’s always possible to state a bracketing structure as a tree, and vice versa.

I can actually make my representations slightly more informative by adding in category information. The following trees and bracketing structures give us all the preceding information, and they additionally tell us what the category of each constituent is.

Rendered by QuickLaTeX.com

Note that it’s not necessary to explicitly state the compound words that are formed at each junction, because I can just look lower in the tree to figure out what that word is. The reason it’s good to list the category, though, is that you can combine more than one category. We therefore want to know which category “projects,” that is, names the resulting word.

Rendered by QuickLaTeX.com

Brackets and trees (length: 3m 49s)

Scaling up to sentences

Just like each word in a compound is a unit of meaning, which combine to create bigger meanings, sentences also behave the same way. Each constituent contributes a piece of meaning. The constituents combine to create larger meanings, until the sentence is complete.

Our constituency tests are telling us how the pieces of meaning come together. Just like we know that the compound sweet potato chickpea salad is made up of two smaller compounds sweet potato and chickpea salad, sentences can be broken up into smaller chunks. For instance, we found that in the sentence The tired doctor slept, the string of words the tired doctor is a constituent. Thus, this piece of meaning is enclosed in brackets or corresponds to a single node in a tree.

Rendered by QuickLaTeX.com

And just like with compounds, we can also add in information about categories. The constituent the tired doctor is a Determiner Phrase (for reasons that we’ll discuss in chapter 5), and the whole thing is a SentenceEnglish (abbreviated SE).

Rendered by QuickLaTeX.com

Constituency, brackets, and trees (length: 2m 59s)

Note that the combination of words in a compound word results in a category label (N, A, V, D, …). This is because when you make a compound, you’re making another word. When you create constituents that are bigger than words, we called these phrases. Putting the and doctor together doesn’t make a new word, it makes a phrase—a DP.

Here comes the crucial leap to phrase structure grammars. The rules of a phrase structure grammar are the possible structures in a language. That is, the phrase structure grammars tell you what the possible bracketing structures and trees are in a language. Consider again our rules for English from chapter 2 (now with abbreviations).

A revised grammar fragment of English

Rule 1:  SE → DP V

Rule 2: DP → D N

Rule 3: DP → D A N

These phrase structure rules tell you how information is packaged in English. That is, they tell that the phrase [DP … ] precedes V. And they tell you that inside of DP you can have either [DP D N ] or [DP D A N ]. More generally, the three phrase structure rules say the following are the (only) possible trees and bracketing structures in English.

Rendered by QuickLaTeX.com

Phrase structure grammars are thus “instructions” for what trees/bracketing structures can be drawn in a language. Our fragment for English above is quite limited; it only generates a few trees/structures. We will expand on this fragment in the following chapter.

Representations (length: 44s)

 

Structural Ambiguity

What does the following compound mean?

\setcounter{ExNo}{12}  \ex. baby-tree-planter

Many of you will say that it means, “someone who plants baby trees.” And many of you will say that it means “a tree-planter who is a baby.” Both of these are right; it’s possible to get both readings of the compound.

If we wanted to represent each meaning, we would choose different structures. The “baby-trees” meaning would have the following representations.

Rendered by QuickLaTeX.com

The reason is that we want “baby tree” to be a unit of meaning, i.e., a constituent, because that meaning describes the kind of planter it is.

On the other hand, the “tree-planting-baby” meaning would have the following representation

Rendered by QuickLaTeX.com

The reason is that we want “tree-planter” to be a unit of meaning first, because ultimately we’re describing a kind of tree-planter.

This kind of ambiguity is called structural ambiguity, because we can represent the ambiguity structurally.

There is another kind of ambiguity called lexical ambiguity. The word bank is lexically ambiguous, because it means two different things (a financial institution and the side of a river). But it’s not structurally ambiguous because the two meanings do not correspond to different structures.

The take-away point here is that different meanings correspond to different structures. More abstractly, the different meanings are the result of packaging the information in different ways.

The same is true when we scale up to full trees. Consider the following (very famous) sentence in English.

\setcounter{ExNo}{15}  \ex. Mary saw the man with the binoculars.

This sentence is ambiguous as well. On the one hand, it can mean that Mary saw a man who was holding binoculars. On the other hand, it can mean that Mary used binoculars to see the man. In the first meaning with the  binoculars describes a property of the man. On the second meaning, with the binoculars describes how Mary is “seeing.”

Just like baby-tree-planter, we observe different meanings. We can ask then whether those meanings correspond to different structures. The way to answer this question is to apply constituency tests, because constituency tests tell us how the parts of a sentence fit together. For instance, let’s apply the substitution test. Specifically what I’m doing in this test is trying to see if with the binoculars must connect with the other words in a particular way, in particular, if it must modify man.

\setcounter{ExNo}{16}  \ex. Mary saw {\underline{the man with the binoculars}}. $\longrightarrow$ That's right, and John saw {\underline{him}}, too.  \ex. Mary saw {\underline{the man}} with the binoculars. $\longrightarrow$ That's right, and John saw {\underline{him}} with the telescope.

Notice first that both of these sentences are grammatical. At first, you might be tempted to think that we’ve done our test wrong, since the results of (17) and (18) suggest that there are two ways to do the constituency of this sentence. But in fact, if we applied all of our tests, we’d find that both underlined strings of words are possible constituents. What is crucial is that the constituent parsings in (17) and (18) correspond to a different meanings. This is the hallmark of a structural ambiguity: different meanings correlate with different structures.

Explicitly, what (17) and (18) shows is that once we’ve applied a test, then the ambiguity goes away. When we force [ man with the binoculars ] to be a constituent by replacing that string with him, the sentence can only mean that the man has the binoculars. In contrast, when we force [ the man ] to be a constituent by replacing just that string with him, then the sentence can only mean that Mary was using binoculars. This is because by applying the test, we’re forcing the sentence to have a certain structure, and therefore to have a certain meaning.

Again, the take-away message here is: different meanings have different structures—and vice versa.

Structural ambiguity (length: 1m 27s)

Before moving on, it’s important to recognize that trees are not syntax. Very often, students in an introductory syntax class come away with the impression that trees are the most important thing for a syntactician or that syntacticians “study” trees.

This is incorrect. There are many, many syntax papers, books, and analyses that don’t contain a single tree, and in fact there are many theories of syntax that largely eschew trees. Trees are simply representations of information. We use trees to visualize how information is packaged, but syntacticians don’t study trees. They are just tools, and like all tools, they’re good at doing certain things, and bad at doing other things.

However, if we draw a tree, then we must be precise. Just like y=2x2+5 is not the same as y=5x2+2, each structure that we create in syntax represents different information. Indeed, the math analogy is telling: we can represent y=5x2+2 in different ways, say, by listing all the possible values for x and their corresponding values for y, or by graphing the equations on a two dimensional plane. This is just like syntax: there are multiple ways to represent information (trees, brackets, listing sentences, PSGs). We can use any of them, but some are better in certain contexts.

Still, we the instructors are going to hold you to a high standard in your trees. Not because trees are important per se, but because a precise representation of language is important.

Where we’re going

With these representational tools, brackets and trees, we’re going to try to build a functional description of a language using phrase structure rules. What is important is that our description—that is the phrase structure grammar—accurately represents the language in question. To do this we simply use the tools we have: constituency and category diagnostics.

Things to remember

  • The relationship between brackets, trees, and phrase structure grammar
  • How constituency and structures are related
  • Structural ambiguity derives from different structural representations.

Advanced

One question that we should ask when developing a representation is, “How detailed should the information be?” For example, we’ve abstracted over categories so that now we have Verbs (S), Nouns (N), etc. But we can further subdivide each of these domains. For instance, we can divide verbs into transitive and intransitive. The former appear with a subject and an object, while the latter only appear with a subject. (See more discussion in Chapter 10.) Should we divide our rules that we abstractly represent transitive and intransitive verbs different? After all, they appear in different syntactic frames.

Or consider nouns. We are currently treating all nouns as equivalent, but that is clearly not the case. Some nouns are count nouns, meaning that is possible to “individuate” them: chair, table, student, country, …. Some nouns are mass nouns, meaning that they cannot be individuated int he same way: sand, water, skin, ….[1] For our purposes, it’s important to recognize that count and mass nouns have different distributions: one chair is grammatical but *one sand is not. Likewise, I see water is grammatical, but *I see chair is not. Note too that mass and count nouns have different morphological properties. Count nouns pluralize easily (chairs) while mass nouns do not (?sands).

Thus, based on distributional and morphological diagnostics, I could treat mass nouns as categorially distinct from count nouns. Should our representations reflect this division among categories? The answer is: it depends on what we’re looking at. If we want to make broad generalizations about English, Swahili, Arabic, Korean, etc, then this level of information isn’t necessary. But if we want to make a careful study of say, nouns in Arabic, or nouns in Swahili, etc, then we probably would have to represent the nouns differently depending on the differences that we find.

By analogy, when we study phonology, we are happy to use broad phonetic transcription. But of course narrow phonetic transcription is always possible and is in fact necessary when looking at particular particular sounds or environments.

\setcounter{ExNo}{18}  \ex. blank \a. Broad transcription: /\textipa{bl\ae}\textipa{Nk}/ \b. Narrow transcription /\textipa{"bl\~\ae}\textipa{P}/

The same principle applies to syntax (and morphology). We use a “broad” representation when we need to abstract away from certain irrelevant details. But a “narrow” representation is useful when we want to do a detailed investigation.


  1. As a rule of thumb, a noun is a mass noun if, when you divide it up, you infinitely get the same thing. So if I have a pile of sand and I divide it in half, I will have two piles of sand. This isn't true for, say, chairs. If I have a chair, and I divide it in half, then I don't have two chairs.
definition

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

The Science of Syntax Copyright © by John Gluckman is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book