X-bar theory


In linguistics, X-bar theory is a theory of syntactic category formation. It embodies two independent claims: one, that phrases may contain intermediate constituents projected from a head X; and two, that this system of projected constituency may be common to more than one category.
The letter X is used to signify an arbitrary lexical category ; when analyzing a specific utterance, specific categories are assigned. Thus, the X may become an N for noun, a V for verb, an A for adjective, or a P for preposition.
The term X-bar is derived from the notation representing this structure. Certain structures are represented by X. Because this may be difficult to typeset, this is often written as X′, using the prime symbol or with superscript numerals as exponents, e.g., X1. In English, however, this is still read as "X bar". The notation XP stands for X phrase, and is at the equivalent level of X-bar-bar, written X″ or X2, usually read aloud as X double bar.
X-bar theory was first proposed by Noam Chomsky, building on Zellig Harris's 1951 approach to categories, and further developed by Ray Jackendoff. X-bar theory was incorporated into both transformational and nontransformational theories of syntax, including GB, GPSG, LFG, and HPSG. Recent work in the Minimalist Program has largely abandoned X-bar schemata in favor of bare phrase structure approaches.

Core concepts

There are three "syntax assembly" rules which form the basis of X-bar theory. These rules can be expressed in English, as immediate dominance rules for natural language, or visually as parse trees. All three representations are presented below.
1. An X Phrase consists of an optional specifier and an X-bar, in any order:
XP →, X′

XP XP
/ \ or / \
spec X' X' spec

2. One kind of X-bar consists of an X-bar and an adjunct, in either order:

Not all XPs contain X′s with adjuncts, so this rewrite rule is "optional".

X' X'
/ \ or / \
X' adjunct adjunct X'

3. Another kind of X-bar consists of an X and any number of complements, in any order:
X′ → X,

X' X'
/ \ or / \
X complement complement X

How the rules combine

The following diagram illustrates one way the rules might be combined to form a generic XP structure. Because the rules are recursive, there is an infinite number of possible structures that could be generated, including smaller trees that omit optional parts, structures with multiple complements, and additional layers of XPs and X′s of various types.

XP
/ \
spec X'
/ \
X' adjunct
/ \
X complement
|
head

Because all of the rules allow combination in any order, the left-right position of the branches at any point may be reversed from what is shown in the example. However, in any given language, usually only one handedness for each rule is observed. The above example maps naturally onto the left-to-right phrase order used in English.
Note that a complement-containing X' may be distinguished from an adjunct-containing X' by the fact that the complement has an X as a sibling, whereas an adjunct has X-bar as a sibling.

A simple noun phrase

The noun phrase "the cat" might be rendered like this:

NP
/ \
Det N'
| |
the N
|
cat

The word the is a determiner, which at first was believed to be a type of specifier for nouns. The head is the determiner which projects into a determiner phrase. The word cat is the noun phrase which acts as the complement of the determiner phrase. More recently, it has been suggested that D is the head of the noun phrase.
Note that branches with empty specifiers, adjuncts, complements, and heads are often omitted, to reduce visual clutter. The DetP and NP above have no adjuncts or complements, so they end up being very linear.
In English, specifiers precede the X-bar that contains the head. Thus, determiners always precede their nouns if they are in the same noun phrase. Other languages use different word order.

A full sentence

For more complex utterances, different theories of grammar assign X-bar theory elements to phrase types in different ways. Consider the sentence He studies linguistics at the university. A transformational grammar theory might parse this sentence as the following diagram shows:
The "IP" is an inflectional phrase, consists of a specifier and an Ī. Its specifier is the noun phrase which acts as the subject of the sentence. The complement of the Ī is the predicate of the sentence, a verb phrase. There is no word in the sentence which explicitly acts as the head of the Ī, but this slot is usually considered to contain the unspoken "present tense" implied by the tense marker on the verb "studies".
A head-driven phrase structure grammar might parse this sentence differently. In this theory, the sentence is modeled as a verb phrase. The noun phrase that is the subject of the sentence is located in the specifier of the verb phrase. The predicate parses the same way in both theories.

Substitution test

Evidence for the existence of X-bars may be provided by the various possibilities of substitution. For example, to the above sentence He studies linguistics at the university, someone may reply She does, also. Here the word does stands for the entire phrase studies linguistics at the university. However, had the reply been And she does at night-school, the word does would stand for just studies linguistics. This implies that significant constituents containing the verb exist at two levels; the constituent at the higher level here is named a verb phrase, and that at the lower level a V-bar.

Reduction

In 1981, Tim Stowell tried to derive X-bar theory from more general principles in his MIT thesis Origins of phrase structure, a pathbreaking but ultimately unsuccessful effort according to Andras Kornai and Geoffrey Pullum.
In the mid 1990s, there were two major attempts to deduce versions of X-bar theory from independent principles. Richard Kayne's theory of Antisymmetry derived X-bar theory from the assumption that there was a tight relation between structure and linear order; and Noam Chomsky's paper Bare Phrase Structure attempted to eliminate labelling from syntax and deduce their effects from other principles of the grammar.

Quantity of sentence structure

Theories of syntax that build on the X-bar schema tend to posit a large amount of sentence structure. The constituency-based, binary branching structures of the X-bar schema increase the number of nodes in the parse tree to the upper limits of what is possible. The result is highly layered trees that acknowledge as many syntactic constituents as possible. The number of potential discontinuities increases, which increases the role of movement up the tree or feature passing up and down the tree. The analysis of phenomena such as inversion and shifting becomes more complex because these phenomena will necessarily involve discontinuities and thus necessitate movement or feature passing. Whether the large amount of sentence structure associated with X-bar schemata is necessary or beneficial is a matter of debate.

Endocentric structures only

When the X-bar schema was introduced and generally adopted into generative grammar in the 1970s, it was replacing a view of syntax that allowed for exocentric structures with one that views all sentence structure as endocentric. In other words, all phrasal units necessarily have a head in the X-bar schema, unlike the traditional binary division of the sentence into a subject noun phrase and a predicate verb phrase , which was an exocentric division. In this regard, the X-bar schema was taking generative grammar one step toward a dependency-based theory of syntax, since dependency-based structures are incapable of acknowledging exocentric divisions. At the same time, the X-bar schema was taking generative grammar two steps away from a dependency-based understanding of syntactic structure insofar as it was allowing for an explosion in the amount of syntactic structure that the theory can posit. Dependency-based structures, in contrast, necessarily restrict the amount of sentence structure to an absolute minimum.