Wh-movement


In linguistics, wh-movement concerns rules of syntax involving the placement of interrogative words. In plain terms, it refers to an asymmetry between the syntactical arrangement of words or morphemes in a question and the form of answers to that question; specifically, the placement of the question word. An example in English is "What are you doing?", a response to which could be "I am editing Wikipedia."; in which the answer is at the end of the sentence but the question word is at the beginning.
Interrogative forms are known within English linguistics as wh-words because most question words in the English language start with a wh-; such as what, when, where, who, and why. In English, only one interrogative word does not begin with wh-, namely how. In languages with wh-movement, sentences or clauses with a wh-word show a special word order that places the wh-word at the front of the sentence or clause instead of the canonical position later in the sentence. Leaving the wh-word in its canonical position is called wh-in situ.
Wh-movement often results in a discontinuity, and in that regard, it is one of four widely acknowledged discontinuity types, the others being topicalization, scrambling, and extraposition. Wh-movement is found in many languages around the world, and of the various discontinuity types, wh-movement is the one that has been studied the most. It is observed in many of the world's languages, and plays a key role in the theories of long-distance dependencies.
Historically, the name wh-movement stems from early generative grammar and was a reference to the transformational analysis of the day in which the wh-expression appears in its canonical position at deep structure and then moves leftward from that position to a derived position at the front of the sentence/clause at surface structure. Although many theories of syntax do not use the mechanism of movement in the transformative sense, the term wh-movement is widely used to denote the phenomenon, even in theories that do not model long-distance dependencies as movement.
What needs to be considered is that wh-movement does not occur solely due to interrogative words. Wh-words are used to form questions, and can also occur in relative clauses. Wh-movement occurs from the existing EPP. There are three types of wh-expressions overall- Interrogative, Relative, and Pseudo-cleft wh-expressions.

Basic examples

The following examples of sentence pairs illustrate wh-movement: each example has the canonical word order of a declarative sentence in English; each sentence has undergone wh-movement, whereby the wh-word has been fronted in order to form a question. The relevant words are bolded.
wh-fronting of whom, which corresponds to the direct object Tesnière
Tom has been reading Tesnière.
Whom
has Tom been reading?

wh-fronting of what, which corresponds to the prepositional object syntax
She should stop talking about syntax.
What
should she stop talking about?

The examples in through illustrate that wh-fronting occurs when a constituent is questioned that appears to the right of the finite verb in the corresponding declarative sentence. Consider in this regard that when the subject is questioned, there is no obvious reason to assume that wh-fronting has occurred because the default position of the subject is clause-initial:
Despite the fact that such data provide no obvious reason to assume movement, some theories of syntax maintain a movement analysis in the interest of remaining consistent. They assume that the wh-subject has in fact moved up the syntactic hierarchy, although this movement is not apparent from the actual linear order of the words.
Wh-movement occurs in ex situ forms, where the structure moves from the base to the Spec C with the help of EPP.
Each one of the Wh-words is responsible for a certain description or target.
  1. Locative-> Where
  2. Temporal-> When
  3. Object-> Who, What
  4. Manner-> How
  5. Rationale-> Why, What

    Wh-expressions without wh-movement

Wh-movement typically occurs to form questions in English. There are, however, at least three kinds of questions in which wh-movement does not occur :
  1. Echo questions:
  2. Quiz questions or specific questions: ask for detailed specific information that the individual has encountered before
  3. Multiple questions in a single sentence:when there is already one wh-word at the front:
  4. Expected questions: Occur when new information is expected
  5. Reference questions: Asking for someone to repeat what already has been said/ paraphrasing
While wh-movement is the rule in English, other languages may leave wh-expressions in situ more often such as in Slavic Languages. In French for instance, wh-movement is often optional in certain matrix clauses.
The EPP plays a great role in the case of wh-movement. For those languages that leave wh-expressions in situ, we may see a pattern in that EPP does not exist in the structure itself- unlike ex situ where movement is allowed and the EPP is present.
Some example of languages that possess Wh-expressions without wh-movement are Chinese and Slavic languages- languages that are most commonly used as examples are Mandarin and Russian.
It also needs to be considered that in situ questions are different from wh-fronted questions as they follow two different paths:
1) Typically, in situ expressions result from no movement at all which tends to be morphologically or pragmatically conditioned
2) Wh-expressions/words are always moved

In subordinate clauses

The examples in the previous section have wh-movement occurring in main clauses. Wh-movement is not restricted to occurring in main clauses. It frequently appears in subordinate clauses, although its behavior in subordinate clauses differs in a key respect, viz. word order. The following two subsections consider wh-movement in indirect questions and relative clauses.

In indirect questions

In English, wh-movement occurs to form a question in both main and subordinate clauses. When the question is expressed with a main clause, it is a direct question. When the question is expressed with a subordinate clause, however, it is an indirect question. While wh-fronting occurs in both direct and indirect questions, there is a key word order difference that distinguishes between the two. This difference is illustrated with the following data:
The subscripts indicate a central word order difference across direct and indirect questions. Wh-fronting in main clauses typically results in V2 word order in English, meaning the finite verb appears in second position, as marked by the 2-subscript in the b-sentences. In indirect questions, however, V3 word order typically obtains, as marked by the 3-subscript in the c-sentences. Despite this systematic word order difference across direct and indirect questions, wh-fronting within the clause is occurring in both cases. Note as well that do-support is often needed in order to enable wh-fronting. Wh-fronting in main clauses is often reliant on subject–auxiliary inversion.

In relative clauses

The examples above all involve interrogative clauses. Wh-movement also occurs in relative clauses, however, which cannot be interpreted as questions. Many relative pronouns in English have the same form as the corresponding interrogative words. Relative clauses are subordinate clauses, so the characteristic V3 word order seen in indirect questions occurs:
The relative pronouns have fronted in the subordinate clauses of the b-examples, just like they are fronted in the indirect questions in the previous sections. The characteristic V3 word order is obligatory. If the V2 word of main clauses occurs, the sentence is bad, as the c-examples demonstrate.

Pied-piping

Many instances of wh-fronting involve pied-piping. Pied-piping occurs when a fronted wh-word pulls an entire encompassing phrase to the front of the clause with it, i.e. it "pied-pipes" the other words of the phrase with it to the front of the clause. The following two subsections consider both obligatory and optional pied-piping.

Obligatory pied-piping

Pied-piping is sometimes obligatory. That is, in order for a wh-expression to be fronted, an entire encompassing phrase must be fronted with it. The relevant phrase of pied-piping is underlined in the following examples:
These examples illustrate that pied-piping is often necessary when the wh-word is inside a noun phrase or adjective phrase. Pied-piping is motivated in part by the barriers and islands to extraction. When the wh-word appears underneath a blocking category or in an island, the entire encompassing phrase must be fronted. Pied-piping was first identified by John R. Ross in his 1967 dissertation.

Optional pied-piping

There are cases where pied-piping can be optional. In English, this occurs most notably with prepositional phrases. The wh-word is the object of a preposition. A formal register will pied-pipe the preposition, whereas more colloquial English prefers to leave the preposition in situ, e.g.
The c-examples are cases of preposition stranding, which is possible in English, but not allowed in many languages that are related to English. For instance, preposition stranding is largely absent from many of the other Germanic languages and it may be completely absent from the Romance languages. Prescriptive grammars often claim that preposition stranding should be avoided in English as well; however, in certain contexts pied-piping of prepositions in English may make a sentence feel artificial or stilted.

Extraction islands

In many cases, a Wh-expression can occur at the front of a sentence regardless of how far away its canonical location is, e.g.
The Wh-word whom is the direct object of the verb likes in each of these sentences. There appears to be no limit on the distance that can separate the fronted expression from its canonical position. In more technical terms, we can say that the dependency relation between the gap and its filler is unbounded in the sense that there is no upper bound on how deeply embedded within the given sentence the gap may appear.
However, there are cases in which this is not possible. Certain kinds of phrases do not seem to allow a gap. The phrases from which a Wh-word cannot be extracted are referred to as extraction islands or simply islands. The following subsections briefly consider seven types of islands: 1) adjunct islands, 2) Wh-islands, 3) subject islands, 4) left branch islands, 5) coordinate structure islands, 6) complex NP islands, and 7) non-bridge islands. These island types were all originally identified in Ross' seminal dissertation. The islands in the examples that follow are underlined in the a-sentences.

Adjunct islands

An adjunct island is a type of island formed from an adjunct clause. Wh-movement is not possible out of an adjunct clause. Adjunct clauses include clauses introduced by because, if, and when, as well as relative clauses. Some examples include:
Wh-movement fails in the b-sentences because the gap appears in an adjunct clause.

Wh-islands

A wh-island is created by an embedded sentence which is introduced by a Wh-word. Wh-islands are weaker than adjunct islands and violating them results in the sentence sounding ungrammatical to the native speaker.
The b-sentences are strongly marginal/unacceptable because one has attempted to extract an expression out of a wh-island.
The reason why this occurs is because both wh-words are part of a DP. It would not be possible to move the bottom wh-word to the top of the structure, as they would both interfere. In order to get a grammatical result, a proper wh-movement must occur. However, because the wh-word is taking up the Spec- C position, it is not possible to move the competing wh-word higher by skipping the higher DP as wh-movement is a cyclic process.

Subject islands

Wh-movement is not possible out of subjects, at least not in English. This is particularly true for subject clauses, and to a somewhat lesser extent out of subject phrases, e.g.
The important insight here is that wh-extraction out of object clauses and phrases is quite possible. There is therefore an asymmetry across subjects and objects with respect to wh-movement.

Left branch islands

Modifiers that would appear on a left branch under a noun cannot be extracted. The relevant constraint is known as the Left Branch Condition, and Ross is again credited with having discovered it. The left branch constraint captures the fact that possessive determiners and attributive adjectives in English and many related languages necessarily pied-pipe the entire noun phrase when they are fronted, e.g.
Extraction fails in the b-sentences because the extracted expression corresponds to a left-branch modifier of a noun. Left branch islands are cross-linguistically variable. While they exist in English, they are absent from many other languages, most notably, from the Slavic languages.

Coordinate structure islands

In coordination, extraction out of a conjunct of a coordinate structure is possible only if this extraction affects all the conjuncts of the coordinate structure equally. The relevant constraint is known as the coordinate structure constraint. Extraction must extract the same syntactic expression out of each of the conjuncts simultaneously. This sort of extraction is said to occur across the board, e.g.
Wh-extraction out of a conjunct of a coordinate structure is only possible if it can be interpreted as occurring equally out all the conjuncts simultaneously, that is, if it occurs across the board.

Complex noun phrase islands

Extraction is difficult from out of a noun phrase. The relevant constraint is known as the complex NP constraint, and comes in two varieties, the first banning extraction from the clausal complement of a noun, and the second banning extraction from a relative clause modifying a noun:
Sentential complement to a noun:
Relative clause:

Non-bridge-verb islands

Extraction out of object that-clauses serving as complements to verbs may show island-like behavior if the matrix verb is a non-bridge verb. Non-bridge verbs include manner-of-speaking verbs, such as whisper or shout, e.g.

Wh-Movement in Syntax Trees

Wh-Tree Explained

Syntax trees are visual breakdowns of sentences that include dominating heads for every segment in the tree itself. In the Wh-Movement, there are additional segments that are added- EPP and the Question Feature that represents a question sentence.
The Wh-movement is motivated by a Question Feature/EPP at C, which promotes movement of a Wh-word from the canonical base position to Spec-C. This movement could be considered as “Copy + Paste + Delete” movement as we are copying the interrogative word from the bottom, pasting it to Spec-C, and then deleting it from the bottom so that it solely remains at the top. Overall, the highest C will be the target position of the Wh-Raising.
The interrogatives that are used in the Wh-Movement do not all share headedness. This is important to consider when making the syntax trees, as there are three different heads that may be used
Headedness:
Determiner Phrase : Who, What
Prepositional Phrase : Where, When, Why
Adverb Phrase : How
When creating the Syntax Tree for the Wh-movement, consider the subject-aux inversion in the word that was raised from T to C.
The location of the EPP :
The EPP allows movement of the Wh-word from the bottom canonical position of the syntax tree to Spec C. The EPP is a great indicator when it comes to distinguishing between in-situ trees and ex-situ. Ex situ trees allow the movement to Spec C, while in situ do not as the head C lacks the EPP feature.
Islands in Syntax Trees:
Within Syntax trees, islands do not allow movement to occur- if movement is attempted, the sentence would then be perceived as ungrammatical to the native speaker of the observed language. Islands are typically noted as being a boxed node on the tree. The movement in the Wh-Island syntax tree is unable to occur because in order to move out of an embedded clause, a Determiner Phrase must move through the Spec C position. This cannot occur, as the Determiner Phrase is already occupied.
Wh-Island example,
“She said ?”. We see that “who” takes the place of DP and restricts “what” from rising up to the respected Spec C. Native speakers may confirm this as well as it will sound ungrammatical * ”What did she say ”.

Multiple Wh-Questions

In languages, a sentence can contain more than one wh-question. These interrogative constructions are called multiple wh-questions,
e.g: Who ate what at the restaurant?
In the following English example, a strikeout-line and trace-movement co-indexation symbols - - are used to indicate the underlying raising-movement of the closest wh-phrase. This movement produces an overt sentence word order with one fronted wh-question:
e.g:
In the underlying syntax, the wh-phrase closest to Spec-CP is raised to satisfy selectional properties of the CP: the and feature requirements of C. The wh-phrase farther away from Spec-CP stays in its base position.

Superiority Condition

The superiority condition determines which wh-phrase moves in a clause that contains multiple wh-phrases. This is the outcome of applying the attract closest principle, where only the closest candidate is eligible for movement to the attracting head that selects for it. If the farther wh-phrase moves instead of the preceding wh-phrase, an ungrammatical structure is created. Not all languages have instances of multiple-wh movement governed by the superiority condition, most have variations. There is no uniformity found across languages concerning the superiority condition.
For example, see the following English phrases:
a.
b. *
The subscript "ti" or "i" are used to mark coreference. "t" represents a trace, while both "ti" and "i" represent that the words refer to each other and the same entity.
In a., the closer wh-phrase moves up towards Spec-CP from being the subject of the VP . The second wh-phrase remains in-situ. This is to satisfy the feature in the Spec-CP.
In b., the farther wh-phrase has incorrectly moved from the direct object position of the VP into the Spec-CP position. The closer wh-phrase to Spec-CP has remained in-situ as the subject of the VP. This sentence contains a violation of the attract closest principle, as the closest candidate was not moved, rather the farther candidate. This sentence is ungrammatical which is marked by the asterisk.

German

German does not show the expected effects of the superiority condition during clauses with multiple wh-phrases. German appears to have a process that allows the farther wh-phrase to "cross-over" the closer wh-phrase and move, not remaining in-situ. This movement is tolerated and has less consequences than when compared with English.
For example, see the following German phrases:
a.
Ichweissnicht,werwasgesehenhat
Iknownot,whowhatseenhas

b.
Ichweissnicht,waswergesehenhat
Iknownot,whatwhoseenhas

In a., the gloss shows that the wh-phrase has "crossed over" wh-phrase and is now in Spec,CP to satisfy the feature. This movement is a violation of the attract closest principle, which is what the superiority condition is based upon.

Mandarin Chinese

is a wh-in-situ language, which means that it does not exhibit wh-movement in constituent questions. In other words, wh-words in Mandarin remain at the end of the sentence, contrasting with wh-movement in English where the wh-word would move in constituent questions.

In-situ

The following example illustrates multiple wh-movement in Mandarin, and is written in pinyin for the sake of simplicity and clarity:
This example demonstrates that the wh-word "what" in Mandarin remains in-situ at Surface structure, while the wh-word "why" in Mandarin moves to proper scope position and, in doing so, c-commands the wh-word that stays in-situ.

Matrix scope

The scope of wh-questions in Mandarin is also subject to other conditions depending on the kind of wh-phrase involved. The following example can translate into two meanings:
This example illustrates the way certain wh-words such as "who" and "what" can freely obtain matrix scope in Mandarin.

Attract Closest

In reference to the Attract Closest principle, where the head adopts the closest candidate available to it, the overt wh-phrase in Mandarin moves to proper scope position while the other wh-phrase stays in-situ as it is c-commanded by the wh-phrase first mentioned. This can be seen in the following example, where the word for "what" stays in-situ since it is c-commanded by the phrase in Mandarin meaning "at where":
As these examples show, Mandarin is a wh-in-situ language, exhibits no movement of wh-phrases at Surface structure, is subject to other conditions based on the type of wh-phrase involved in the question, and adheres to the Attract Closest principle.

Bulgarian

ln Bulgarian, the feature of C motivates multiple Wh-word movements, which leads to multiple specifiers. It requires formation of clusters of wh-phrases in in the matrix clause. This is different from English because in English, only one Wh-word moves to when there is multiple wh-words in a clause. This is because in Bulgarian, unlike English, all movements of wh-elements take place in the syntax, where movement is shown overtly. The phrase structure for Wh-words in Bulgarian would look like is shown in Figure 1 below, where a Wh-cluster is formed under .
In Bulgarian and Romanian, a Wh-element is attracted into and the other Wh-elements are adjoined into the first Wh-word in .
In Example 1, we see that the both Wh-words underwent movement and are in a cluster.

Attract Closest

The Attract Closest is a principle of the Superiority Condition where the head which attracts a certain feature adopts the closest candidate available to it. This usually leads to the movement of the closest candidate.
Slavic languages are grouped in to two different S-structures concerning the movement of Wh-elements at . One group includes the languages: Serbo-Croatian, Polish, and Czech where there is only one Wh-element in at S-structure. The other group contains Bulgarian and Romanian which have all of their Wh-elements in at S-structure. In the first group mentioned, the Attract Closest principle is present and the Wh-word that is closest to the attracting head undergoes movement while the rest of the Wh-elements remain in-situ. The second group of languages, the Attract Closest principle occurs in a slightly different way. The orderof the way the Wh-word moves is dictated by their proximity to . The closest Wh-word to the attracting head undergoes movement first and the next closest one follows suit, and on and on. In that way the Superiority effect is present in Serbo-Croation, Polish, and Czech in the first Wh-element, while in Bulgarian and Romanian, it is present in all of the Wh-elements in the clause.
The Attract Closest principle explains a crucial detail about the order of which Wh-words move first in the tree. Since the closest Wh-word is moved first, there is a particular order that appears. Wh-Subjects goes before Wh-objects and Wh-adjuncts. This is seen in Example #2 and Example #3. Example #3 also shows that there can be more than two Wh-words in and that no matter how many Wh-words are in the clause they would all have to undergo movement.

In-situ

In Bulgarian, we see in Example #4, that to defer from forming a sequence of the same Wh-words, a Wh-element is allowed to remain in-situ as a last resort.
In summary, Bulgarian has Multiple Wh-Movement in the syntax and the Wh-words move overtly. We also see that while all Wh-words in a clause moves under because of the feature, there is still a certain order in how they are appear in the clause.

French

In French, multiple wh-questions have the following patterns:
a) In some French interrogative sentences, wh-movement can be optional.
1.The closest wh-phrase to Spec CP can be fronted ;
2. Alternatively, wh-phrases can remain in situ.
In the example sentences above, examples and are both grammatical and share the same meaning in French. Here the choice of using one form of question over the other is optional; either sentence can be used to ask about the two particular DP constituents expressed by two wh-words. In French, the second sentence could also be used as an echo question. ' By contrast, in English the grammatical structure of the second sentence is only acceptable as an echo question: a question we ask to clarify the information we hear in someone's utterance, or that we use to express our shock or disbelief in reaction to a statement made by someone. For echo questions in English, it is typical for speakers to emphasize the wh-words prosodically by using rising intonation. These special instances of using multiple wh-questions in English are essentially "requests for the repetition of that utterance".
b) In other French interrogative sentences, wh-movement is required.
The option of using wh-in-situ in French sentences with multiple wh-questions is limited to specific conditions. There exists "a very limited distribution" of its usage.
French wh-in-situ can occur only:
  1. in matrix clauses
  2. in matrix clauses that do not have an overt complementizer
  3. in 'short-distance' questions
Wh-in situ usage is not allowed in French when these criteria are not met.
Wh-in-situ is not allowed:
  1. in embedded questions
2. in questions with overt complementizers



3.
in 'long-distance' questions'''


To summarize, in French sentences with multiple wh-questions, the choice between wh-movement and wh-in-situ is not arbitrary; it is constrained by specific conditions.

In other languages

Wh-movement is also found in many other languages around the world. Most European languages also place wh-words at the beginning of a clause. Furthermore, many of the facts illustrated above are also valid for other languages. The systematic difference in word order across main wh-clauses and subordinate wh-clauses shows up in other languages in varying forms. The islands to wh-extraction are also present in other languages, but there will be some variation. The following example illustrates wh-movement of an object in Spanish:
The following examples illustrates wh-movement of an object in German:
The following examples illustrate wh-movement an object in French:
All the examples are quite similar to the English examples and demonstrate that wh-movement is a general phenomenon in numerous languages. As stated however, the behavior of wh-movement can vary, depending on the individual language in question.

Languages in which it is not present

Many languages do not have wh-movement. Instead, these languages keep the symmetry of the question and answer sentences.
For example, topic questions in Chinese have the same sentence structure as their answers:
'? []
  • Literal: You do what?
  • Translation: What are you doing?
The response to which could be:
' 。 []
It needs to be considered that Chinese in fact have a wh-particle. However, there is no wh-movement.

Theoretical approaches

Wh-movement typically results in a discontinuity: the "moved" constituent ends up in a position that is separated from its canonical position by material that syntactically dominates the canonical position, which means there seems to be a discontinuous constituent and a long distance dependency present. Such discontinuities challenge any theory of syntax, and any theory of syntax is going to have a component that can address these discontinuities. In this regard, theories of syntax tend to explain discontinuities in one of two ways, either via movement or via feature passing. The EPP feature and Question Feature play a large role in the movement itself. We have noticed that these two features occur in ex situ questions which allow movement and do not exist in in situ questions that do allow it.
Theories that posit movement have a long and established tradition that reaches back to early Generative Grammar. They assume that the displaced constituent is first generated in its canonical position at some level or point in the structure generating process below the surface. This expression is then moved or copied out of this base position and placed in its surface position where it actually appears in speech. Movement is indicated in tree structures using one of a variety of means.
The alternative to the movement approach to wh-movement and discontinuities in general is feature passing. This approach rejects the notion that movement in any sense has occurred. The wh-expression is base generated in its surface position, and instead of movement, information passing occurs up or down the syntactic hierarchy to and from the position of the gap.