Transformational grammar


In linguistics, transformational grammar or transformational-generative grammar is part of the theory of generative grammar, especially of natural languages. It considers grammar to be a system of rules that generate exactly those combinations of words that form grammatical sentences in a given language and involves the use of defined operations to produce new sentences from existing ones.
This mechanism was first introduced to general linguistics by the structural linguist Louis Hjelmslev, son of the mathematician Johannes Hjelmslev who invented the Hjelmslev transformation. A modification which separated discourse and semantics from the syntactic model was subsequently made by Zellig Harris, giving rise to what became known as transformational generative grammar. The full Hjelmslevian conception, in contrast, is incorporated into functional grammar.

Deep structure and surface structure

's 1965 book Aspects of the Theory of Syntax developed the idea that each sentence in a language has two levels of representation: a deep structure and a surface structure. The deep structure represents the core semantic relations of a sentence and is mapped onto the surface structure, which follows the phonological form of the sentence very closely, via transformations. The concept of transformations had been proposed before the development of deep structure to increase the mathematical and descriptive power of context-free grammars. Deep structure was developed largely for technical reasons related to early semantic theory. Chomsky emphasized the importance of modern formal mathematical devices in the development of grammatical theory:

Formal definition

Chomsky's advisor, Zellig Harris, took transformations to be relations between sentences such as "I finally met this talkshow host you always detested" and simpler sentences "I finally met this talkshow host" and "You always detested this talkshow host." Chomsky developed a formal theory of grammar in which transformations manipulated not only the surface strings but also the parse tree associated with them, making transformational grammar a system of tree automata. A transformational-generative grammar thus involved two types of productive rules: phrase structure rules, such as "S → NP VP" etc., which could be used to generate grammatical sentences with associated parse trees ; and transformational rules, such as rules for converting statements to questions or active to passive voice, which acted on the phrase markers to produce other grammatically correct sentences.
In this context, transformational rules are not strictly necessary for the purpose of generating the set of grammatical sentences in a language, since that can be done using phrase structure rules alone, but the use of transformations provides economy in some cases, and it also provides a way of representing the grammatical relations that exist between sentences, which would not otherwise be reflected in a system with phrase structure rules alone.
This notion of transformation proved adequate for subsequent versions including the "extended," "revised extended," and Government-Binding versions of generative grammar, but it may no longer be sufficient for the current minimalist grammar, as merge may require a formal definition that goes beyond the tree manipulation characteristic of Move α.

Development of basic concepts

Though transformations continue to be important in Chomsky's current theories, he has now abandoned the original notion of deep structure and surface structure. Initially, two additional levels of representation were introduced—logical form and phonetic form —but in the 1990s, Chomsky sketched a new program of research known as first as Minimalism, in which deep structure and surface structure are no longer featured and PF and LF remain as the only levels of representation.
To complicate the understanding of the development of Chomsky's theories, the precise meanings of deep structure and surface structure have changed over time. By the 1970s, Chomskyan linguists normally called them D-Structure and S-Structure. In particular, Chomskyan linguists dropped for good the idea that a sentence's deep structure determined its meaning when LF took over this role.

Innate linguistic knowledge

Using a term such as "transformation" may give the impression that theories of transformational generative grammar are intended as a model of the processes by which the human mind constructs and understands sentences, but Chomsky clearly stated that a generative grammar models only the knowledge that underlies the human ability to speak and understand, arguing that because most of that knowledge is innate, a baby can have a large body of knowledge about the structure of language in general and so need to learn only the idiosyncratic features of the language to which it is exposed.
Chomsky is not the first person to suggest that all languages have certain fundamental things in common. He quoted philosophers who posited the same basic idea several centuries ago. But Chomsky helped make the innateness theory respectable after a period dominated by more behaviorist attitudes towards language. He made concrete and technically sophisticated proposals about the structure of language as well as important proposals about how grammatical theories' success should be evaluated.

Grammatical theories

In the 1960s, Chomsky introduced two central ideas relevant to the construction and evaluation of grammatical theories. One was the distinction between competence and performance. Chomsky noted the obvious fact that when people speak in the real world, they often make linguistic errors, such as starting a sentence and then abandoning it midway through. He argued that such errors in linguistic performance are irrelevant to the study of linguistic competence, the knowledge that allows people to construct and understand grammatical sentences. Consequently, the linguist can study an idealised version of language, which greatly simplifies linguistic analysis.
The other idea related directly to evaluation of theories of grammar. Chomsky distinguished between grammars that achieve descriptive adequacy and those that go further and achieve explanatory adequacy. A descriptively adequate grammar for a particular language defines the set of grammatical sentences in that language; that is, it describes the language in its entirety. A grammar that achieves explanatory adequacy has the additional property that it gives insight into the mind's underlying linguistic structures. In other words, it does not merely describe the grammar of a language, but makes predictions about how linguistic knowledge is mentally represented. For Chomsky, the nature of such mental representations is largely innate and so if a grammatical theory has explanatory adequacy, it must be able to explain different languages' grammatical nuances as relatively minor variations in the universal pattern of human language.
Chomsky argued that even though linguists were still a long way from constructing descriptively adequate grammars, progress in descriptive adequacy would come only if linguists held explanatory adequacy as their goal: real insight into individual languages' structure can be gained only by comparative study of a wide range of languages, on the assumption that they are all cut from the same cloth.

"I-language" and "E-language"

In 1986, Chomsky proposed a distinction between I-language and E-language that is similar but not identical to the competence/performance distinction. "I-language" is internal language; "E-language" is external language. I-language is taken to be the object of study in linguistic theory; it is the mentally represented linguistic knowledge a native speaker of a language has and thus a mental object. From that perspective, most of theoretical linguistics is a branch of psychology. E-language encompasses all other notions of what a language is, such as a body of knowledge or behavioural habits shared by a community. Thus E-language is not a coherent concept by itself, and Chomsky argues that such notions of language are not useful in the study of innate linguistic knowledge or competence even though they may seem sensible and intuitive and useful in other areas of study. Competence, he argues, can be studied only if languages are treated as mental objects.

Grammaticality

Chomsky argued that "grammatical" and "ungrammatical" can be meaningfully and usefully defined. In contrast, an extreme behaviorist linguist would argue that language can be studied only through recordings or transcriptions of actual speech and that the role of the linguist is to look for patterns in such observed speech, not to hypothesize about why such patterns might occur or to label particular utterances grammatical or ungrammatical. Few linguists in the 1950s actually took such an extreme position, but Chomsky was on the opposite extreme, defining grammaticality in an unusually mentalistic way for the time. He argued that the intuition of a native speaker is enough to define the grammaticality of a sentence; that is, if a particular string of English words elicits a double-take or a feeling of wrongness in a native English speaker, with various extraneous factors affecting intuitions controlled for, it can be said that the string of words is ungrammatical. That, according to Chomsky, is entirely distinct from the question of whether a sentence is meaningful or can be understood. It is possible for a sentence to be both grammatical and meaningless, as in Chomsky's famous example, "colorless green ideas sleep furiously". But such sentences manifest a linguistic problem that is distinct from that posed by meaningful but ungrammatical -sentences such as "man the bit sandwich the", the meaning of which is fairly clear, but which no native speaker would accept as well-formed.
The use of such intuitive judgments permitted generative syntacticians to base their research on a methodology in which studying language through a corpus of observed speech became downplayed since the grammatical properties of constructed sentences were considered appropriate data on which to build a grammatical model.

Minimalist program

From the mid-1990s onward, much research in transformational grammar has been inspired by Chomsky's minimalist program. It aims to further develop ideas involving economy of derivation and economy of representation, which had started to become significant in the early 1990s but were still rather peripheral aspects of transformational-generative grammar theory:
Both notions, as described here, are somewhat vague, and their precise formulation is controversial. An additional aspect of minimalist thought is the idea that the derivation of syntactic structures should be uniform: rules should not be stipulated as applying at arbitrary points in a derivation but instead apply throughout derivations. Minimalist approaches to phrase structure have resulted in "Bare Phrase Structure," an attempt to eliminate X-bar theory. In 1998, Chomsky suggested that derivations proceed in phases. The distinction between deep structure and surface structure is not present in Minimalist theories of syntax, and the most recent phase-based theories also eliminate LF and PF as unitary levels of representation.

Mathematical representation

An important feature of all transformational grammars is that they are more powerful than context-free grammars. Chomsky formalized this idea in the Chomsky hierarchy. He argued that it is impossible to describe the structure of natural languages with context-free grammars. His general position on the non-context-freeness of natural language has held up since then, though his specific examples of the inadequacy of CFGs in terms of their weak generative capacity were disproved.

[|Transformations]

The usual usage of the term "transformation" in linguistics refers to a rule that takes an input, typically called the deep structure or D-structure, and changes it in some restricted way to result in a surface structure. In TG, phrase structure rules generate deep structures. For example, a typical transformation in TG is subject-auxiliary inversion. That rule takes as its input a declarative sentence with an auxiliary, such as "John has eaten all the heirloom tomatoes", and transforms it into "Has John eaten all the heirloom tomatoes?" In the original formulation, those rules were stated as rules that held over strings off terminals, constituent symbols or both.
In the 1970s, by the time of the Extended Standard Theory, following Joseph Emonds's work on structure preservation, transformations came to be viewed as holding over trees. By the end of government and binding theory, in the late 1980s, transformations were no longer structure-changing operations at all; instead, they add information to already existing trees by copying constituents.
The earliest conceptions of transformations were that they were construction-specific devices. For example, there was a transformation that turned active sentences into passive ones. A different transformation raised embedded subjects into main clause subject position in sentences such as "John seems to have gone", and a third reordered arguments in the dative alternation. With the shift from rules to principles and constraints that was found in the 1970s, those construction-specific transformations morphed into general rules, which eventually changed into the single general rule move alpha or Move.
Transformations actually come in two types: the post-deep structure kind mentioned above, which are string- or structure-changing, and generalized transformations. GTs were originally proposed in the earliest forms of generative grammar. They take small structures, either atomic or generated by other rules, and combine them. For example, the generalized transformation of embedding would take the kernel "Dave said X" and the kernel "Dan likes smoking" and combine them into "Dave said Dan likes smoking." GTs are thus structure-building rather than structure-changing. In the Extended Standard Theory and government and binding theory, GTs were abandoned in favor of recursive phrase structure rules, but they are still present in tree-adjoining grammar as the Substitution and Adjunction operations, and have recently reemerged in mainstream generative grammar in Minimalism, as the operations Merge and Move.
In generative phonology, another form of transformation is the phonological rule, which describes a mapping between an underlying representation and the surface form that is articulated during natural speech.