In linguistics, selection denotes the ability of predicates to determine the semantic content of their arguments. Predicates select their arguments, which means they limit the semantic content of their arguments. One sometimes draws a distinction between types of selection; one acknowledges both s-selection and c-selection. Selection in general stands in contrast to subcategorization: predicates both select and subcategorize for their complement arguments, whereas they only select their subject arguments. Selection is a semantic concept, whereas subcategorization is a syntactic one. Selection is closely related to valency, a term used in other grammars than the Chomskian generative grammar, for a similar phenomenon.
Examples
The following pairs of sentences will illustrate the concept of selection: The # indicates semantic deviance. The predicateis wilting selects a subject argument that is a plant or is plant-like. Similarly, the predicate drank selects an object argument that is a liquid or is liquid-like. A building cannot normally be understood as wilting, just as a car cannot normally be interpreted as a liquid. The b-sentences are possible only given an unusual context that establishes appropriate metaphorical meaning. The deviance of the b-sentences is addressed in terms of selection. The selectional restrictions of the predicates is wilting and drank are violated. When a mismatch between a selector and a selected element triggers reinterpretation of the meaning of those elements, that process is referred to as coercion.
S-selection vs. c-selection
One sometimes encounters the terms s-selection and c-selection. The concept of c-selection overlaps to an extent with subcategorization. Predicates c-select the syntactic category of their complement arguments - e.g. noun, verb, adjective, etc. - i.e. they determine the syntactic category of their complements. In contrast, predicates s-select the semantic content of their arguments. Thus s-selection is a semantic concept, whereas c-selection is a syntactic one. When the term selection or selectional restrictions appears alone without the c- or s-, s-selection is usually understood. The b-sentences above do not contain violations of the c-selectional restrictions of the predicates is wilting and drank; they are, rather, well-formed from a syntactic point of view, for the arguments the building and a car satisfy the c-selectional restrictions of their respective predicates, these restrictions requiring their arguments to be nouns or noun phrases. Just the s-selectional restrictions of the predicates is wilting and drank are violated in the b-sentences.
C-selection vs. subcategorization
The concepts of c-selection and subcategorization overlap in meaning and use to a significant degree. If there is a difference between these concepts, it resides with the status of the subject argument. Traditionally, predicates are interpreted as NOT subcategorizing for their subject argument because the subject argument appears outside of the minimal VP containing the predicate. Predicates do, however, c-select their subject arguments, e.g. The predicate eats c-selects both its subject argument Fred and its object argument beans, but as far as subcategorization is concerned, eats subcategorizes for its object argument beans only. This difference between c-selection and subcategorization depends crucially on the understanding of subcategorization. An approach to subcategorization that sees predicates as subcategorizing for their subject arguments as well as for their object arguments will draw no distinction between c-selection and subcategorization; the two concepts are synonymous for such approaches.
Thematic relations
Selection can be closely associated with thematic relations. By limiting the semantic content of their arguments, predicates are determining the thematic relations/roles that their arguments bear.
Theories
Several linguistic theories make explicit use of selection. These include:
Operator grammar, which makes selection a central part of the theory.
Link grammar, which assigns a log-likelihood "cost" to each context a word can appear in, thus providing an explicit numeric estimate of the likelihood of a parse.