Romance linguistics
Romance linguistics is the study of linguistics of Romance languages.
Basic features
Romance languages have a number of shared features across all languages:- Romance languages are moderately inflecting, i.e. there is a moderately complex system of affixes that are attached to words to convey grammatical information such as number, gender, person, tense, etc. Verbs have much more inflection than nouns. The amount of synthesis is significantly more than English, but less than Classical Latin and much less than the oldest Indo-European languages. Inflection is fusional, with a single affix representing multiple features. For example, Portuguese amei "I loved" is composed of am- "love" and the fusional suffix -ei "first-person singular preterite indicative".
- Romance languages have a primarily subject–verb–object word order, with varying degrees of flexibility from one language to another. Constructions are predominantly of the head-first type. Adjectives, genitives and relative clauses all tend to follow their head noun, although determiners usually precede.
- In general, nouns, adjectives and determiners inflect only according to grammatical gender and grammatical number. Grammatical case is marked only on pronouns, as in English; case marking, as in English, is of the nominative–accusative type. A significant exception, however, is Romanian, with two-case marking on nominal elements.
- Verbs are inflected according to a complex morphology that marks person, number, tense, mood, and sometimes aspect or gender. Grammatical voice and some grammatical aspects are expressed using periphrastic constructions, as in the Italian present perfect io ho amato/io sono stato amato "I have loved/I have been loved".
- Most Romance languages are null subject languages.
- All Romance languages have two articles, and many have in addition a partitive article. In some languages, the use of an article with a noun is nearly obligatory; it serves to express grammatical number and to cope with the extreme homophony of French vocabulary as a result of extensive sound reductions.
- The phonemic inventory of most Romance languages is of moderate size with few unusual phonemes. Phonemic vowel length is uncommon. Some languages have developed nasal vowels or front rounded vowels.
- Word accent is of the stress type, rather than making use of pitch. Stress occurs more or less predictably on one of the last three syllables.
Changes from Classical Latin
Loss of the case system
The most significant changes between Classical Latin and Proto-Romance relate to the reduction or loss of the Latin case system, and the corresponding syntactic changes that were triggered.The case system was drastically reduced from the six-case system of Classical Latin. Although five cases can be reconstructed for Vulgar Latin nouns, for Proto-Romance this had been reduced to three: nominative, accusative-ablative, and genitive-dative. This system is preserved best in pronouns. In the West, the genitive-dative disappeared with the genitive replaced by de + ablative and the dative by ad + accusative. This left only two cases: nominative and oblique. Some of the older Gallo-Romance languages preserved this two-case system well into the literary period, and in Ibero-Romance languages, such as Spanish and Portuguese, as well as in Italian, a couple of examples are found which preserve the old nominative. In the East, a genitive-dative made entirely of dative forms was retained but the nominative and accusative-ablative eventually merged.
Concomitant with the loss of cases, freedom of word order was greatly reduced. Classical Latin had a generally verb-final but overall quite free word order, with a significant amount of word scrambling and mixing of left-branching and right-branching constructions. The Romance languages eliminated word scrambling and nearly all left-branching constructions, with most languages developing a rigid SVO, right-branching syntax. Some freedom, however, is allowed in the placement of adjectives relative to their head noun. In addition, some languages have an "accusative preposition" along with clitic doubling, which allows for some freedom in ordering the arguments of a verb.
The Romance languages developed grammatical articles where Latin had none. Articles are often introduced around the time a robust case system falls apart in order to disambiguate the remaining case markers and to serve as parsing clues that signal the presence of a noun.
This was the pattern followed by the Romance languages: In the Romance languages that still preserved a functioning nominal case system, only the combination of article and case ending serves to uniquely identify number and case. All Romance languages have a definite article and an indefinite article. Many also have a partitive article.
Latin had a large number of syntactic constructions expressed through infinitives, participles, and similar nominal constructs. Examples are the ablative absolute, the accusative-plus-infinitive construction used for reported speech, gerundive constructions, and the common use of reduced relative clauses expressed through participles. All of these are replaced in the Romance languages by subordinate clauses expressed with finite verbs, making the Romance languages much more "verbal" and less "nominal" than Latin. Under the influence of the Balkan sprachbund, Romanian has progressed the furthest, largely eliminating the infinitive.
Other changes
- Loss of phonemic vowel length, and change into a free-stressed language. Classical Latin had an automatically determined stress on the second or third syllable from the end, conditioned by vowel length; once vowel length was neutralized, stress was no longer predictable so long as it remained where it was.
- Development of a series of palatal consonants as a result of palatalization.
- Loss of most traces of the neuter gender.
- Development of a series of analytic perfect tenses, comparable to English "I have done, I had done, I will have done".
- Loss of the Latin synthetic passive voice, replaced by an analytic construction comparable to English "it is/was done".
- Loss of deponent verbs, replaced by active-voice verbs.
- Replacement of the Latin future tense with a new tense formed by a periphrasis of infinitive + present tense of habēre "have", which usually contracts into a new synthetic tense. A corresponding conditional tense is formed in the same way but using one of the past-tense forms of habēre.
- Numerous lexical changes. A number of words were borrowed from the Germanic languages and Celtic languages. Many basic nouns and verbs, especially those that were short or had irregular morphology, were replaced by longer derived forms with regular morphology. Throughout the medieval period, words were borrowed from Classical Latin in their original form or in something approaching the original form, often replacing the popular forms of the same words.
Phonology
Vowels
Every language has a different set of vowels from every other. Common characteristics are as follows:- Most languages have at least five monophthongs. The parent language of most of the Italo-Western Romance languages actually had a seven-vowel system, which is kept in most Italo-Western languages. In some languages, like Spanish and Romanian, the phonemic status and difference between open-mid and close-mid vowels was lost. French has probably the largest inventory of monophthongs, with conservative varieties having 12 oral vowels and 4 nasal vowels. European Portuguese also has a large inventory, with 9 oral monophthongs, 5 nasal monophthongs, and a large number of oral and nasal diphthongs.
- Some languages have a large inventory of falling diphthongs. These may or may not be considered as phonemic units, depending on their behavior. As an example, French, Spanish and Italian have occasional instances of putative falling diphthongs formed from a vowel plus a non-syllabic or , but these are normally analyzed as sequences of vowel and glide. The diphthongs in Romanian, Portuguese, Catalan and Occitan, however, have various properties suggesting that they are better analyzed as unit phonemes. Portuguese, for example, has the diphthongs, where appear only in some dialects. All except appear frequently in verb or noun inflections.
- Among the major Romance languages, Portuguese and French have nasal vowel phonemes, stemming from nasalization before a nasal consonant followed by loss of the consonant. Originally, vowels in both languages were nasalized before all nasal consonants, but have subsequently become denasalized before nasal consonants that still remain. In Portuguese, nasal vowels are sometimes analyzed as phonemic sequences of oral vowels plus an underlying nasal consonant, but such an analysis is difficult in French because of the existence of minimal pairs such as bon "good ", bonne "good ". In both languages, there are fewer nasal than oral vowels. Nasalization triggered vowel lowering in French, producing the 4 nasal vowels . Vowel raising was triggered in Portuguese, however, producing the 5 nasal vowels. Vowel contraction and other changes also resulted in the Portuguese nasal diphthongs .
- Most languages have fewer vowels in unstressed syllables than stressed syllables. This again reflects the Italo-Western Romance parent language, which had a seven-vowel system in stressed syllables but only in unstressed syllables. Some languages have seen further reductions: e.g. Standard Catalan has only in unstressed syllables. In French, on the other hand, any vowel may take prosodic stress.
- Most languages have even fewer vowels in word-final unstressed syllables than elsewhere. For example, Old Italian allowed only, while the early stages of most Western Romance languages allowed only. The Gallo-Romance languages went even farther, deleting all final vowels except. Of these languages, French has carried things to the extreme by deleting all vowels after the accented syllable and uniformly accenting the final syllable. Modern Spanish now allows final unstressed, and modern Italian allows final unstressed, but they tend to occur largely in borrowed or onomatopoeic words, e.g. guru "guru", taxi "taxi", Spanish tribu "tribe" and espíritu "spirit", Italian babau "bogeyman". The apparent Spanish exception casi "almost" originates from Latin quasi "as if" < quam sī, and was probably influenced by si "if".
- Phonemic vowel length is uncommon. Vulgar Latin lost the phonemic vowel length of Classical Latin and replaced it with a non-phonemic length system where stressed vowels in open syllables were long, and all other vowels were short. Standard Italian still maintains this system, and it was rephonemicized in the Gallo-Romance languages as a result of the deletion of many final vowels. Some northern Italian languages still maintain this secondary phonemic length, but in most languages the new long vowels were either diphthongized or shortened again, in the process eliminating phonemic length. French is again the odd man out: Although it followed a normal Gallo-Romance path by diphthongizing five of the seven long vowels and shortening the remaining two, it phonemicized a third vowel length system around 1300 AD in syllables that had been closed with an , and now is phonemicizing a fourth system as a result of lengthening before final voiced fricatives.
- In modern spoken and literary Romanian, Slavic influences are evident in phonetics and morphology. Phonetic Slavicisms include the iotation of the initial e in words such as el, ea, este pronounced .
Consonants
Key:
- bold: Appears in all 5 languages.
- italic: Appears in 3–4 languages.
- : Appears in 2 languages.
- ): Appears in only 1 language.
- Spanish has no phonemic voiced fricatives. The equivalent of merged with, and all the rest became voiceless. It also lost, which became or in some other dialects.
- The western languages all used to have the affricates,,,. By the fourteenth century or so, these all turned into fricatives except for Spanish and dialectal Portuguese. Romanian likewise became.
- French, and most varieties of Spanish, have lost . Romanian merged both and into.
- Romanian was influenced by Slavic phonology, most notably in the palatalization of consonants in the plural form.
- Although all of the languages have or used to have, almost none of these sounds are cognate between pairs of languages. The only real exception is many between Italian and Romanian, stemming from Latin C- before E or I. Italian also has from Vulgar Latin -CY-, and from -TY- following a consonant. Former French is from Latin C- before A, either word-initial or following a consonant; Spanish is from Latin -CT-, or from PL, CL following a consonant; former Portuguese is from Latin PL, CL, FL, either word-initial or following a consonant.
- Italian and former Romanian are not cognate with former western .
Lexical stress
Erosion of unstressed syllables following the stress has caused most Spanish and Portuguese words to have either penultimate or ultimate stress: e.g. Latin trēdecim "thirteen" > Spanish trece, Portuguese treze; Latin amāre "to love" > Spanish/Portuguese amar. Most words with antepenultimate stress are learned borrowings from Latin, e.g. Spanish/Portuguese fábrica "factory". This process has gone even farther in French, with deletion of all post-stressed vowels, leading to consistent, predictable stress on the last syllable: e.g. Latin Stephanum "Stephen" > Old French Estievne > French Étienne ; Latin juvenis "young" > Old French juevne > French jeune. This applies even to borrowings: e.g. Latin fabrica > French borrowing fabrique .
Other than French, the position of the stressed syllable generally falls on one of the last three syllables. Exceptions may be caused by clitics or certain verb endings, e.g. Italian telefonano "they telephone"; Spanish entregándomelo "delivering it to me"; Italian mettiamocene "let's put some of it in there"; Portuguese dávamos-vo-lo "we were giving it to you". Stress on verbs is almost completely predictable in Spanish and Portuguese, but less so in Italian.
Nominal morphology
Nouns, adjectives, and pronouns can be marked for gender, number and case. Adjectives and pronouns must agree in all features with the noun they are bound to.Number
The Romance languages inherited from Latin two grammatical numbers, singular and plural; the only trace of a dual number comes from Latin ambō > Spanish and Portuguese ambos, Old Romanian îmbi > Romanian ambii, Old French ambe, Italian ambedue, entrambi.Gender
Most Romance languages have two grammatical genders, masculine and feminine. The gender of animate nouns is generally natural, but for nonanimate nouns it is arbitrary.Although Latin had a third gender, there is little trace of this in most languages. The biggest exception is Romanian, where there is a productive class of "neuter" nouns, which include the descendants of many Latin neuter nouns and which behave like masculines in the singular and feminines in the plural, both in the endings used and in the agreement of adjectives and pronouns. This behavior happens also in Italian with a restricted number of words. Another noteworthy exception is Asturian, which uses a neuter gender to refer to abstract, collective and uncountable entities. It appears in adjectives, possesives, one neuter article, one neuter pronoum and some nouns. Some of these neuter traces also feature in Spanish, such as ello.
Such nouns arose because of the identity of the Latin neuter singular -um with the masculine singular, and the identity of the Latin neuter plural -a with the feminine singular. A similar class exists in Italian, although it is no longer productive. A similar phenomenon may be observed in Albanian, and the category remains highly productive with a number of new words loaned or coined in the neuter hotel one hotel vs. hotele three hotels ).
Spanish also has vestiges of the neuter in the demonstrative adjectives: esto, eso, aquello, the pronoun ello and the article lo. Portuguese also has neuter demonstrative adjectives: "isto", "isso", "aquilo".
Remnants of the neuter, interpretable now as "a sub-class of the non-feminine gender", are vigorous in Italy in an area running roughly from Ancona to Matera and just north of Rome to Naples. Oppositions with masculine typically have been recategorized, so that neuter signifies the referent in general, while masculine indicates a more specific instance, with the distinction marked by the definite article. In Southeast Umbrian, for example, neuter lo pane is 'the bread', while masculine lu pane refers to an individual piece or loaf of bread. Similarly, neuter lo vinu is wine in general, while masculine lu vinu is a specific sort of wine, with the consequence that mass lo vinu has no plural counterpart, but lu vinu can take a sortal plural form li vini'', referring to different types of wine. Phonological forms of articles vary by locale.
Case
Latin had an extensive case system, where all nouns were declined in six cases and two numbers. Many adjectives were additionally declined in three genders, leading to a possible 6 × 2 × 3 = 36 endings per adjective. In practice, some category combinations had identical endings to other combinations, but a basic adjective like bonus "good" still had 14 distinct endings.In all Romance languages, this system was drastically reduced. In most modern Romance languages, in fact, case is no longer marked at all on nouns, adjectives and determiners, and most forms are derived from the Latin accusative case. Much like English, however, case has survived somewhat better on pronouns.
Most pronouns have distinct nominative, accusative, genitive and possessive forms. Many also have a separate dative form, a disjunctive form used after prepositions, and a special form used with the preposition con "with".
"boy" | "girl" | "man" | "woman" | |
Singular | chico | chica | hombre | mujer |
Plural | chicos | chicas | hombres | mujeres |
The system of inflectional classes is also drastically reduced. The basic system is most clearly indicated in Spanish, where there are only three classes, corresponding to the first, second and third declensions in Latin: plural in -as, plural in -os, plural in -es. The singular endings exactly track the plural, except the singular -e is dropped after certain consonants.
The same system underlines many other modern Romance languages, such as Portuguese, French and Catalan. In these languages, however, further sound changes have resulted in various irregularities. In Portuguese, for example, loss of /l/ and /n/ between vowels produces various irregular plurals "; hotel – hotéis "hotel.
In French and Catalan, loss of /o/ and /e/ in most unstressed final syllables has caused the -os and -es classes to merge. In French, merger of remaining /e/ with final /a/ into, and its subsequent loss, has completely obscured the original Romance system, and loss of final /s/ has caused most nouns to have identical pronunciation in singular and plural, although they are still marked differently in spelling.
Noun inflection has survived in Romanian somewhat better than elsewhere. Determiners are still marked for two cases in both singular and plural, and feminine singular nouns have separate endings for the two cases. In addition, there is a separate vocative case, enriched with native development and Slavic borrowings and the combination of noun with a following clitic definite article produces a separate set of "definite" inflections for nouns.
The inflectional classes of Latin have also survived more in Romanian than elsewhere, e.g. om – oameni "man – men" ; corp – corpuri "body – bodies". " vs. lună – luni "moon"; frate – fraţi "brother" vs. carte – cărţi "book" vs. vale – văi "valley
In Italian, the situation is somewhere in between Spanish and Romanian. There are no case endings and relatively few classes, as in Spanish, but noun endings are generally formed with vowels instead of /s/, as in Romanian: amico – amici "friend ", amica – amiche "friend "; cane – cani "dog". The masculine plural amici is thought to reflect the Latin nominative plural -ī rather than accusative plural -ōs ; however, the other plurals are thought to stem from special developments of Latin -ās and -ēs.
Case | Latin | Spanish | Old French | Old Sursilvan | Romanian | |
Masculine singular | Nominative | bonus | bueno | buens | buns | bun |
Masculine singular | Vocative | bone | bueno | buens | buns | bun |
Masculine singular | Accusative | bonum | bueno | buen | biVn | bun |
Masculine singular | Genitive | bonī | bueno | buen | biVn | bun |
Masculine singular | Dative | bonō | bueno | buen | biVn | bun |
Masculine singular | Ablative | bonō | bueno | buen | biVn | bun |
Masculine plural | Nominative | bonī | buenos | buen | biVni | buni |
Masculine plural | Vocative | bonī | buenos | buen | biVni | buni |
Masculine plural | Accusative | bonōs | buenos | buens | buns | buni |
Masculine plural | Genitive | bonōrum | buenos | buens | buns | buni |
Masculine plural | Dative | bonīs | buenos | buens | buns | buni |
Masculine plural | Ablative | bonīs | buenos | buens | buns | buni |
Feminine singular | Nominative | bona | buena | buene | buna | bună |
Feminine singular | Vocative | bona | buena | buene | buna | bună |
Feminine singular | Accusative | bonam | buena | buene | buna | bună |
Feminine singular | Genitive | bonae | buena | buene | buna | bune |
Feminine singular | Dative | bonae | buena | buene | buna | bune |
Feminine singular | Ablative | bonā | buena | buene | buna | bune |
Feminine plural | Nominative | bonae | buenas | buenes | bunas | bune |
Feminine plural | Vocative | bonae | buenas | buenes | bunas | bune |
Feminine plural | Accusative | bonās | buenas | buenes | bunas | bune |
Feminine plural | Genitive | bonārum | buenas | buenes | bunas | bune |
Feminine plural | Dative | bonīs | buenas | buenes | bunas | bune |
Feminine plural | Ablative | bonīs | buenas | buenes | bunas | bune |
A different type of noun inflection survived into the medieval period in a number of western Romance languages. This inflection distinguished nominative from oblique, grouping the accusative case with the oblique, rather than with the nominative as in Romanian.
The oblique case in these languages generally inherits from the Latin accusative; as a result, masculine nouns have distinct endings in the two cases while most feminine nouns do not.
A number of different inflectional classes are still represented at this stage. For example, the difference in the nominative case between masculine li voisins "the neighbor" and li pere "the father", and feminine la riens "the thing" vs. la fame "the woman", faithfully reflects the corresponding Latin inflectional differences.
A number of synchronically quite irregular differences between nominative and oblique reflect direct inheritances of Latin third-declension nouns with two different stems, most with of which had a stress shift between nominative and the other forms: li ber – le baron "baron" ; la suer – la seror "sister" ; li prestre – le prevoire "priest" ; li sire – le seigneur "lord" ; li enfes – l'enfant "child".
A few of these multi-stem nouns derive from Latin forms without stress shift, e.g. li om – le ome "man". All of these multi-stem nouns refer to people; other nouns with stress shift in Latin have not survived. Some of the same nouns with multiple stems in Old French or Old Occitan have come down in Italian in the nominative rather than the accusative, suggesting that a similar system existed in pre-literary Italian.
The modern situation in Sursilvan is unique in that the original nominative/oblique distinction has been reinterpreted as a predicative/attributive distinction:
- il hotel ej vɛɲiws natsionalizaws "the hotel has been nationalized"
- il hotel natsionalizaw "the nationalized hotel"
Pronouns, determiners
Most Romance languages have the following sets of pronouns and determiners:
- Personal pronouns, in three persons and two genders.
- A reflexive pronoun, used when the object is the same as the subject. This approximately corresponds to English "-self", but separate forms exist only in the third person, with no number marking.
- Definite and indefinite articles, and in some languages, a partitive article that expresses the concept of "some".
- A two-way or three-way distinction among demonstratives. Many languages have a three-way distinction of distance which, though not paralleled in current English, used to be present as "this/that/yon".
- Relative pronouns and interrogatives, with the same forms used for both.
- Various indefinite pronouns and determiners.
Personal pronouns
Development from Latin
The Romance languages do not retain the Latin third-person personal pronouns, but have innovated a separate set of third-person pronouns by borrowing the demonstrative ille, and creating a separate reinforced demonstrative by attaching a variant of ecce "behold!" to the pronoun.Similarly, in place of the genitive of the Latin pronouns, most Romance languages adopted the reflexive possessive, which then serves indifferently as both reflexive and non-reflexive possessive. Note that the reflexive, and hence the third-person possessive, is unmarked for the gender of the person being referred to. Hence, although gendered possessive forms do exist—e.g. Portuguese seu vs. sua —these refer to the gender of the object possessed, not the possessor.
The gender of the possessor needs to be made clear by a collocation such as French la voiture à lui/elle, Portuguese o carro dele/dela, literally "the car of him/her".
The same demonstrative ille is the source of the definite article in most Romance languages, which explains the similarity in form between personal pronoun and definite article. When the two are different, it is usually because of differing degrees of phonological reduction. Generally, the personal pronoun is unreduced, while the article has undergone various degrees of reduction, beginning with loss of one of the two original syllables, e.g. Spanish ella "she" < illa vs. la "the " < -la < illa, or masculine el, developed from il- < illud.
Clitic pronouns
Object pronouns in Latin were normal words, but in the Romance languages they have become clitic forms, which must stand adjacent to a verb and merge phonologically with it. Originally, object pronouns could come either before or after the verb; sound change would often produce different forms in these two cases, with numerous additional complications and contracted forms when multiple clitic pronouns cooccurred.Catalan still largely maintains this system with a highly complex clitic pronoun system. Most languages, however, have simplified this system by undoing some of the clitic mergers and requiring clitics to stand in a particular position relative to the verb.
When a pronoun cannot serve as a clitic, a separate disjunctive form is used. These result from dative object pronouns pronounced with stress, or from subject pronouns.
Most Romance languages are null subject languages. The subject pronouns are used only for emphasis and take the stress, and as a result are not clitics. In French, however, verbal agreement marking has degraded to the point that subject pronouns have become mandatory, and have turned into clitics. These forms cannot be stressed, so for emphasis the disjunctive pronouns must be used in combination with the clitic subject forms. Friulian and the Gallo-Italian languages have actually gone further than this and merged the subject pronouns onto the verb as a new type of verb agreement marking, which must be present even when there is a subject noun phrase.
Familiar–formal distinction
In medieval times, most Romance languages developed a distinction between familiar and polite second-person pronouns, similar to the former English distinction between familiar "thou" and polite "you". This distinction was determined by the relationship between the speakers. As in English, this generally developed by appropriating the plural second-person pronoun to serve in addition as a polite singular. French is still at this stage, with familiar singular tu vs. formal or plural vous. In cases like this, the pronoun requires plural agreement in all cases whenever a single affix marks both person and number, but singular agreement elsewhere where appropriate.Many languages, however, innovated further in developing an even more polite pronoun, generally composed of some noun phrases and taking third-person singular agreement. A plural equivalent was created at the same time or soon after, taking third-person plural agreement. Spanish innovated similarly, with usted from earlier vuestra merced.
In Portuguese and Spanish, the "extra-polite" forms in time came to be the normal polite forms, and the former polite second-person vos was displaced to a familiar form, either becoming a familiar plural or a familiar singular. In the latter case, it either competes with the original familiar singular tú, displaces it entirely, or is itself displaced. In the Spanish of the Americas, the gap created by the loss of familiar plural vos was filled by originally polite ustedes, with the result that there is no familiar/polite distinction in the plural, just as in the original tú/vos system.
A similar path was followed by Italian and Romanian. Romanian uses dumneavoastră "your lordship", while Italian the former polite phrase sua eccellenza "your excellency" has simply been supplanted by the corresponding pronoun Ella or Lei. As in European Spanish, the original second-person plural voi serves as familiar plural.
Portuguese innovated again in developing a new extra-polite pronoun o senhor "the sir", which in turn downgraded você. Hence, modern European Portuguese has a three-way distinction between "familiar" tu, "equalizing" você and "polite" o senhor.
Brazilian Portuguese, however, has diverged from this system, and most dialects simply use você as a general-purpose second-person pronoun, combined with te as the clitic object pronoun. The form o senhor is sometimes used in speech, but only in situations where an English speaker would say "sir" or "ma'am". The result is that second-person verb forms have disappeared, and the whole pronoun system has been radically realigned. However that is the case only in the spoken language of central and northern Brazil, with the northeastern and southern areas of the country still largely preserving the second-person verb form and the "tu" and "você" distinction.
Catalan still retains the plural form vós for formal distinction but it is falling out of use, and nowadays is usually seen in extremely formal circumstances or in writing. Instead, or is normally used orally, which functions just like Spanish and Portuguese usted/você.
Articles
Latin had no articles as such. The closest definite article was the non-specific demonstrative is, ea, id meaning approximately "this/that/the". The closest indefinite articles were the indefinite determiners aliquī, aliqua, aliquod "some " and certus "a certain".Romance languages have both indefinite and definite articles, but none of the above words form the basis for either of these. Usually the definite article is derived from the Latin demonstrative ille, but some languages have forms from ipse. The indefinite article everywhere is derived from the number ūnus.
Some languages, e.g. French and Italian, have a partitive article that approximately translates as "some". This is used either with mass nouns or with plural nouns—both cases where the indefinite article cannot occur. A partitive article is used whenever a bare noun refers to specific quantity of the noun, but not when a bare noun refers to a class in general. For example, the partitive would be used in both of the following sentences:
But neither of these:
The sentence "Men arrived today", however, means "some specific men arrived today" rather than "men, as a general class, arrived today". On the other hand, "I hate men" does mean "I hate men, as a general class" rather than "I hate some specific men".
As in many other cases, French has developed the farthest from Latin in its use of articles. In French, nearly all nouns, singular and plural, must be accompanied by an article or demonstrative pronoun.
Due to pervasive sound changes in French, most nouns are pronounced identically in the singular and plural, and there is often heavy homophony between nouns and identically pronounced words of other classes. For example, all of the following are pronounced : sain "healthy"; saint "saint, holy"; sein "breast"; ceins " tie around, gird"; ceint " ties around, girds"; ceint "tied around, girded"; and the equivalent noun and adjective plural forms sains, saints, seins, ceints. The article helps identify the noun forms saint or sein, and distinguish singular from plural; likewise, the mandatory subject of verbs helps identify the verb ceint. In more conservative Romance languages, neither articles nor subject pronouns are necessary, since all of the above words are pronounced differently. In Italian, for example, the equivalents are sano, santo, seno, cingi, cinge, cinto, sani, santi, seni, cinti, where all vowels and consonants are pronounced as written, and /s/ and /t͡ʃ/ are clearly distinct from each other.
Latin, at least originally, had a three-way distinction among demonstrative pronouns distinguished by distal value: hic 'this', iste 'that ', ille 'that ', similar to the distinction that used to exist in English as "this" vs. "that" vs. "yon". In urban Latin of Rome, iste came to have a specifically derogatory meaning, but this innovation apparently did not reach the provinces and is not reflected in the modern Romance languages. A number of these languages still have such a three-way distinction, although hic has been lost and the other pronouns have shifted somewhat in meaning. For example, Spanish has este "this" vs. ese "that " vs. aquel "that ". The Spanish pronouns derive, respectively, from Latin iste ipse accu-ille, where accu- is an emphatic prefix derived from eccum "behold ", possibly with influence from atque "and".
Reinforced demonstratives such as accu-ille arose as ille came to be used as an article as well as a demonstrative. Such forms were often created even when not strictly needed to distinguish otherwise ambiguous forms. Italian, for example, has both questo "this" and quello "that", in addition to dialectal codesto "that ". French generally prefers forms derived from bare ecce "behold", as in the pronoun ce "this one/that one" and the determiner ce/cet "this/that".
Reinforced forms are likewise common in locative adverbs, based on related Latin forms such as hic "this" vs. hīc "here", hāc "this way", and ille "that" vs. illīc "there", illāc "that way". Here again French prefers bare ecce while Spanish and Italian prefer eccum. In western languages such as Spanish, Portuguese and Catalan, doublets and triplets arose such as Portuguese aqui, acá, cá " here". From these, a prefix a- was extracted, from which forms like aí "there " and ali "there " were created; compare Catalan neuter pronouns açò "this", això "that ", allò "that ".
Subsequent changes often reduced the number of demonstrative distinctions. Standard Italian, for example, has only a two-way distinction "this" vs. "that", as in English, with second-person and third-person demonstratives combined. In Catalan, however, a former three-way distinction aquest, aqueix, aquell has been reduced differently, with first-person and second-person demonstratives combined. Hence aquest means either "this" or "that "; on the phone, aquest is used to refer both to speaker and addressee.
Old French had a similar distinction to Italian, both of which could function as either adjectives or pronouns. Modern French, however, has no distinction between "this" and "that": ce/cet, cette < cest, ceste is only an adjective, and celui, celle < cel lui, celle is only a pronoun, and both forms indifferently mean either "this" or "that".
Verbal morphology
Verbs have many conjugations, including in most languages:- A present tense, a preterite, an imperfect, a pluperfect, a future tense and a future perfect in the indicative mood, for statements of fact.
- Present and preterite subjunctive tenses, for hypothetical or uncertain conditions. Several languages have also imperfect and pluperfect subjunctives, although it is not unusual to have just one subjunctive equivalent for preterit and imperfect. Portuguese and Spanish also have future and future perfect subjunctives, which have no equivalent in Latin.
- An imperative mood, for direct commands.
- Three non-finite forms: infinitive, gerund, and past participle.
- Distinct active and passive voices, as well as an impersonal passive voice.
- Note that, although these categories are largely inherited from Classical Latin, many of the forms are either newly constructed or inherited from different categories.
The main tense and mood distinctions that were made in classical Latin are generally still present in the modern Romance languages, though many are now expressed through compound rather than simple verbs. The passive voice, which was mostly synthetic in classical Latin, has been completely replaced with compound forms.
- Owing to sound changes which made it homophonous with the preterite, the Latin future indicative tense was dropped, and replaced with a periphrasis of the form infinitive + present tense of habēre. Eventually, this structure was reanalysed as a new future tense.
- In a similar process, an entirely new conditional form was created.
- While the synthetic passive voice of classical Latin was abandoned in favour of periphrastic constructions, most of the active voice remained in use. However, several tenses have changed meaning, especially subjunctives. For example:
- * The Latin pluperfect indicative became a conditional in Sicilian, and an imperfect subjunctive in Spanish.
- * The Latin pluperfect subjunctive developed into an imperfect subjunctive in all languages except Romansh, where it became a conditional, and Romanian, where it became a pluperfect indicative.
- * The Latin preterite subjunctive, together with the future perfect indicative, became a future subjunctive in Old Spanish, Portuguese, and Galician.
- * The Latin imperfect subjunctive became a personal infinitive in Portuguese and Galician.
- Many Romance languages have two verbs "to be". One is derived from Vulgar Latin *essere < Latin esse "to be" with an admixture of forms derived from sedēre "to sit", and is used mostly for essential attributes; the other is derived from stāre "to stand", and mostly used for temporary states. This development is most notable in Spanish, Portuguese and Catalan. In French, Italian and Romanian, the derivative of stāre largely preserved an earlier meaning of "to stand/to stay", although in modern Italian, stare is used in a few constructions where English would use "to be", as in sto bene "I am well". In Old French, the derivatives of *essere and stāre were estre and ester, respectively. In modern French, estre persists as être "to be" while ester has been lost as a separate verb; but the former imperfect of ester is used as the modern imperfect of être, replacing the irregular forms derived from Latin, iere. In Italian, the two verbs share the same past participle, stato. sedēre persists most notably in the future of *essere, although in Old French the future is a direct derivation from Latin, e.g. ert "he will be" < erit. See Romance copula for further information.
- During the Renaissance, Italian, Portuguese, Spanish and a few other Romance languages developed a progressive aspect which did not exist in Latin. In French, progressive constructions remain very limited, the imperfect generally being preferred, as in Latin.
- Many Romance languages now have a verbal construction analogous to the present perfect of English. In some, it has taken the place of the old preterite ; in others, the two coexist with somewhat different meanings. A few examples:
- * preterite only: Galician, Asturian, Sicilian, Leonese, Portuguese, some dialects of Spanish;
- * preterite and present perfect: Catalan, Occitan, standard Spanish, Tuscan/standard Italian;
- * present perfect predominant, preterite now literary: French, Romanian, several Italian dialects, some dialects of Spanish;
- * present perfect only: Romansh
The following are common features of the Romance languages that are different from Classical Latin:
- Adjectives generally follow the noun they modify.
- The normal clause structure is SVO, rather than SOV, and is much less flexible than in Latin.
- Many Latin constructions involving nominalized verbal forms were dropped in favor of constructions with subordinate clause. Exceptions can be found in Italian, for example, Latin tempore permittente > Italian tempo permettendo; L. hoc facto > I. ciò fatto.
Lexicon
Loanwords
Romance languages have borrowed heavily, though mostly from other Romance languages. However, some, such as Spanish, Portuguese, Romanian, and French, have borrowed heavily from other language groups. Vulgar Latin borrowed first from indigenous languages of the Roman empire, and during the Germanic folk movements, from Germanic languages, especially Gothic; for Eastern Romance languages, during Bulgarian Empires, from Slavic languages, especially Bulgarian. Notable examples are *blancus "white", replacing native albus ; *guerra "war", replacing native bellum; and the words for the cardinal directions, where cognates of English "north", "south", "east" and "west" replaced the native words septentriō, merīdiēs, oriens, and occidens. Some Celtic words were incorporated into the core vocabulary, partly for words with no Latin equivalent, but in some cases replacing Latin vocabulary. Many Greek loans also entered the lexicon, e.g. spatha "sword" ; cara "face" ; colpe "blow" ; cata "each" ; common suffixes *-ijāre/-izāre, -ista.Lexical innovation
Many basic nouns and verbs, especially those that were short or had irregular morphology, were replaced by longer derived forms with regular morphology. Nouns, and sometimes adjectives, were often replaced by diminutives, e.g. auris "ear" > auricula > oricla ; avis "bird" > avicellus > aucellu ; caput "head" > capitium ; vetus "old" > vetulus > veclus. Sometimes augmentative constructions were used instead: piscis "fish" > Old French peis > peisson > French poisson. Verbs were often replaced by frequentative constructions: canere "to sing" > cantāre; iacere "to throw" > iactāre > *iectāre ; iuvāre > adiūtāre ; vēnārī "hunt" > replaced by *captiāre "to hunt", frequentative of capere "to seize".Many Classical Latin words became archaic or poetic and were replaced by more colloquial terms: equus "horse" > caballus ; domus "house" > casa ; ignis "fire" > focus ; strāta "street" > rūga or callis . In some cases, terms from common occupations became generalized: invenīre "to find" replaced by afflāre ; advenīre "to arrive" gave way to plicāre, elsewhere arripāre . The same thing sometimes happened to religious terms, due to the pervasive influence of Christianity: loquī "to speak" succumbed to parabolāre or fabulārī ~ fābellāre, based on Jesus' way of speaking in parables.
Many prepositions were used as verbal particles to make new roots and verb stems, e.g. Italian estrarre, Aromanian astragu, astradziri "to extract" from Latin ex- "out of" and trahere "to pull", or to augment already existing words, e.g. French coudre, Italian cucire, Portuguese coser "to sew", from cōnsuere "to sew up", from suere "to sew", with total loss of the bare stem. Many prepositions and commonly became compounded, e.g. de ex > French dès "as of", ab ante > Italian avanti "forward". Some words derived from phrases, e.g. Portuguese agora, Spanish ahora "now" < hāc hōrā "at this hour"; French avec "with" < Old French avuec < apud hoc "with that"; Spanish tamaño, Portuguese tamanho "size" < tam magnum "so big"; Italian codesto "this, that" < Old Italian cotevesto < eccum tibi istum approx. "here's that thing of yours"; Portuguese você "you" < vosmecê < vossemecê < Galician-Portuguese vossa mercee "your mercy".
A number of common Latin words that have disappeared in many or most Romance languages have survived either in the periphery or in remote corners, or as secondary terms, sometimes differing in meaning. For example, Latin caseum "cheese" in the periphery, but in the central areas has been replaced by formāticum, originally "moulded " edere "to eat ", which survives as Spanish/Portuguese comer but elsewhere is replaced by mandūcāre, originally "to chew". In some cases, one language happens to preserve a word displaced elsewhere, e.g. Italian ogni "each, every" < omnes, displaced elsewhere by tōtum, originally "whole" or by a reflex of Greek κατά ; Friulian vaî "to cry" < flere "to weep"; Serbo-Croatian otijemna "sail pole" < Dalmatian < antenna "yardarm". Sardinian in particular preserves many words entirely lost elsewhere, e.g. eja "yes" < etiam "also/yes/indeed", emmo "yes" < immo "rather/yes/no", mannu "big" < magnus, nàrrere "to say" < narrāre "to tell", and domo "house" < domō "at home". Sardinian preserves some words that were already archaic in Classical Latin, e.g. àchina "grape" < acinam, while Sicily and Calabria typically have forms with initial /r/: ràcina''.
Latinisms
During the Middle Ages, scores of words were borrowed directly from Classical Latin, either in their original form or in a somewhat nativized form. These resulted in many doublets—pairs of inherited and learned words—such as those in the table below:Latin | Romance | Inherited | Latinism |
fragilis "fragile" | French | frêle "frail" | fragile "fragile" |
fabrica "craft, manufacture" | French | forge "forge" | fabrique "factory" |
fabrica | Spanish | fragua "forge" | fábrica "factory" |
fabrica | Romanian | făură "blacksmith " | fabrică "factory" |
lēgālis "legal" | French | loyal "loyal" | légal "legal" |
lēgālis | Spanish | leal "loyal" | legal "legal" |
advōcātus "advocate " | French | avoué "solicitor " | avocat "barrister " |
polīre "to polish" | Portuguese | puir "to wear thin" | polir "to polish" |
locālis "place" | Portuguese | lugar "place" | local "place" |
Sometimes triplets arise: Latin articulus "joint" > Portuguese artículo "joint, knuckle", artigo "article", artelho "ankle". In many cases, the learned word simply displaced the original popular word: e.g. Spanish crudo "crude, raw" ; French légume "vegetable" ; Portuguese flor "flower". The learned loan always sounds more like the original than the inherited word does, because regular sound change has been bypassed; likewise, the learned word usually has a meaning closer to that of the original. In French, the stress of the modern form of the learned loan may be on the "wrong" syllable vis-à-vis Latin, whereas the stress of the inherited word always corresponds to the Latin stress: e.g. Latin :wikt:vipera#Latin|vipera vs. French :wikt:vipère#French|vipère, learned loan, and guivre/:wikt:vouivre#French|vouivre, inherited.
Borrowing from Classical Latin has produced a large number of suffix doublets. Examples from Spanish : -ción vs. -zon; -cia vs. -za; -ificar vs. -iguar; -izar vs. -ear; -mento vs. -miento; -tud vs. -dumbre ; -ículo vs. -ejo; etc. Similar examples can be found in all the other Romance languages.
This borrowing also introduced large numbers of classical prefixes in their original form and reinforced many others. Many Greek prefixes and suffixes also found their way into the lexicon: tele-, poli-/poly-, meta-, pseudo-, -scope/scopo, -logie/logia/logía, etc.