Jump to content

Linguistics: Difference between revisions

From Thesmotetai
Line 198: Line 198:


'''Lexical relations''' describe how words stand to one another. Two words are <u>synonyms</u> if they share the same or a very similar meaning, like ''car'' and ''automobile,'' or ''buy'' and ''purchase''. <u>Antonyms</u> have opposite meanings, such as with ''alive'' and ''dead,'' or ''fast'' and ''slow''. One term is a <u>hyponym</u> of another if the meaning of the first is included in the meaning of the second (''ant'' is a hyponym of ''insect''). A <u>prototype</u> is a hyponym that has characteristic features of the type it belongs to (a ''robin'' is a prototype of a ''bird'' but a ''penguin'' is not). Two words with the same pronunciation are <u>homophones</u> like ''flour'' and ''flower'', while two words with the same spelling are <u>homonyms</u>, like a bank of a river in contrast to a bank as a financial institution. <u>Hyponymy</u> is closely related to <u>meronymy</u>, which describes the relation between part and whole. For instance, ''wheel'' is a meronym of ''car''. An expression is <u>ambiguous</u> if it has more than one possible meaning. The term <u>polysemy</u> is used if different meanings of a word are closely related to one another, like with the word ''head:'' the topmost part of the human body, or the top-ranking person in an organization.
'''Lexical relations''' describe how words stand to one another. Two words are <u>synonyms</u> if they share the same or a very similar meaning, like ''car'' and ''automobile,'' or ''buy'' and ''purchase''. <u>Antonyms</u> have opposite meanings, such as with ''alive'' and ''dead,'' or ''fast'' and ''slow''. One term is a <u>hyponym</u> of another if the meaning of the first is included in the meaning of the second (''ant'' is a hyponym of ''insect''). A <u>prototype</u> is a hyponym that has characteristic features of the type it belongs to (a ''robin'' is a prototype of a ''bird'' but a ''penguin'' is not). Two words with the same pronunciation are <u>homophones</u> like ''flour'' and ''flower'', while two words with the same spelling are <u>homonyms</u>, like a bank of a river in contrast to a bank as a financial institution. <u>Hyponymy</u> is closely related to <u>meronymy</u>, which describes the relation between part and whole. For instance, ''wheel'' is a meronym of ''car''. An expression is <u>ambiguous</u> if it has more than one possible meaning. The term <u>polysemy</u> is used if different meanings of a word are closely related to one another, like with the word ''head:'' the topmost part of the human body, or the top-ranking person in an organization.
=== Thematic roles ===
Also known as semantic roles, are conceptual categories that describe the relationship between a '''verb''' and its '''arguments''' (the entities involved in the action or state described by the verb). These roles help in understanding how different participants in a sentence are related to the action or state of the verb.
The '''Agent''' is the <u>doer or initiator of an action</u>, typically a volitional entity that performs the action described by the verb. Agents are usually animate and capable of intentional actions. In the sentence "The chef cooked dinner," the chef is the Agent; the one performing the action of cooking.
The '''Patient''' is the entity that <u>undergoes the action</u> or the entity that is <u>affected by the action</u> of the verb. Patients are often the recipient of an action's effects and can be animate or inanimate. In "The chef cooked dinner," "dinner" is the Patient because it is being acted upon - it undergoes the action of being cooked.
The '''Theme''' is similar to the Patient in that it is <u>involved in</u> the action, but it is often used in a broader sense to include entities that are <u>being moved</u>, <u>experienced</u>, or otherwise <u>central to the action</u> without being <u>affected</u> in a manner that changes their state. The distinction between Patients and Themes can be subtle and, in some contexts, overlap; themes are central to the action, but not necessarily affected by it. In "She read a book," the book is the Theme - it is involved in the action of reading without undergoing a change.
The '''Experiencer''' is the entity that <u>perceives</u> or <u>feels an action</u> or state rather than physically carrying it out. This role is typically associated with mental states and perceptions. Experiencers are sentient, usually animate, and involved in actions or states in a '''non-volitional''' manner. In "She loves the painting," "she" is the Experiencer because she is the one having the feeling of love towards the painting.


== Morphology ==
== Morphology ==

Revision as of 00:31, 11 March 2024

Syntax

The study of how words (the smallest unit of grammar that has meaning and can stand alone; the word 'car') and morphemes (smaller units than words that carry meaning but cannot stand alone; the prefix 'un-') combine to form larger units of grammar, such as phrases and sentences. Syntax confirms itself with word order, grammatical relationships between words and morphemes, constituency (hierarchical structure for sentences), agreement (when words change to adapt to their neighbors in a larger combined unit of grammar), crosslinguistic variation, and the semantics (the relationship between the form of a word and its meaning).

The word syntax comes from Ancient Greek: σύνταξις 'coordination,' which consists of σύν (syn), 'together,' and τάξις (táxis), 'ordering.' Language played a crucial role in Greek philosophy, with figures like Plato and Aristotle exploring the relationship between words, meaning, and reality. Aristotle's work on logic delved into the principles of constructing valid arguments, indirectly touching upon aspects of syntax. Classical Greek and Hellenic thinkers made substantial contributions to the study of grammar; Dionysius Thrax (100s BCE), composed the Tékhnē Grammatikḗ (Art of Grammar), the first work of analytical linguistics focusing on Ancient Greek, which included discussions on parts of speech, morphology, and syntax.

In English, syntax is largely controlled by word order ('the girl loves the boy' versus 'the boy loves the girl') whereas in many other languages, case markers indicate these grammatical relationships; we see this trait in Latin where word order is far less important - 'the girl loves the boy' can be written in a variety of correct orders because the -um ending on the object (boy) stays constant (puerum puella amat, amat puella puerum, amat puerum puella, or puella amat puerum are all correct. Editor's note: puella is girl, puer(-um) is boy, and amat is love).

Word Order

The sequence in which the subject (S), verb (V), and object (O) typically appear in sentences. Typically is an important qualifier; while the dominant word order offers a foundational understanding, many languages exhibit flexibility in their word order, especially for emphasis, thematic structure, or in questions versus statements.

  • Subject-Object-Verb (SOV): Languages that typically use SOV order include Japanese, Korean, Turkish, and Latin. Around 40-45% of the world's languages are believed to use SOV as their default word order.
  • Subject-Verb-Object (SVO): Languages that typically use SVO order include English, Mandarin Chinese, Spanish, and Russian. Approximately 35-40% of languages are thought to use SVO as their primary order.
  • Verb-Subject-Object (VSO) and Verb-Object-Subject (VOS): These orders are less common. VSO is seen in Classical Arabic and Welsh, while VOS appears in languages like Malagasy and Fijian. Combined, VSO and VOS might account for around 10-15% of languages.
  • Object-Verb-Subject (OVS) and Object-Subject-Verb (OSV): These are the rarest word orders among languages. Examples include Hixkaryana for OVS and Xavante for OSV. Together, they constitute less than 1% of all languages.

The separation by language family is as follows:

  • Indo-European predominantly features SVO (English, Spanish) and SOV (Hindi, Persian) orders.
  • Sino-Tibetan languages can show diversity in structure; Mandarin Chinese is notably SVO.
  • Turkic generally features SOV order, as seen in Turkish and Uzbek.
  • Uralic features a mix, but Finnish and Hungarian, for example, tend to favor SVO.
  • Afro-Asiatic languages like Arabic (VSO), Hebrew (SVO), and Amharic (SOV) show diversity.
  • Austronesian also has this variability; Indonesian (SVO) and Malagasy (VOS).
  • Dravidian primarily features SOV order, as in Tamil and Telugu.

Grammatical Relationships

The standard examples of grammatical functions from traditional grammar are subject, direct object, and indirect object.

  • The subject is the noun/pronoun about which a statement is made (John gave an apple to Sally).
  • The direct object is the noun/pronoun being acted upon by the verb (John gave an apple to Sally).
  • The indirect object is the recipient of the direct object (John gave an apple to Sally).

Many modern theories of grammar acknowledge numerous additional types of relations (e.g. complement, specifier, predicative, et cetera). The role of grammatical relations in theories of grammar is greatest in dependency grammars, which propose dozens of distinct grammatical relations. Critics argue that overemphasis on certain grammatical models, such as word order (particularly from an Indo-European focus), may overlook the diversity and complexity of language structures worldwide.

  • Complement: A word or phrase that is necessary to complete the meaning of another part of the sentence. Their main role is to complete the idea expressed by the word they complement. Without the complement, the idea would feel incomplete.
    • Verb Complement: Provides essential information about the action or state described by the verb. For example, in "She gave her friend a gift," "her friend a gift" is a complement of the verb "gave" because it completes the action by specifying what was given and to whom.
    • Noun Complement: Completes the meaning of a noun. For instance, "The decision to leave early" includes "to leave early" as a complement of "decision," explaining what the decision is about.
    • Adjective Complement: Completes the meaning of an adjective. In "She is capable of winning," "of winning" is a complement of "capable," specifying in what way she is capable.
  • Specifier: A word that modifies or provides more specific information about another word, often relating to quantity, definiteness, or possession. Specifiers can be articles (the, a), possessive pronouns (his, her), demonstratives (this, that), and quantifiers (some, many).
  • Predicative: A predicative (or predicate) element relates to the subject or object by providing information about it, typically through a linking verb (such as "to be," "seem," "become"). Predicatives can be predicative adjectives or predicative nominatives (nouns or pronouns) that describe or identify the subject or object. Predicatives provide additional information or description about the subject or object, linking it to a quality, identity, or condition.
    • Predicative Adjective: Describes the subject or object. In "The sky is blue," "blue" is a predicative adjective providing information about "the sky."
    • Predicative Nominative: Identifies or renames the subject or object. For example, in "Karen is a teacher," "a teacher" is a predicative nominative that identifies Karen's occupation.

Dependency grammars emphasize the idea that linguistic units are connected to each other by direct links or dependencies, forming a network of relations that structure the sentence. In this framework, each unit depends on a head (a central word it is connected to) and can have dependents (words that depend on it). Dependency grammars identify a wide range of specific grammatical relations to describe the types of dependencies that can exist, reflecting the nuanced ways words can relate to each other within a sentence.

Some languages, often referred to as non-configurational, exhibit a high degree of flexibility in word order. Grammatical relationships are often indicated through inflectional morphology. In Australian Aboriginal languages, Latin, and Classical Greek, the subject, object, and verb can appear in various orders without changing the fundamental meaning of the sentence.

In topic-prominent languages, such as Mandarin Chinese, the topic of the sentence (what the sentence is about, which can be the subject, object, or another element) comes first, and what is said about the topic follows. This structure emphasizes the topic-comment construction over the subject-predicate construction typical of subject-prominent languages like English.

In languages with rich morphological systems, such as agglutinative and fusional languages, grammatical roles are often marked by case endings or through agreement rather than strictly by word order. This allows for greater flexibility in sentence structure and can convey additional nuances.

Polysynthetic languages, which include many indigenous languages of the Americas, incorporate a high degree of information within single words through complex inflection. This can include the subject, object, verb, and additional modifiers and relational elements, making the concept of word order as applied in more analytic languages less directly relevant.

Inflection

A process of word formation where a word is modified to express different grammatical categories such as tense, case, voice, aspect, comparatives, person, number, gender, mood, animacy, agreement, and definiteness. When a verb is inflected, it is called conjugation; when other word types (nouns, adjectives, adverbs, pronouns, determiners, participles, prepositions, postpositions, numerals, articles, aspects, et cetera) it is called declension.

An inflection expresses grammatical categories with affixation (such as prefix, suffix, infix, circumfix, and transfix), apophony (as Indo-European ablaut, sound changes to the root word), or other modifications. For example, the Latin verb ducam, meaning "I will lead", includes the suffix -am, expressing person (first), number (singular), and tense-mood (future indicative or present subjunctive).

The inflected form of a word often contains both free morphemes (can stand by itself as a word), and bound morphemes (cannot stand alone as a word). For example, the English word cars is a noun that is inflected for number to express the plural: the morpheme car is unbound because it could stand alone as a word, while the suffix -s is bound because it cannot stand alone as a word. These two morphemes together form the inflected word cars.

Words that are never subject to inflection are said to be invariant; for example, the English verb must is an invariant item: it never takes a suffix or changes form to signify a different grammatical category. Its categories can be determined only from its context. Languages that have some degree of inflection are synthetic languages. These can be highly inflected (such as Latin, Greek, Biblical Hebrew, and Sanskrit), or slightly inflected (such as English, Dutch, Persian). Languages that use little inflection are also said to be 'analytic.' Analytic languages that do not make use of derivational morphemes, such as Standard Chinese, are said to be isolating.

Types of Inflections

Inflection often involves agreement between nouns and other parts of speech, such as adjectives, pronouns, and verbs.

Cases indicate the grammatical role of a noun or pronoun in a sentence, such as subject, object, or possession. Common cases include: Nominative - subject of the verb; Accusative - direct object of the verb; Dative - indirect object of the verb; Genitive - possessive relation; Locative - location; Instrumental - means by which an action is performed; Ablative - movement away from something; Vocative - direct address; Allative - movement toward something.

Number distinguishes between one (singular), two (dual), and more than two (plural) entities, with some languages having additional distinctions - some languages distinguish between 'a few' and 'many' (paucal) or may have a particular inflection for a collective quantity.

Gender in language can be a way to classify nouns and pronouns, often impacting agreement with adjectives, verbs, and pronouns; can generally include masculine, feminine, neuter, and/or common, among others.

Animacy can also play a role similar to gender in some languages, with animate or inanimate objects having distinct inflection markers.

Definiteness distinguishes between specific and non-specific objects ('the' versus 'a/an').

Partitive is used in to express something being 'part of' some bigger or greater whole.

Comparison markers indicate comparative and superlative forms.

Many languages have unique declension categories not widely found elsewhere, reflecting specific semantic or syntactic distinctions important in those languages.

In English most nouns are inflected for number with the inflectional plural affix -s (as in "dog" → "dog-s"), and most English verbs are inflected for tense with the inflectional past tense affix -ed (as in "call" → "call-ed"). English also inflects verbs by affixation to mark the third person singular in the present tense (with -s), and the present participle (with -ing). English short adjectives are inflected to mark comparative and superlative forms (with -er and -est respectively). Despite the general regularization, English retains traces of its ancestry, with a minority of its words still using inflection by ablaut (sound change, mostly in verbs) and umlaut (a particular type of sound change, mostly in nouns), as well as long-short vowel alternation.

  • Write, wrote, written (marking by ablaut variation, and also suffixing in the participle); sing, sang, sung (ablaut); foot, feet (marking by umlaut variation); mouse, mice (umlaut); child, children (ablaut, and also suffixing in the plural)

In Latin, there are five types of declension. Words that belong to the first declension usually end in -a and are usually feminine in gender, and thus can be said to share a common inflectional framework. In Old English, nouns are divided into two major categories of declension, strong and weak. Through the process of syncretism, the same inflection form may serve multiple grammatical functions in many languages, and context helps reveal the semantic details.

Morphemes can be added to inflectional languages in a variety of ways:

  • Affixation, or simply adding morphemes onto the word without changing the root;
  • Reduplication, repeating all or part of a word to change its meaning;
  • Alternation, exchanging one sound for another in the root (usually vowel sounds, as in the ablaut / umlaut in Germanic languages);
  • Suprasegmental variations, such as of stress, pitch or tone, where no sounds are added or changed but the intonation and relative strength of each sound is altered regularly.

Inflection and PIE Languages

Because the Proto-Indo-European (PIE) language was highly inflected, all of its descendants, such as Albanian, Armenian, English, German, Ukrainian, Russian, Persian, Kurdish, Italian, Irish, Spanish, French, Hindi, Marathi, Urdu, Bengali, and Nepali, are inflected to a greater or lesser extent. In general, older Indo-European languages such as Latin, Ancient Greek, Old English, Old Norse, Old Church Slavonic and Sanskrit are extensively inflected because of their temporal proximity to the PIE ancestor.

Deflexion has caused modern versions of some Indo-European languages that were previously highly inflected to be much less so; an example is English, as compared to Old English. In general, languages where deflexion occurs replace inflectional complexity with more rigorous word order, which provides the lost inflectional details. However, the transition from synthetic (inflection-heavy) to analytic (relying more on word order and auxiliary words) is complex and can be influenced by multiple factors.

Most Slavic languages and some Indo-Aryan languages are an exception to the general Indo-European deflexion trend, continuing to be highly inflected (in some cases acquiring additional inflectional complexity and grammatical genders, as in Czech & Marathi).

Subordination and Coordination

These fundamental mechanisms by which clauses are combined in languages allow for the expression of complex ideas and relationships between events, actions, and descriptions. These mechanisms contribute to the grammatical hierarchy of sentences, determining how clauses are structured and integrated.

Coordination joins two or more syntactic units of equal rank, such as words, phrases, or clauses, enabling them to function as a single unit. The coordinated elements can be seen as parallel or additive in nature.

  • Markers: often marked by coordinating conjunctions ("and," "or," "but"). However, languages may also use non-conjunctive means, such as intonation, punctuation (in written language), or even juxtaposition without explicit markers.
  • Symmetry and Balance: typically share the same syntactic type (noun with noun, clause with clause) and contribute equally to the meaning of the sentence.
  • Functions: allow for listing (enumeration), addition, choice, contrast, and other relational meanings between equally important ideas or actions.

Subordination involves embedding one clause (the subordinate or dependent clause) within another (the main or independent clause), creating a hierarchical relationship. The subordinate clause functions as a single unit within the larger structure, often specifying time, reason, condition, manner, or place.

  • Markers: Subordination is marked in various ways, including subordinating conjunctions ("because," "if," "when"), relative pronouns ("which," "that," "whom"), or specific verb forms (subjunctive moods, non-finite verb forms). Some languages employ specific particles or changes in word order to indicate subordination.
  • Dependency: The subordinate clause depends on the main clause for its interpretation, not being able to stand alone as a complete sentence. It typically provides additional information about an element in the main clause.
  • Types of Subordinate Clauses: These include adverbial clauses (indicating time, reason, condition, etc.), relative clauses (modifying nouns), and complement clauses (serving as the object or subject of a verb or complement of an adjective or noun).

In a grammatical hierarchy, independent clauses rank higher than dependent clauses. Subordination allows for the embedding of clauses within clauses, leading to potentially complex sentence structures. This embeddedness is key to expressing nuanced and layered relationships between actions, events, and descriptions. Coordination and subordination can interact within a single sentence, with coordinated structures containing subordinate clauses, or with subordinate clauses themselves being coordinated, contributing to the syntactic and semantic complexity of language.

Sentence and Clause Structure

Although the specific grammatical tools and markers used to combine clauses can vary widely among languages, the concepts of simple, compound, complex, and compound-complex sentences offer a useful framework for analyzing sentence types from a language-agnostic perspective.

Simple Sentences

A simple sentence consists of a single independent clause; it contains a subject and a predicate and expresses a complete thought. Simple sentences can be straightforward or can include multiple subjects, objects, or adjuncts, but they always revolve around a single main verb or verb phrase.

  • [Subject] [Verb] [Object].
  • The bird sings.

Compound Sentences

Compound sentences are formed by joining two or more independent clauses of equal status, meaning they could stand alone as simple sentences. The clauses in a compound sentence are typically connected by coordinating conjunctions (such as 'and,' 'but,' 'or') or by punctuation marks like semicolons in written language. The exact conjunctions or punctuation marks used can vary significantly across languages.

  • [Independent Clause] [Conjunction] [Independent Clause].
  • The bird sings and the cat meows.

Complex Sentences

Complex sentences consist of one independent clause and one or more dependent (or subordinate) clauses, which cannot stand alone as a sentence and is linked to the independent clause by subordinating conjunctions (like 'because,' 'since,' 'which') or relative pronouns, depending on the language. The dependent clause adds information to the independent clause, specifying time, reason, condition, contrast, et cetera.

  • [Independent Clause] [Subordinating Conjunction] [Dependent Clause].
  • The bird sings because it is happy.

Compound-Complex Sentences

Compound-complex sentences combine elements of both compound and complex sentences; they include at least two independent clauses and one or more dependent clauses. These sentences are useful for conveying intricate relationships between ideas and combining multiple actions or situations.

  • [Independent Clause] [Conjunction] [Independent Clause] [Subordinating Conjunction] [Dependent Clause].
  • The bird sings, and the cat meows, because it is morning.

Considerations Across Languages

While these categories are helpful for analyzing sentence structure, it's important to recognize that languages may employ unique mechanisms for combining clauses:

  • Differential markings: Some languages might use specific particles, verb forms, or word order changes to distinguish between independent and dependent clauses.
  • Conjunctions and connectors: The exact words or particles used to connect clauses can vary widely, and some languages may use non-conjunctional means to indicate clause relationships.
  • Ellipsis and inference: In some languages, elements of compound or complex sentences might be omitted if they can be inferred from context, making the sentence structure less explicit.

Syntactic Theories and Models of the 21st Century

Developed by Noam Chomsky, generative grammar emphasizes the innate linguistic capability of humans and the idea of a universal grammar underlying all languages. It has evolved through various iterations:

  • Government and Binding (GB) Theory was focused on the modular organization of syntax, including principles like X-bar theory, case theory, and theta theory.
  • Minimalist Program (MP) aims to explain language using the minimal necessary theoretical constructs and principles, focusing on the economy of derivation and representation.

Lexical-Functional Grammar (LFG) emphasizes the importance of lexical information in determining grammatical structure and includes a distinction between functional structure (f-structure) and constituent structure (c-structure).

Head-Driven Phrase Structure Grammar (HPSG) focuses on the lexicon and complex feature-based descriptions, with a strong emphasis on the role of the head in phrase structure and a commitment to a non-transformational view of syntax.

Categorial Grammar (CG) is based on the idea that syntactic categories can be understood in terms of functions that combine words and phrases into larger units. Modern variants include Combinatory Categorial Grammar (CCG) and Type Logical Grammar.

Dependency Grammar (DG) centers on the dependency relation between headwords and their dependents as the primary structure organizing sentences, differing from phrase structure grammars by not assuming constituent phrases as primary.

Construction Grammar (CxG) argues that knowledge of language is based on constructions, or form-meaning pairings, that range from specific idioms to general grammatical patterns. Variants include Cognitive Grammar and Radical Construction Grammar.

Tree Adjoining Grammar (TAG) is a highly lexicalized grammar framework that uses trees as the basic unit of syntax, allowing for more flexibility in describing cross-linguistic syntactic phenomena.

Role and Reference Grammar (RRG) integrates syntactic, semantic, and pragmatic information, focusing on the role of verb argument structure and the linking between semantic and syntactic roles.

Optimality Theory (OT) for Syntax was originally developed for phonology, but has been applied to syntax. It proposes that surface forms result from the competition between constraints, rather than from transformational derivations.

Dynamic Syntax (DS) emphasizes the temporal dimension of syntax, modeling how sentences are processed in real-time, focusing on the incremental building of semantic representations.

Stochastic/Probabilistic Grammar models incorporate probabilities into grammatical descriptions, often used in computational linguistics to model language understanding and production processes based on statistical patterns.

Semantics

Put simply, semantics is the study of linguistic meaning. It evaluates how words get specific meanings, how the meaning of a complex combination-unit of grammar is dependent upon its constituent parts. The term derives from the Greek verb sēmainō (“to mean” or “to signify”), through the adjective sēmantikos (“significant”).

Natural languages (those that develop 'in the wild' among human populations) are known for being unboundedly productive; there is no upper limit in length, complexity, or number of grammatical expressions; Simpler expressions can be concatenated, relativized, complementized, et cetera to create ever-more-complex formulations. These complex expressions are both grammatically correct, and meaningful.

In addition to compositionality, semantic theories must also account for the phenomenon of reference. Reference is a characteristic of many expressions where the appearance of connection between words and the world seems quite unclear to outsiders by 'de jure meaning' (or sense) alone.

In the philosophy of language, the distinction between sense and reference was an idea of the German philosopher and mathematician Gottlob Frege (On Sense and Reference; Über Sinn und Bedeutung), reflecting the two ways he believed a singular term could have meaning.

  • Reference (Bedeutung): The reference of a term is the object it refers to in the world. For example, the reference of the name "Venus" is the planet Venus itself.
  • Sense (Sinn): The sense of a term is the way the reference is presented or the mode of presentation of the object. It involves the cognitive aspect of the term, how an individual conceives of the term's reference. For instance, "the morning star" and "the evening star" have the same reference (the planet Venus) but different senses, because they refer to Venus in different ways or contexts. The statement "the morning star is the evening star" is informative because it reveals that two different senses ("the morning star" and "the evening star") refer to the same object (Venus), even though the reference is the same.

The expressions triangular and trilateral, for example, are not synonymous, but there is no possible world in which they do not apply to exactly the same things. And the expression round square appears to be meaningful, but there is no possible world in which it applies to anything at all. Such examples are easy to multiply.

In semantics, truth is a property of statements that accurately present the world and true statements are in accord with reality. Whether a statement is true usually depends on the relation between the statement and the rest of the world; the truth conditions of a statement are the way the world needs to be for the statement to be true. For example, it belongs to the truth conditions of the sentence "it is raining outside" that raindrops are falling from the sky. The sentence is true if it is used in a situation in which the truth conditions are fulfilled: if there is actually rain outside.

The semiotic triangle is a model used to explain the relation between language, language users, and the world, represented in the model as Symbol, Thought / Reference, and Referent. The symbol is a linguistic signifier, either in its spoken or written form. The central idea of the model is that there is no direct relation between a linguistic expression and what it refers to; this is expressed in the diagram by the dotted line between symbol and referent.

The model holds instead that the relation between the two is mediated through a third component (the thought or reference). For example, the term apple stands for a type of fruit but there is no direct connection between this string of letters and the corresponding physical object; the relation is only established indirectly through the mind of the language speaker. When they see this string of symbols or hear this string of phonemes, it evokes a mental image or a concept, which establishes the symbol's connection to the physical reference. This process is only possible if the speaker learned the meaning of the symbol previously; the meaning of a specific symbol is governed by the conventions of each specific language.

Lexical relations describe how words stand to one another. Two words are synonyms if they share the same or a very similar meaning, like car and automobile, or buy and purchase. Antonyms have opposite meanings, such as with alive and dead, or fast and slow. One term is a hyponym of another if the meaning of the first is included in the meaning of the second (ant is a hyponym of insect). A prototype is a hyponym that has characteristic features of the type it belongs to (a robin is a prototype of a bird but a penguin is not). Two words with the same pronunciation are homophones like flour and flower, while two words with the same spelling are homonyms, like a bank of a river in contrast to a bank as a financial institution. Hyponymy is closely related to meronymy, which describes the relation between part and whole. For instance, wheel is a meronym of car. An expression is ambiguous if it has more than one possible meaning. The term polysemy is used if different meanings of a word are closely related to one another, like with the word head: the topmost part of the human body, or the top-ranking person in an organization.

Thematic roles

Also known as semantic roles, are conceptual categories that describe the relationship between a verb and its arguments (the entities involved in the action or state described by the verb). These roles help in understanding how different participants in a sentence are related to the action or state of the verb.

The Agent is the doer or initiator of an action, typically a volitional entity that performs the action described by the verb. Agents are usually animate and capable of intentional actions. In the sentence "The chef cooked dinner," the chef is the Agent; the one performing the action of cooking.

The Patient is the entity that undergoes the action or the entity that is affected by the action of the verb. Patients are often the recipient of an action's effects and can be animate or inanimate. In "The chef cooked dinner," "dinner" is the Patient because it is being acted upon - it undergoes the action of being cooked.

The Theme is similar to the Patient in that it is involved in the action, but it is often used in a broader sense to include entities that are being moved, experienced, or otherwise central to the action without being affected in a manner that changes their state. The distinction between Patients and Themes can be subtle and, in some contexts, overlap; themes are central to the action, but not necessarily affected by it. In "She read a book," the book is the Theme - it is involved in the action of reading without undergoing a change.

The Experiencer is the entity that perceives or feels an action or state rather than physically carrying it out. This role is typically associated with mental states and perceptions. Experiencers are sentient, usually animate, and involved in actions or states in a non-volitional manner. In "She loves the painting," "she" is the Experiencer because she is the one having the feeling of love towards the painting.

Morphology

The internal construction of words from morphemes; some languages have highly complex morphology, while others (like Vietnamese) have very little, or even none. In linguistic typology, languages are categorized based on how they use morphology to convey grammatical relationships.

Inflectional morphology involves the modification of a word to express different grammatical categories such as tense, mood, voice, aspect, person, number, gender, and case. Inflection does not change the basic category or meaning of a word but rather adjusts it to fit its role within a sentence.

Inflection conveys syntactic information necessary for the grammatical functioning of the word within a sentence. It helps in indicating relationships between different words in the sentence. The set of inflectional morphemes in a language is typically fixed or closed, meaning that new inflectional morphemes are rarely added to the language. Inflectional processes are generally productive, applying to nearly all words within the category they affect (though there can be irregular forms).

Derivational morphology involves creating a new word from an existing one by adding a prefix or suffix (or sometimes through other processes like internal change or compounding). Derivation alters the base word to create a new word with a related, but distinct, meaning (or it can place the word in a new grammatical category; noun to adjective, for example).

New derivational morphemes can be added to the language over time (the set is open), and the process can give rise to an extensive variety of new words. Derivational processes vary in productivity; some apply broadly across many words, while others are restricted to a limited set of cases.

A fundamental difference between inflection and derivation is that inflection adjusts a word's form to fit grammatical constraints without changing its core meaning or category, whereas derivation changes the word's meaning and often its category as well. Inflectional changes result in different inflected forms of the same word, whereas derivational changes result in entirely new derived words.

Analytic (Isolating) Languages

These languages tend to use a single morpheme (the smallest meaningful unit in a language) per word, meaning that they do not rely heavily on inflectional changes to indicate grammatical relationships. Instead, they often use word order, auxiliary words, and prepositions to convey grammatical relationships, tense, aspect, and other grammatical categories.

Synthetic Languages

These languages are characterized by a higher use of inflection or derivation to encode grammatical relationships within words. Synthetic languages can be further divided into:

Agglutinative: morphemes (prefixes, suffixes, infixes, or circumfixes) are added to a base word in a relatively straightforward manner, with each morpheme typically representing a single grammatical category (such as tense, number, case, or aspect). These morphemes are easily separable and maintain consistent forms across different words. This cumulative process allows for a high degree of precision and clarity in expressing nuances of meaning and grammatical relationships. Turkish, Finnish, Swahili, and Japanese are examples.

Fusional (Inflectional): morphemes combine several grammatical categories such as gender, number, tense, mood, or case into a single affix attached to a word. This means that a single inflectional ending or internal modification in a word can convey multiple grammatical details. These languages tend to have a rich inventory of morphological forms. A single verb form can encode person, number, tense, aspect, and mood simultaneously. Unlike agglutinative languages, where morphemes have clear boundaries, fusional languages often exhibit forms where it's challenging to segment the word into discrete morphemes due to the merging of grammatical categories within a single affix. Languages like Russian and Latin fit here.

Polysynthetic: multiple morphemes to express what might be conveyed in a full sentence in more analytic languages. These languages feature incorporation of nouns into verbs, allowing a single word to convey complex actions and relations. A single word can express what would require a full sentence in more analytic or even agglutinative languages. This includes incorporating subjects, objects, and relational information into one word. While agglutinative languages have clear morpheme boundaries and functions, polysynthetic languages often feature morphemes that blend together more fluidly, with less clear-cut distinctions between them. A single affix in a polysynthetic language might serve multiple grammatical or semantic roles. Examples include Inuktitut, Yupik, and Mohawk.

Affixation

A core morphological process used across languages to modify the meaning of a word or to adjust it for grammatical congruity. This process involves adding an affix, a bound morpheme (that cannot stand alone) to a base word (or root). The base word can be simple form or another morphologically complex form; affixes can be categorized by their position relative to the base and the functions they perform.

Types of Affixes

  • Prefixes are added to the beginning of a word. They can alter the meaning of the word or adapt it for grammatical purposes. The English prefix "un-" can be added to "happy" to form "unhappy."
  • Suffixes attach to the end of a word. They are prevalent in indicating grammatical categories like tense, case, or number, as well as deriving new words. In English, adding "-ness" to "happy" creates "happiness."
  • Infixes are inserted within the body of a word. Infixation is less common in Indo-European languages but is found in other language families, such as Austronesian. In Tagalog, the infix "-um-" can be inserted into "sulat" (write) to form "sumulat" (wrote).
  • Circumfixes (or disfixes) attach to a base word in two parts, one at the beginning and one at the end. Circumfixation is seen in languages like German, where "ge-" and "-t" can enclose a verb to form a past participle, as in "gesagt" (said) from "sagen" (to say).

Affixation serves two primary functions: derivational and inflectional.

  • Derivational Affixation creates new words with new meanings and often changes the grammatical category of the base. This process is crucial for expanding a language's vocabulary and can involve nuanced changes in meaning.
  • Inflectional Affixation modifies a word to fit grammatical requirements without changing the word's essential meaning or category. It indicates grammatical relationships and features such as tense, number, gender, mood, and case. Inflectional affixation is more uniform and limited compared to derivational affixation.

The distinction between these two types of affixation is not always clear-cut; some languages use the same morphological process for both grammatical functions and lexical derivation, blurring the lines between the categories.

Affixation in Morphological Types

  • Agglutinative languages tend to have a clear-cut, one-to-one correspondence between affixes and grammatical functions, stacking multiple affixes onto a base to express compound grammatical concepts.
  • Fusional languages allow affixes to convey multiple grammatical categories simultaneously, and the form of the affix can change depending on the word it attaches to, leading to less transparent relationships between form and function.
  • Polysynthetic languages can utilize extensive affixation, incorporating a large number of affixes into single, highly complex words that can express a great deal of information.

Reduplication

A morphological process involving the repetition of a whole or part of a word to convey grammatical functions or semantic nuances. This process can modify the meaning of the original word, indicate grammatical categories, or create new words, and can be partial or total, depending on whether part of the word or the entire word is repeated.

  • Total Reduplication: The entire word or root is repeated. This can serve various functions, such as pluralization, intensification, or aspectual distinctions.
    • In Indonesian, "orang" (person) can become "orang-orang" to indicate "people" or a plurality of persons.
  • Partial Reduplication: Only a segment of the word, such as a syllable or a set of phonemes, is duplicated. Partial reduplication can be used to denote diminutives, verb tenses, and other grammatical or semantic changes.
    • In Tagalog, "ganda" (beauty) can be reduplicated as "maganda" (beautiful) by partially repeating the first syllable and adding it as a prefix.

Reduplication serves a variety of linguistic functions, which can differ widely among languages:

  • Grammatical Aspect: Indicating completed, habitual, ongoing, or iterative actions.
  • Tense: Marking past, present, or future tense in verbs.
  • Mood: Showing mood such as indicative, imperative, or subjunctive.
  • Number: Distinguishing singular from plural or dual.
  • Intensity or Degree: Signaling intensity, emphasis, or degree of an adjective or adverb.
  • Diminutive, Augmentative, or Pejorative Meaning: Modifying nouns or adjectives to convey size, importance, or emotional tone.

The function of reduplication can also extend to creating onomatopoeic expressions or mimetic words, which is common in some languages to imitate sounds or describe movements and appearances.

Compounding

A morphological process involving the combination of two or more lexemes (independent words) to form a new word. The resulting compound word typically acquires a meaning that is distinct from, but related to, the meanings of its constituent parts. Compounds can be formed from various combinations of parts of speech, including nouns, adjectives, verbs, and adverbs, among others.

  • Two nouns are combined to form a new noun. For example, "toothbrush" (tooth + brush) or "bookcase" (book + case).
  • An adjective and a noun are combined, usually to describe a specific type of the noun. Examples include "blackboard" (black + board) or "greenhouse" (green + house).
  • A verb and a noun are combined, often describing an action related to the noun. For instance, "pickpocket" (pick + pocket).
  • Two verbs are combined to describe actions that are done together or in sequence. This is less common in some families but can be seen in some South Asian languages.

The compound often represents a concept or entity that is semantically unified, which means its overall meaning cannot always be directly inferred from the meanings of its parts. Compounding can involve morphological changes (such as the modification of roots) or phonological changes (such as stress shifts) to mark the compound status, or aid in pronunciation. Once words are compounded, the new formation behaves as a single unit syntactically. It takes on the grammatical roles as one word, and its internal structure is not affected by syntactic operations that affect phrases.

Compounds typically have a head, the part that determines the overall category and grammatical properties of the compound. The position of the head varies across languages; for example, in English, the head is usually on the right ("toothbrush" is a type of brush), whereas in Japanese, it is typically on the left (電話 "denwa," phone, is literally "electric + talk").

Endocentric compounds have a head that provides the compound with its basic meaning and category (e.g., "blueberry" is a type of berry). Exocentric compounds lack a clear head, and the compound does not belong to the category of any of its parts (e.g., "pickpocket" is not a type of pocket). The rules governing compounding and the productivity of this process vary widely among languages; some are highly compounding, creating many new words through this process, while others use it much more sparingly.

The semantic relationship between components of a compound can vary greatly and is often not explicitly marked, which can lead to a wide range of interpretations based on cultural or contextual knowledge.

Suppletion

This concept represents an irregular way to form word variants, such as different tenses of a verb or comparative and superlative forms of an adjective. Unlike regular morphological processes that add, modify, or replace affixes while keeping the base of the word recognizable, suppletion involves using entirely different roots to express grammatical or semantic relations. This process results in word forms that cannot be predicted from the form of the base word and must be memorized as separate lexical entries.

Suppletion is highly irregular and constitutes an exception to the normal morphological rules of a language. The suppletive forms bear no phonological or morphological resemblance to their base forms. It is therefore relatively rare and typically affects a small set of commonly used words within a language: often verbs, adjectives, and pronouns. Despite its irregularity, suppletion serves regular grammatical functions, such as tense, number, or degree of comparison.

One of the most well-known examples is the verb "to go" in English, which forms its past tense as "went," a word originally derived from the Old English verb "wendan" (to turn). Thus, "go" and "went" demonstrate suppletion. The adjectives "good," "better," and "best" in English show suppletion, where "better" and "best" do not derive from "good" through regular morphological processes. In many languages, personal pronouns exhibit suppletion. For example, in English, the forms "I," "me," "my," and "mine" are related through meaning and function but not through form.

Suppletion is found across a wide range of languages, indicating that it is a common, though irregular, linguistic phenomenon:

  • Latin: "To be" shows suppletion with "sum," "es," "est" (present) and "fui" (perfect).
  • Spanish: "To go" is "ir," but its future stem is "ir-" and the past stem is "fu-," as in "fui" (I went).
  • Russian: "To go" has "идти" (idti) in the present tense and "пойти" (pojti) in the future tense.

Suppletion is often explained as the result of historical linguistic evolution, where words from different etymological sources merge over time due to similar meanings or functions. These changes are usually driven by frequent use, phonological erosion, and the natural tendency for languages to regularize irregular forms, leaving only a few highly irregular suppletive forms. Linguists study suppletion to understand language change and the balance between irregularity and the drive for regular grammatical patterns. Suppletion often arises historically from different lexical sources merging into the grammatical system of a language.

Conversion

Also known as zero derivation or functional shift, conversion is a morphological process through which a word is reassigned to a new grammatical category without the addition of an affix or any change to its form. This process allows a word from one part of speech, such as a noun, to function as another part of speech, like a verb, simply through a shift in its grammatical context. Conversion allows languages to expand their lexical and functional capacity, often creating pairs of related words that differ in grammatical category but are identical in form.

Unlike other morphological processes, conversion involves no alteration to the word's spelling or phonetics; the change in grammatical function is indicated solely by the word's use in context. Conversion can occur in multiple directions, such as noun to verb, verb to noun, adjective to noun, et cetera. It is a highly productive means of word formation in many languages, allowing for the rapid creation of new words to meet communicative demands.

  • Noun to Verb: The use of a noun in a verbal context, thereby creating a verb with a related meaning. For example, "to bottle" (from "bottle," implying the action of putting something into bottles).
  • Verb to Noun: The use of a verb as a noun, often referring to the act or instance of performing the verb. An example might be "a run" (from "to run," referring to a single act of running).
  • Adjective to Verb: The transformation of an adjective into a verb, enabling the description of the action of becoming or making something have the quality of the adjective. For instance, "to empty" (from "empty," meaning to make something empty).

Cliticization

A morphological and phonological process involving clitics, grammatical elements that possess syntactic characteristics of words but behave phonologically like affixes. Clitics are dependent on adjacent words for their pronunciation and often cannot stand alone, lacking the stress patterns typical of full words.

They require a host word to attach to, as they cannot independently bear stress; unlike affixes, which are bound to specific parts of speech, clitics can attach to various categories of words. They often encode grammatical functions such as negation, possession, articles, pronouns, or auxiliary verbs. Proclitics attach to the beginning of a host word; enclitics attach to the end of a host word; mesoclitics are inserted within a host word.

In Romance languages like Spanish and Italian, object pronouns often appear as clitics attached to verbs. In Italian, "lo" (it) can attach to "vedo" (I see) to form "lo vedo" (I see it). In English, the possessive "'s" is considered a clitic that can attach to noun phrases to indicate possession, as in "the king of Spain's crown." In Greek, the definite article can act as a proclitic, attaching to the beginning of the word it modifies. In French, "ne" in the negation "ne...pas" behaves like a clitic, attaching to auxiliary or main verbs to form negatives, as in "Je ne sais pas" (I don't know).

Distinguishing clitics from affixes can be challenging but important. Affixes are inherently bound morphemes that directly modify the meaning or grammatical category of the words they attach to and are restricted to specific lexical categories. Clitics, while also bound in pronunciation, retain a syntactic identity separate from the host and can often attach to a broader range of categories.

Clitics pose a significant challenge for linguistic analysis and language learning, as their behavior intersects morphology and syntax, sometimes leading to debates and confusion about their classification or use.

Case Systems

Case systems organize how nouns and their related words (pronouns, adjectives) connect to verbs and other sentence elements, indicating their grammatical roles (such as subject, object, et cetera). Transitive and intransitive verbs are categories that describe how verbs interact with objects within sentences. The distinction between them is foundational in understanding sentence structure and grammar across languages.

Transitive verbs require one or more objects to complete their meaning; they express actions that are done to someone or something. The object of a transitive verb receives the action of the verb. In the sentence "She reads a book," "reads" is a transitive verb because the action is being performed on an object, "a book."

Intransitive verbs are verbs that do not take an object; their meaning is complete without needing to express an action being performed on someone or something. Intransitive verbs often describe actions, states of being, or occurrences that involve only the subject. In the sentence "He sleeps," "sleeps" is an intransitive verb because there is no object receiving the action; the verb simply describes the subject's state or action.

Some verbs can be both transitive and intransitive, depending on their use in a sentence; they are known as ambitransitive verbs. For example, "to eat" can be used transitively as in "She eats an apple" or intransitively as in "She eats quickly." While transitive verbs require direct objects to complete their meaning, some intransitive verbs can be followed by other types of complements (like adverbials or prepositional phrases) that provide additional information but do not receive the action of the verb.

A subtype of transitive verbs, ditransitive verbs, require two objects (direct and indirect) to complete their meaning, as in "She gave her friend a gift," where "gave" is the ditransitive verb, "her friend" is the indirect object, and "a gift" is the direct object.

Nominative-Accusative

In these language systems, the subject of both transitive and intransitive verbs is marked by the nominative case, and the object of transitive verbs is marked by the accusative case. This alignment emphasizes the action's agent, regardless of the verb's transitivity. This system is the most common type of alignment; many Indo-European languages, such as English, Spanish, and Russian, primarily use nominative-accusative alignment, although in English, case marking is largely restricted to personal pronouns ("I" versus "me").

Ergative-Absolutive

These languages mark the subject of an intransitive verb in the same way as the object of a transitive verb - both in what's called the absolutive case - while the subject of a transitive verb is marked differently, in the ergative case. This alignment emphasizes the action's undergoer or the participant most directly affected by the action. Languages such as Basque, Georgian, and many Australian Aboriginal languages exhibit ergative-absolutive alignment.

Split Ergativity

A system where both nominative-accusative and ergative-absolutive alignments are used, under different conditions. These conditions can be based on grammatical factors such as tense, aspect, or the person of the subject, or on semantic factors like the nature of the verb or its participants. Languages with split ergativity include Georgian, where the split is based on the verb's tense or aspect, and Hindi-Urdu, which shows ergative behavior in perfective aspects of certain verbs.

Tripartite

Where the subject of an intransitive verb, the subject of a transitive verb, and the object of a transitive verb each receive different case markings. This is relatively rare among the world's languages, as it requires a clear and distinct marking for each of these roles:

  • Nominative case marks the subject of intransitive verbs.
  • Accusative case marks the object of transitive verbs.
  • Ergative case marks the subject of transitive verbs.

This system provides a clear distinction between subjects acting upon objects and subjects acting alone. An example of a language with a tripartite alignment is Nez Perce, spoken in the Pacific Northwest of the United States.

Active-Stative

Languages of this system categorize subjects based on whether they are agents (performers of an action) or patients (recipients of an action). Unlike nominative-accusative or ergative-absolutive systems that categorize based on transitivity, active-stative systems allow the verb's subject to be marked differently based on semantic roles, even in intransitive sentences.

In Split-S languages there is a consistent rule for marking subjects of intransitive verbs as either agents or patients, based on the verb or the verb's semantics. In Fluid-S languages, the marking depends on the verb's semantics or the speaker's perspective. A verb like "to run" might mark its subject as an agent in one context but as a patient in another, depending on whether the running is volitional.

Direct-Inverse

This system typifies some indigenous languages of the Americas, and marks the direction of the action in terms of the hierarchy of participants (often based on animacy or topicality). A direct marking indicates that the action follows the expected hierarchy (a higher-ranking agent acting on a lower-ranking patient), while an inverse marking shows that the action goes against the expected hierarchy.

Differential Object Marking (DOM)

DOM is a related phenomenon where the object of a verb is marked differently based on specific semantic or pragmatic factors, such as definiteness, animacy, or specificity. This can sometimes blur the lines with accusative systems but serves different functions by highlighting certain features of the direct object.

Austronesian Alignment

This system, found in many Austronesian languages, features a unique way of marking the focus of a sentence through voice or trigger. Rather than focusing on the agent or patient inherently, the language can mark which participant is the focus of a given clause, affecting verb morphology.

Semantic Role-Based

Some languages employ case marking systems that are more directly tied to the semantic roles of participants, such as experiencer, benefactive, or instrument, rather than or in addition to their syntactic roles.

Fusion between Systems

Some languages exhibit features of more than one alignment system, demonstrating the fluidity of case roles and grammatical structures across languages. Case systems can change over time within a language, contributing to the development of mixed or evolving alignments. Historical linguistics often studies these changes to understand language evolution. The functional load of case markings can vary widely, with some languages relying heavily on case to indicate grammatical relations and others using word order or auxiliary words for the same purpose.

Tense, Aspect, and Mood (TAM)

Tense

Conveys when an action happens from the speaker's perspective, relating the position in time of the action; a specific point in the past or future, or, in some languages, even extending to the speaker's perspective on the flow of time itself.

  • Past: Indicates actions or states that occurred before the moment of speaking. It can be further subdivided into categories like simple past, past continuous, past perfect, and past perfect continuous, among others, depending on how specific languages conceptualize the flow of time and action.
  • Present: Describes actions or states occurring at the moment of speaking, or general truths that are timeless. It can also encompass habitual actions or states that are not necessarily happening at the moment but occur regularly.
  • Future: Refers to actions or states that are expected to occur after the moment of speaking; future tense can have subdivisions that provide additional nuances regarding the future action's certainty, immediacy, or manner of occurrence.

Some languages extend tense beyond these basics to express a wide range of temporal nuances:

  • Remote versus Recent/Near: differentiate between actions that happened a long time ago versus those that occurred more recently; or those that will occur in the remote future versus those that will occur in the near future.
  • Imperfect versus Perfect: imperfect tense refers to actions that were ongoing or not completed in the past, while the perfect tense indicates actions that were completed in the past and have relevance to the present.
  • Aorist: in some languages, the aorist tense describes actions without specifying whether they are completed or ongoing, focusing instead on the fact that an action simply occurs.

Aspect

Focuses on the internal structure of an action (how it unfolds over time) rather than its position in time; it allows speakers to provide insights into the process, completion, repetition, or duration of a described action.

  • Perfective aspect indicates that an action is viewed as a complete whole, without focusing on its internal structure. Perfective aspect often implies that the action has been completed, especially when referring to past actions, and is used to express a single, bounded event.
  • Imperfective aspect focuses on the internal structure of an action, viewing it as ongoing, habitual, or incomplete. The imperfective can describe actions that were happening in the past, are happening in the present, or will be happening in the future, without specifying an endpoint.
    • Progressive aspect is a subtype of the imperfective; it specifically indicates actions that are ongoing at the reference time. It zooms in on the process of the action.
    • Habitual aspect is another subtype of the imperfective; it describes actions that occur regularly or habitually over a period of time.
  • Perfect aspect focuses on the outcome or result of an action rather than the action itself. It implies that the action, which occurred in the past, has relevance or consequences for the present or future.
  • Prospective aspect indicates that an action is about to happen, focusing on the period immediately before the action starts.
  • Iterative or Frequentative aspect describes actions that are repeated multiple times.
  • Stative aspect refers to states or conditions rather than actions, emphasizing the static nature of the verb's meaning.

Some languages (especially in the Slavic subfamily) have pairs of verbs that differ only in aspect, known as aspectual pairs. They consist of one verb with a perfective aspect and another with an imperfective aspect, each capturing different temporal contours of the same action.

The Russian verb "писать/нап-исать" (to write) demonstrates how aspectual pairs work:

  1. Imperfective Aspect: "писать" (pisat')
    • This form is used to indicate an ongoing action, a repeated action, or a general statement about the action without specifying its completion.
    • "Она пишет письмо" (Ona pisat' pis'mo) — "She writes a letter" or "She is writing a letter." Here, the action is not necessarily completed; it might be ongoing or habitual. This aspect captures the process or internal duration of the action, without indicating whether the action has been completed; it's about the action in progress or habitually done over time.
  2. Perfective Aspect: "написать" (napisat')
    • This form denotes the action's completion, focusing on the outcome rather than the process.
    • "Она написала письмо" (Ona napisala pis'mo) — "She has written a letter." The focus is on the fact that the letter has been completed. This aspect views the action as a finished whole, emphasizing the action's completed state and its results.

This distinction between perfective and imperfective aspects can intersect with tense to provide a nuanced temporal framework; in Russian, tense is primarily marked on the imperfective verb, while the perfective verb inherently carries a sense of completion.

Aspect can be expressed through various means depending on the language; many languages use affixes or verb conjugation to indicate aspect (morphology). In periphrastic languages, aspect can be indicated by auxiliary verbs or verb phrases, while some languages known for suppletion use entirely different verbs to express aspectual distinctions.

Periphrasis uses a combination of auxiliary verbs alongside the main verb, rather than through inflection or single aspectual verbs; it allows more than one word to convey the temporal structure, phase, or continuity of the action or state described by the verb.

The use of auxiliary (or helping) verbs in combination with the main verb is central to periphrastic aspect expression; these auxiliaries modify the main verb to indicate aspectual nuances such as ongoing action, completion, or repetition. Unlike with inflectional strategies, the main verb often remains in a base or infinitive form, and aspectual distinctions are made through the auxiliary verbs and their placement.

Because this form of aspect expression relies on syntax (the arrangement of words) rather than morphology (word structure changes), it allows for considerable flexibility in how actions and states can be temporally framed within a sentence.

  • "She is singing;" the auxiliary verb "is" combined with the present participle "singing" expresses an ongoing action.
  • "She has sung;" the auxiliary "has" combined with the past participle "sung" indicates a completed action with relevance to the present.
  • "Elle va chanter;" (she is going to sing). The verb "va" (goes) combined with the infinitive "chanter" (to sing) expresses an action that is about to happen in the near future.

Languages that rely heavily on periphrastic constructions for aspect expression tend to be more analytic; synthetic languages tend to encode aspect directly onto the verb through inflection. Periphrastic constructions can offer a wide range of nuanced temporal distinctions, reflecting subtle differences in how an action is viewed in terms of its progress, completion, or frequency. Over time, periphrastic constructions can become grammaticalized into more condensed forms, showing how languages can evolve from more analytic to more synthetic expressions, or vice versa.

Suppletion occurs when a language uses entirely different roots to express grammatical variations such as tense, number, person, or, as in this case, aspect, which results in pairs or sets of words that are related in meaning but have no resemblance in form.

In the context of aspect, suppletive forms are used to differentiate between how an action or state is viewed in terms of its temporal flow - such as whether it is ongoing, completed, or habitual. Suppletion in aspect is less common than in tense or person marking but can be found in languages that have a high degree of irregularity or in verbs that are frequently used and thus subject to historical changes and irregularities.

A compelling example of using completely different roots to signify aspectual differences comes from Irish (Gaelic); Irish, like other Celtic languages, often exhibits suppletion in its verb system. The verb "to be" in Irish has the root "bí" for the present tense, which is used in contexts that suggest habitual or ongoing actions. "Bíonn siad ag siúl" – "They walk (habitually)" or "They are walking (as a general activity)." For the past tense, indicating completed actions, Irish uses a completely different root, "bhí." "Bhí siad ag siúl" – "They walked" or "They were walking (at a specific instance in the past)."

Mood

Conveys the speaker's attitude towards the action or state described by the verb, indicating modality - the manner in which the action is regarded or related to reality. Unlike tense, which places actions or states in time, and aspect, which describes the nature of the action's flow, mood deals with the speaker's perspective on the certainty, necessity, or desirability of the event, among other nuances.

  • The indicative mood is the most common, used for stating facts or asking questions. It presents the action or state as real, actual, or certain. "She goes to school" (a statement of fact).
  • The imperative mood is used for commands, requests, or prohibitions; it directly addresses the listener and expresses a directive. "Go to school!" (a command).
  • The subjunctive mood is used to express wishes, hypothetical situations, demands, or conditions contrary to fact. It often indicates something desired, possible, or imagined. "If she were president, she would change the law" (a hypothetical situation).
  • The conditional mood indicates actions or states that are dependent on certain conditions. It often expresses hypothetical outcomes and is sometimes considered a subcategory of the subjunctive. "She would go to school if she woke up early" (a condition and its potential outcome).
  • The optative mood is found in some languages, expressing wishes or hopes. "May you have a long life!" (a wish).
  • The jussive mood is used to express commands, suggestions, or necessity, often in the first or third person. "Let us pray" (a suggestion or command).
  • The potential mood expresses possibility or capability. "She might go to school" (a possibility).
  • The interrogative mood is recognized by some linguists, used for asking questions. "Does she go to school?" (a question).

Mood is closely related to modality, which more broadly refers to the speaker's attitude regarding the likelihood, necessity, or obligation of the action or state. Modality can be expressed through mood, but also through modal verbs ("can," "must," "should"), adverbs, or other linguistic means, depending on the language.

The expression and classification of mood can vary significantly across languages. Some languages have distinct morphological markers for mood, with specific verb forms for the subjunctive, conditional, or imperative moods. In other languages, mood may be indicated more subtly through context, word order, particles, or the use of modal auxiliary verbs without distinct morphological changes to the main verb. While the moods listed above are relatively common, individual languages may have additional moods or lack some of these categories altogether.

Voice and Valency

Voice

Describes the relationship between the action expressed by the verb and the participants identified by the subject, object, and sometimes other parts of the sentence. It's a way to articulate who is doing the action and who is receiving it, highlighting different aspects of the event or state described by the verb. Voice can change the focus of a sentence without altering the factual content of the action being described.

In the active voice, the subject of the sentence performs the action of the verb. The focus is on the agent (the doer) of the action; active voice constructions are direct and are often used to emphasize the agent's role in the action.

In the passive voice, the focus shifts from the agent to the recipient (or patient) of the action. The subject of the sentence is the entity that experiences the action, while the doer of the action (if mentioned) comes later in the sentence.

In the middle voice found in some languages, the subject performs an action upon themselves or for their benefit. It blurs the distinction between the agent and patient roles. In Ancient Greek, the middle voice could be used for reflexive actions or situations where the subject acts in its own interests.

Languages may have additional voices that provide different perspectives on the action.

  • Causative voice indicates that the subject causes someone else to perform the action.
  • Reflexive voice shows that the subject performs the action upon themselves.
  • Reciprocal voice is used when subjects perform the action on each other.
  • Antipassive voice is found in ergative-absolutive languages; it decreases the valency of transitive verbs, often omitting or demoting the object, and focusing on the action and its agent.

Valency

Refers to the capacity of verbs to combine with a specific number of arguments (subjects, objects, or complements) to form a complete and meaningful sentence. It is a concept that helps in understanding how verbs function and interact with other sentence elements to convey information. The term originates from chemistry, where it describes how atoms combine based on their number of valence electrons, similarly applied in linguistics to describe how verbs "bond" with their arguments.

Intransitive verbs (monovalent) require only one argument (typically a subject) to form a complete sentence. They do not take a direct object. "The baby slept."

Transitive verbs (divalent) need two arguments to complete their meaning - a subject that performs the action and a direct object that receives the action. "She reads a book."

Ditransitive verbs (trivalent) require three arguments: a subject who performs the action, a direct object that receives the action, and an indirect object that is affected by the action. "He gave her a gift."

Processes like passivization (the transformation from active to passive voice) can reduce a verb's valency by removing the need for an explicit subject, as in the transformation from "She opens the door" to "The door is opened." Conversely, certain constructions can increase a verb's valency by adding arguments, such as causatives which make an intransitive verb take an additional causee argument: "He made her laugh." Some linguists also talk about zero-valent verbs (or weather verbs) that do not require any arguments, expressing situations rather than actions performed by agents: "It is raining."

Valency is crucial for sentence structure, influencing word order and the use of grammatical markers such as prepositions or case markings to identify the roles of different arguments. Understanding valency helps in identifying the semantic roles played by different sentence elements, like agent, patient, theme, or experiencer, enhancing the analysis of sentence meaning. Valency patterns vary significantly across languages, with some languages offering more flexibility in valency alteration than others. This variability is a key area of study in typological linguistics and comparative grammar.

Applicatives allow for the addition of an argument to a verb that wouldn't normally take it, effectively increasing the valency of the verb. This is achieved by applying an applicative morpheme to the verb, which enables the inclusion of an indirect object, beneficiary, location, instrument, or other roles directly into the verb phrase. Unlike causatives, which introduce a causee and create a causation relationship in the action, applicatives enhance the verb to involve an additional participant more directly in the action or event described.

Applicatives increase a verb's valency by adding a new argument to the core argument structure of the verb; this additional argument is often an indirect object but can also express other semantic roles. The added argument can specify a wide range of roles, including but not limited to:

  • Beneficiary: Who benefits from the action.
  • Location: Where the action takes place.
  • Instrument: What is used to perform the action.
  • Reason: Why the action is performed.

In languages with applicative constructions, the addition of an applicative morpheme to the verb signals that an extra argument is being integrated into the verb's argument structure. This morpheme can be a prefix, suffix, or infix, depending on the language's morphological typology.

In Chichewa (a Bantu language), the applicative suffix "-ir-" can be added to a verb to include an indirect object or beneficiary. For instance, from "pita" (go) to "pitira" (go for the sake of), allowing the sentence to include whom the action is performed for. In Navajo (a Na-Dené language), the use of certain verbal prefixes can add locative or instrumental arguments to the action described by the verb, altering its valency to directly incorporate these elements into the action's core meaning.

Applicatives provide a mechanism for languages to manipulate and expand syntactic possibilities, allowing speakers to incorporate additional information directly into the verb phrase and providing a flexibility to verbal valency. By incorporating additional participants directly into the verb's argument structure, applicatives can offer a more nuanced and precise way to express actions and relationships between entities.

Semantic Roles and Relations

  • Thematic Roles: Agent, Patient, Theme, Experiencer
  • Semantic Fields and Lexical Sets

Language Change and Evolution

  • Historical Linguistics and Language Change
    • Sound Changes, Analogical Changes
    • Grammaticalization and Language Contact
  • Dialectology and Sociolinguistics
    • Language Variation and Change
    • Social Factors in Language Change

Comparative Studies Across Language Families

  • Indo-European Languages
  • Afro-Asiatic Languages
  • Sino-Tibetan Languages
  • Language Isolates and Constructed Languages

Research Methods in Comparative Linguistics

  • Comparative Method and Reconstruction
  • Typological and Areal Linguistics
  • Corpus Linguistics and Computational Approaches

Current Trends and Future Directions

  • Interdisciplinary Approaches: Cognitive Linguistics
  • Endangered Languages and Language Revival
  • Universal Grammar and Language Acquisition