A Phase-Based and Feature-Driven Analysis of English, Urdu, and Saraiki in the Minimalist Program
This blog post develops a unified syntactic-cognitive model of English, Urdu, and Saraiki within the Minimalist Program, arguing that apparent typological variation reflects parameterization of a single computational system rather than independent grammatical architectures. The analysis proposes that all three languages instantiate a shared derivational engine composed of Merge, feature valuation (Agree), and phase-based computation (vP, TP, CP). Differences in word order, agreement alignment, and discourse structuring are derived from variation in feature strength, phase visibility, and interface dominance rather than from distinct rule systems.
The post further argues that South Asian languages (Urdu and Saraiki) provide critical empirical evidence for the visibility of discourse-sensitive syntax, ergative alignment as phase-conditioned agreement restructuring, and scrambling as feature-driven Internal Merge. English is used as a comparative baseline for structurally rigid TP-driven linearization. The post contributes to syntactic theory by demonstrating that cross-linguistic variation is reducible to constraints on computational economy and interface mapping.
1. Introduction: Rethinking Language as Computational Architecture
Traditional linguistics often treats languages as discrete systems. This monograph rejects that assumption.
Instead, it adopts the Strong Uniformity Hypothesis:
All human languages are manifestations of a single cognitive computational system.
This system, often termed the Faculty of Language (FLN), is minimally composed of:
Within this framework, English, Urdu, and Saraiki are not separate grammars but distinct parameterizations of a unified derivational engine.
The central question is therefore not what languages differ in, but:
How does a single computational system generate divergent surface configurations under different parameter settings?
2. Theoretical Framework
2.1 The Minimalist Architecture
The model assumes three core operations:
Merge: syntactic combinationAgree: feature valuation under c-command
Move (Internal Merge): reapplication of Merge to satisfy feature constraints
These operate over cyclic domains called phases:
transfers completed structure to interface systems (SEM/PF)
restricts accessibility (Phase Impenetrability Condition)
2.2 Functional Hierarchy
The clause is structured as a nested system of functional projections:
Each layer corresponds to a distinct cognitive domain:
VP: event structurevP: argument structure and agency
TP: temporal anchoring and agreement
CP: discourse force and clause typing
3. Methodology
This study adopts a theory-driven comparative syntactic method, combining:
Structural diagnostics
word order variation
agreement patterns
case alignment (ergative vs nominative systems)
Cross-linguistic triangulation
English (rigid SVO, weak morphology)
Urdu (mixed alignment, rich agreement)
Saraiki (discourse-sensitive scrambling, ergative marking in perfectives)
Minimalist derivational modeling
phase-based computation
feature valuation under locality constraints
No corpus frequency assumptions are required; analysis is based on structural possibility space, not usage probability.
4. The vP Domain: Event Structure and Agency Encoding
4.1 The vP Shell Hypothesis
Following modern generative theory, the verb phrase is decomposed:
V introduces the lexical event
v introduces external argument (Agent)
This explains why subjects behave differently from objects across languages.
4.2 South Asian Evidence: Ergative Alignment
Urdu and Saraiki exhibit split ergativity:
Perfective: object agreementImperfective: subject agreement
This is not morphological irregularity but:
a phase-conditioned redistribution of Agree relations under aspectual v.
5. The TP Domain: Temporal Anchoring and Agreement
5.1 T as the Categorical Anchor of Finite Clauses
TP introduces:
tense interpretationagreement valuation
clause finiteness
English encodes tense morphologically.
Urdu distributes tense across the following:
auxiliariesaspectual markers
verb inflection
Saraiki relies heavily on:
contextual temporal anchoring
aspect-driven inference
5.2 The EPP Requirement
TP enforces a structural constraint:
Every TP must have a subject in Spec-TP.
This explains obligatory subject presence in English and structured subject positioning in Urdu/Saraiki despite scrambling.
6. CP Domain: Discourse, Force, and Clause Typing
6.1 Complementizer System
CP encodes clause type:
C-head features include:
[±question][±declarative]
[±embedded]
6.2 South Asian CP Sensitivity
Urdu and Saraiki show:
flexible word order driven by discourse prominenceovert marking of clause embedding
strong topic-focus interaction
This suggests CP is not peripheral; it is dominant in discourse-sensitive languages.
7. Word Order as Derived Linearization
Word order is not base-generated.
It is the result of post-syntactic linearization of hierarchical structure.
English:
rigid SVO due to early linearization constraints
Urdu/Saraiki:
late linearization allows scrambling under discourse features
Thus:
Variation in word order reflects differences in linearization timing, not structural architecture.
8. Agreement as Feature Computation
Agreement is modeled as:
Cross-linguistic variation arises from:
probe strengthaccessibility of goals
case-driven visibility conditions
Urdu and Saraiki show agreement sensitivity to:
ergativityaspect
argument prominence
9. Movement as Internal Merge and Discourse Optimization
Movement is reinterpreted as:
Internal Merge driven by feature satisfaction.
In South Asian languages:
scrambling reflects discourse-driven Internal Mergewh-movement reflects CP feature attraction
subject raising reflects EPP satisfaction
10. Comparative Synthesis: Three Languages, One System
Despite superficial divergence, all three languages instantiate:
identical hierarchical structureidentical computational primitives
identical phase system
Differences emerge from:
strength of functional featurestiming of linearization
interface dominance (syntax vs discourse)
11. Theoretical Implications
11.1 Rejection of Language-Specific Grammar Hypotheses
The data supports a strong universalist position:
There is no Urdu grammar, no Saraiki grammar, and no English grammar, only parameterized realizations of one grammar.
11.2 Syntax as Cognitive Architecture
Language is not communicative code alone.
It is:
a hierarchical computation systemconstrained by memory and locality
optimized for interface interpretability
11.3 South Asian Languages as Theoretical Diagnostics
Urdu and Saraiki are not peripheral cases.
They are:
diagnostic systems for ergativityevidence for discourse-driven CP activation
empirical support for late linearization models
12. Conclusion
This blog post has argued that:
Human language is a single computational system (FLN)Variation across English, Urdu, and Saraiki is parameter-driven
Syntax is phase-based and feature-driven
Word order is a surface reflex of hierarchical computation
The deeper implication is foundational:
Languages do not differ in what they are.
They differ in how a single cognitive architecture is externally realized.
Bibliography
Chomsky, N. (2000). Derivation by Phase

