Thematic and Grammatical Hierarchies in Persian Language

Document Type : مقالات علمی پژوهشی

Author
Assistant Professor of Linguistics, Allameh Tabataba`I university, Tehran, Iran
Abstract
There is no one-to-one correspondence between semantic roles and grammatical relations. Based on the concept of implicational hierarchy, some linguists have attempted to identify the relationship between grammatical relations and thematic roles. In this descriptive research, based on the Persian Proposition bank, Semantic Role Corpus in the Persian Language, and Persian Syntactic Dependency Treebank, the semantic hierarchy for different grammatical relations like subject, direct object, indirect object, subject complement and object complement have been explored in the one hand and also the grammatical hierarchy for different thematic roles like agent, experiencer, cause, theme, stimulus, attribute have been considered on the other hand. The syntactic and semantic corpora above consist of approximately 30,000 sentences and about half a million tokens. Observations showed that in the Persian language, among different kinds of semantic roles, the first candidate to be the subject is the agent followed by the theme, experiencer and cause. For the object position, the theme outranks the result which in turn outranks the topic and the recipient. In the direct object position, the most frequent role is the location followed by theme, recipient and destination; and finally, in the subject and object complement position, the attribute is the most frequent role among the other thematic roles in the sentences. On the grammatical hierarchy for thematic roles, subject outranks the object, as the agent and the experiencer; subject outranks the adjunct as the cause; triple preposition outranks the subject as the stimulus; object outranks the subject as the theme, and finally, the subject complement outranks the subject complement as the attribute. Apart from this dominant tendency, this research showed that there are other remarkable correlations between syntactic and semantic arguments; for example, the agent in addition to subject position can be placed in the direct object, indirect object, and even in the adjunct position. The appearance of experiencer in object position and stimulus in subject position is the other remarkable result.



1. Introduction

Grammatical relations do not necessarily correspond to the specific semantic roles. For example, although the subject usually tends to be agent, data-driven studies show that the other thematic roles like theme, experiencer or cause tend to be in the subject position too. In this study, we sought to answer these two questions; what the relationship between the hierarchy of semantic roles and grammatical relations is? And what the thematic and grammatical hierarchies in the Persian language are? This study hypothesizes that Persian native speakers usually place agent, patient, recipient, and attribute respectively into the subject, object, prepositional object, object complement, and subject complement position.



2. Literature Review

Based on the concept of grammatical hierarchy, Fillmore (1968), Jackandoff (1972), Comrie (1981), Givon (1984), Kroft (1990), Dowty 1991, Saeed (2003), and some other linguists have shown the relationship between semantic roles and grammatical relations in different ways. Fillmore (1968) states that not all semantic roles appear equally in the subject position. It seems that there is a tendency for the actor to be the subject than the instrument. Among the instrument and the receptive, the instrument more likely appears in the subject position. Saeed (2003: 155) constructs the subject hierarchy as below:

AGENT>RECIPIENT/BENEFACTIVE>THEME/PATIENT>INSTRUMENT>LOCATION

In the Persian language, some studies have separately concentrated on semantic roles, grammatical relations, or grammatical hierarchies. For example, Farrokhi Rad (2007), Mirzaei & Moloodi (2014), and Alizadeh & Rezghi (2015) have studied thematic roles in the Persian language. Bahrami (2014) and Bahrami (2017) have explored the animacy hierarchy and definiteness hierarchy. Gholipour (2018) has studied some thematic roles in identical syntactic valency structures of Persian verbs.

3. Methodology

To achieve the hierarchy of grammatical relations and semantic roles in Persian language and to examine the hypotheses of this research, we used the Persian proposition Bank (Mirzaei & Moloodi: 2016), Semantic Role Corpus in Persian Language (Mirzaei and Moloudi, 2015), and Persian syntactic dependency treebank (Rasooli et al., 2013). These corpora, which have the same content, consist of approximately 30,000 sentences. The two first were annotated whit the predicate-argument information and the third with syntactic annotations in addition to morpho-syntactic features.

In the present study, in order to investigate the correspondences between the semantic roles of the verbs and grammatical relations, at first, the semantic correlations of grammatical relations including subject, object, the prepositional object, subject complement, and object complement have been considered. Then the syntactic correlations of semantic roles including agent, experiencer, cause, theme, stimulus, and attribute have been explored.



4. Results

As shown in Table 1, findings indicated that among different semantic roles, the first candidate that tends to be the subject is the agent followed by the theme, experiencer and cause. For the object position, the theme outranks the result which in turn outranks the topic and the recipient. In the direct object position, the most frequent role is the location followed by theme, recipient and destination; and finally, in the subject and object complement position, the attribute is the most frequent role among the other thematic roles in the sentences.



Table 1

grammatical relations and equivalent semantic roles




first thematic role & frequency
second thematic role & frequency
third thematic role & frequency
forth thematic role & frequency




Subject
agent

14440
theme

9985
experiencer

3145
cause

1409


Object
theme

1343
result

1093
topic

564
recipient

372


Prepositional object
location

1680
theme

1463
recipient

1084
destination

644


Subject complement
attribute

3875
the other roles

< 100




Object complement
attribute

692
the other roles

< 100







On the grammatical hierarchy for thematic roles, as shown in Table 2, the subject, object, adjunct, and prepositional object respectively tend to be the agent. This hierarchy is different for the cause, in that subject tends to be in the first position and then adjunct, triple preposition, and subject complement. For the experiencer, stimulus, theme and attribute, the grammatical hierarchy respectively is as below:

experiencer: subject> object> triple preposition> subject complement

stimulus: triple preposition> subject> complement clause> object

theme: object > subject> triple preposition> complement clause

attribute: subject complement> object complement> subject> complement clause



Table 2

semantic roles and equivalent grammatical relations




agent
subject

14440
object

214
adjunct

221
prepositional object

62


cause
subject

1409
adjunct

99
* triple preposition

44
subject complement

27


experiencer


subject

3145
object

348
triple preposition

92
subject complement

29


stimulus
triple preposition

615
subject

218
complement clause

110
Object

63


theme
object

13430
subject

9985
triple preposition

6006
complement clause

3205


attribute
subject complement

3875
object complement

692
subject

152
complement clause

74



* object complement, prepositional complement of noun and prepositional complement of adjective are introduced as the triple preposition.



5. Conclusion

According to Dowty 1991, the best theory for describing the argument selection will be one that involves just two roles named Proto-Agent and Proto-Patient. Proto-Agent includes agent, causer, experiencer as the roles that they have volitional involvement in the event or state, causes an event or change of state in another participant, moves relative to the position of another participant, internally causes the event or state.

The concept of Proto-Patient also includes that role which undergoes a change of state, is causally affected by another participant, is stationary relative to the movement of another participant, or is subject of an external cause

This study showed that the subject as a relational meaning in the Persian language is equal to the concept of Proto-Agent as a thematic role. The object also is nearly equal to the concept of Proto-Patient as a thematic role.

Additionally in this study, the semantic hierarchy for different grammatical relations like subject, direct object, indirect object, subject complement, and object complement have been explored on the one hand and also the syntactic hierarchy for different thematic roles like agent, experiencer, cause, theme, stimulus, attribute have been considered on the other hand.

Keywords

Subjects


بهرامی، فاطمه (1392) معرفگی و نمایه‌سازی مفعول در زبان فارسی، پژوهش‌های زبانی، د 4، ش2، صص 1-20.
-------------- (1392) نمایه‌سازی مفعول صریح و جانداری، مجموعه مقالات نهمین همایش زبان‌شناسی ایران، تهران: دانشگاه علامه طباطبائی.
بهرامی، کاوه (1396). بررسی سلسله‌مراتب دسترسی نقش‌های نحوی در زبانهای آلمانی و فارسی، دوماهنامه جستارهای زبانی، د 8، (پیاپی 38، )، صص 42-23.
طبیب‌زاده، امید. (1391). دستور زبان فارسی بر اساس نظریۀ گروه‌های خودگردان در دستور وابستگی. تهران: مرکز.
طبیب‌زاده، امید. (1395). تناوب‌های گذرایی در فارسی؛ پژوهشی بر اساس آراء بت لوین. جستارهای زبانی. د7. ش2 (پیاپی 30).
علیزاده علی، و فرزانه رزقی (1394). جایگاه سازه‌های موضوعی زبان فارسی در ساخت لایه‌ای بند دستور نقش و ارجاع، مجموعه مقاله‌های دهمین همایش بین‌المللی ترویج زبان و ادب فارسی، صص 409-429.
فرخی‌راد، الهام (1386). بررسی پیوستار نقش‌های معنایی در زبان فارسی، پایان‌نامۀ کارشناسی ارشد، دانشگاه الزهرا.
قلی‌پورکشمرزی، حمیدرضا (1397). بررسی نقش‌های معنایی در ساخت‌های ظرفیتی نحوی مشابه در افعال فارسی بررسی دو ساخت «فاعل، مفعول حرف‌اضافه‌ای از» و «فاعل، مفعول حرف‌اضافه‌ای به»، پایان‌نامۀ کارشناسی ارشد، دانشگاه علامه طباطبائی.
مشکوه‌الدینی. مهدی. (1379). دستور زبان فارسی بر پایۀ نظریه گشتاری. ویرایش دوم. مشهد: دانشگاه فردوسی مشهد.
میرزائی، آزاده، امیرسعید مولودی. (1394). نخستین پیکرۀ نقش‌های معنایی زبان فارسی، علم زبان. دوره 2، شماره 3، صفحه 29-48.
ناتل خانلری. پرویز. (1392). دستور زبان فارسی. ویرایش 3. تهران: توس.
Comrie, Bernard (1989). Language universals and linguistic typology. Oxford: Blackwell.
Croft, William (1990). Typology and universals. Cambridge: Cambridge University Press.
Dowty, David R. (1991). Thematic proto-roles and argument selection. Language. 67(3), pp. 547-619.
Fillmore, Charles. J. (1968). The case for case. In E. Bach, & R. T. Harms (Eds.), Universals in linguistic theory (pp. 1-88). New York, NY: Holt, Rinehart, and Winston.
Fillmore, Charles. J. (1976). Frame semantics and the nature of language. Annals of the New York Academy of Sciences. Vol. 280. pp. 20-32.
Givón. Talmy (1984).Syntax: A functional-typological Introduction. Vol. 1. Amsterdam: Benjamins.
Jackendoff, Ray (1972). Semantic Interpretation in Generative Grammar. Cambridge: MIT Press.
Kipper, Karin, Anna Korhonen, Neville Ryant and Martha Palmer (2006) Extending VerbNet with novel verb classes. Proceedings of LREC. Vol. 2006. pp. 1027-1032.
Kipper, Karin, Anna Korhonen, Neville Ryant and Martha Palmer. (2008) A large-scale classification of English verbs. Language Resources and Evaluation 42, no. 1: 21-40.
Mirzaei, Azadeh & Amirsaeid Moloodi (2016). Persian proposition Bank. In Proceedings of the 10th International Language Resources and Evaluation. Portorož (Slovenia). May. pp. 3828-3835.
Palmer, Marta, Daniel Gildea and Paul Kingsbury (2005). The proposition bank: An annotated corpus of semantic roles. Computational linguistics. 31(1). pp. 71-106.
Rasooli, Mohammad Sadegh, Manouchehr Kouhestani, and Amirsaeid Moloodi (2013). Development of a Persian syntactic dependency treebank. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Atlanta, USA, pp. pp. 306-314.
Saeed, John I. (2003). Semantics. Blackwell publishing.
VerbNet Guidelines, https://verbs.colorado.edu/verb-index/VerbNet_Guidelines.pdf