A Corpus-based Investigation of the Near-synonymy of ‘‘ziyâd and farâvân’’ Size Adjectives according to Behavioral Profile Approach

Document Type: Research Article

Authors
1 Associate Professor, Department of Foreign Languages and Linguistics, Faculty of Literature and Humanities, University of Shiraz, Shiraz, Iran
2 PhD Candidate, Department of Foreign Languages and Linguistics, Faculty of Literature and Humanities, University of Shiraz, Shiraz, Iran
3 Assistant Professor, Department of Foreign Languages and Linguistics, Faculty of Literature and Humanities, University of Shiraz, Shiraz, Iran
Abstract
A review of the literature shows that no significant research has been done on the lexical relations of size adjectives in Persian. This paper studies the near-synonymy of the size adjectives ‘‘ziyâd’’ and ‘‘farâvân’’ and their inflectional forms, based on the Behavioral Profile approach of Divjak and Gries (2006) and data from the Hamshahri 2 corpus (AleAhmad et al., 2009). First, using the AntConc software (Anthony, 2011), the data for the size adjectives ‘‘ziyâd, ziyâdi, ziyâdtar, ziyâdtarin, ziyâdtari, farâvân, farâvâni, farâvântar, farâvântarin, farâvântari’’ were extracted from the corpus and then annotated for morphological, syntactic and semantic features. The behavioral profiles, evaluated by hierarchical cluster analysis, indicated that the highest level of near-synonymy holds within the groups ‘‘ziyâdi, farâvâni’’, ‘‘ziyâdtari, farâvântari’’, ‘‘ziyâdtar, farâvântar’’, ‘‘ziyâd, farâvân’’ and ‘‘ziyâdtarin, farâvântarin’’, and the lowest level holds between the groups ‘‘ziyâdi, farâvâni, ziyâdtari, farâvântari’’ and ‘‘ziyâd, farâvân, ziyâdtarin, farâvântarin, ziyâdtar, farâvântar’’. Finally, an analysis of the annotations based on Z-scores identified which ID tags of the size adjectives are significant and which are not.

1. Introduction

Persian is one of many languages that have a class of words whose grammar is clearly different from that of other classes and which can be identified as adjectives. Adjectives also perform semantic tasks; one major task is to denote a property of things. For example, the sentence Ali's house is big refers to Ali's house and attributes to it the property of being big.

Although traditional grammarians stressed formal features, linguists today typically classify adjectives according to semantic features. One lexical field that has received a lot of attention, both in general and in corpus-based work, is that of size adjectives, presumably because it includes two pairs of canonical antonyms, and many studies have already focused on it. For example, Charles and Miller (1989) showed that big and little as well as large and small tend to occur in one and the same sentence in the Brown corpus with a probability that is much larger than chance would predict (Gries and Otani, 2010). In Persian, however, size adjectives have received no attention. This motivates us to investigate the near-synonymy of the size adjectives ziyâd and farâvân and their inflectional forms according to the behavioral profile approach.

The body of this paper is structured as follows. Section 2 briefly reviews previous work on adjectives, in particular size adjectives, on synonymy, and on applications of the behavioral profile approach. Section 3 introduces the behavioral profile approach. Section 4 deals with the research methodology: the corpus-based data, the annotation process, and random data selection using the Krejcie and Morgan (1970) table. Section 5 analyzes the statistical information obtained from the annotation process using SPSS. Finally, Section 6 presents the results.



2. Methodology

Divjak and Gries (2006) identified the following four steps for the behavioral profile approach: 1. The retrieval of (a representative random sample of) all instances of the lemmas from a corpus. 2. Manual analysis and annotation of many properties of each match in the concordance of the lemmas. 3. The conversion of the data into a co-occurrence table. 4. The evaluation of the table by means of exploratory and other statistical techniques. To carry out these steps in this study, we first used the AntConc software to retrieve all matches of the lemmas ziyâd and farâvân and their inflectional forms from the 150-million-word Hamshahri 2 corpus (AleAhmad et al., 2009). We then annotated the lexical forms for morphological, syntactic and semantic features in order to identify their distributional features. Finally, we used SPSS to evaluate the data and produce hierarchical agglomerative cluster charts.
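Steps 3 and 4 can be illustrated with a minimal Python sketch. It is not the SPSS procedure used in this study: the form names, the variable name and tag levels, and the counts are invented toy values, and the choice of Canberra distance with average linkage is an assumption made only for illustration. The sketch turns annotated matches into a table of relative frequencies (one behavioral profile per form) and then merges the most similar profiles step by step.

```python
from collections import Counter, defaultdict
from itertools import combinations


def behavioral_profiles(rows):
    """Step 3: rows are (form, variable, level) annotations.
    Returns {form: {(variable, level): relative frequency within the variable}}."""
    counts = defaultdict(Counter)  # form -> counts per (variable, level)
    totals = defaultdict(Counter)  # form -> counts per variable
    for form, variable, level in rows:
        counts[form][(variable, level)] += 1
        totals[form][variable] += 1
    tags = sorted({tag for c in counts.values() for tag in c})
    profiles = {}
    for form in counts:
        profiles[form] = {
            tag: (counts[form][tag] / totals[form][tag[0]]
                  if totals[form][tag[0]] else 0.0)
            for tag in tags
        }
    return profiles


def canberra(p, q):
    """Canberra distance between two profiles (0/0 terms are skipped)."""
    return sum(abs(p[t] - q[t]) / (p[t] + q[t]) for t in p if p[t] + q[t] > 0)


def average_distance(a, b, profiles):
    """Mean pairwise distance between the members of clusters a and b."""
    total = sum(canberra(profiles[x], profiles[y]) for x in a for y in b)
    return total / (len(a) * len(b))


def agglomerate(profiles):
    """Step 4: average-linkage agglomerative clustering.
    Returns the list of merges as (cluster_a, cluster_b, distance)."""
    clusters = [(form,) for form in sorted(profiles)]
    merges = []
    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2),
                   key=lambda pair: average_distance(pair[0], pair[1], profiles))
        merges.append((a, b, average_distance(a, b, profiles)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges


# Invented toy annotations (NOT corpus data): three hypothetical forms,
# one hypothetical variable "dependent_of" with levels "noun" and "verb".
toy_rows = (
    [("form_a", "dependent_of", "noun")] * 3 + [("form_a", "dependent_of", "verb")]
    + [("form_b", "dependent_of", "noun")] * 3 + [("form_b", "dependent_of", "verb")]
    + [("form_c", "dependent_of", "noun")] + [("form_c", "dependent_of", "verb")] * 3
)

for a, b, d in agglomerate(behavioral_profiles(toy_rows)):
    print(a, "+", b, "at distance", round(d, 3))
```

In this toy input, form_a and form_b have identical profiles and are merged first; form_c joins them only at a larger distance. Forms that merge early in such a procedure are the candidates for the highest degree of near-synonymy.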



3. Discussion and Results

No considerable study has been carried out on the lexical relations of size adjectives in Persian; only some grammarians and linguists, such as Anvari and Givi (2008) and Assi (2006), have simply categorized size adjectives. Consequently, it is difficult to compare the findings of this study with previous studies on near-synonymy.

In this article, near-synonymy is determined by observing and analyzing the similarities and differences in the distributional behavior of size adjectives; in this respect it differs from previous research such as Abdolkarimi (2014, 2016). Our findings also support Gries and Otani's (2010) assumption about the theoretical implications of behavioral profile analysis, shared by most corpus-based work, that formal differences reflect functional differences. For example, formal tag levels such as active, passive, plural and singular play a significant role in the near-synonymy of the inflectional forms of ‘‘ziyâd’’ and ‘‘farâvân’’. The findings further make clear that size adjectives in their base form or with the inflectional suffix ‘‘tar’’ are always dependents of a noun or of a causative or predicative verb, whereas size adjectives with the superlative suffix ‘‘tarin’’ can only be dependents of a noun, not of a verb, since no occurrence of a ‘‘tarin’’ form as the dependent of a verb was observed.



4. Conclusion

In this study, we explored the semantic field of size by analyzing the distributional behavior of the lexical forms "ziyâd, ziyâdi, ziyâdtar, ziyâdtarin, ziyâdtari, farâvân, farâvâni, farâvântar, farâvântarin, farâvântari" according to behavioral profile analysis; three variables with 61 ID-tag levels were annotated to find near-synonymy relations between the inflectional forms of the size adjectives. The hierarchical agglomerative cluster charts for all ID tags showed that, in terms of tag levels, these lexical forms fall into sub-groups whose members are near-synonymous with one another. These sub-groups include "ziyâdi, farâvâni", "ziyâdtari, farâvântari", "ziyâdtar, farâvântar", "ziyâd, farâvân", "ziyâdtarin, farâvântarin", "ziyâdi, farâvâni, ziyâdtari, farâvântari", "ziyâd, farâvân, ziyâdtarin, farâvântarin" and "ziyâdtar, farâvântar, ziyâd, farâvân, ziyâdtarin, farâvântarin".



Abdolkarimi, S. (2014). ‘‘A review on propositional, descriptive and partial synonymy between Persian simple verbs and their corresponding compound verbs’’. Language Science. No. 2. Pp: 143-158.
------------------. (2016). ‘‘Study of different kinds of synonymy between Persian non-compound verbs and their formal–semantic equivalent compound verbs’’. Language Related Research. No. 6. Pp: 201-207. [In Persian].
AleAhmad, A., H. Amiri, E. Darrudi, M. Rahgozar and F. Oroumchian. (2009). ‘‘Hamshahri: A standard Persian text collection’’. Knowledge Based Systems. 22(5). Pp: 382-387.
Anthony, L. (2011). AntConc (Version 3.2.2.1) [Computer Software]. Waseda University, Tokyo, Japan. Retrieved from http://www.antlab.sci.waseda.ac.jp/antconc_index.html.
Anvari, H. & AhmadiGhivi, H. (2008). Persian Grammar 2. 3rd Edition. Tehran: Fatemi Publications. [In Persian].
Assi, M., Gholifamian, A. R. & Aghajani, D. (2006). ‘‘Towards building a word-net for Persian adjectives’’. Language and Linguistics Journal. No. 4. Pp: 125-136. [In Persian].
Atkins, B. T. S. (1987). ‘‘Semantic ID-tags: Corpus evidence for dictionary senses’’. In Proceedings of the Third Annual Conference of the UW Centre for the New Oxford English Dictionary. Pp: 17–36.
Biber, D., S. Conrad & R. Reppen. (1998). Corpus linguistics: Investigating language structure and use. Cambridge: Cambridge University Press.
Brown, K. and Miller, J. (2013). The Cambridge dictionary of linguistics. New York: Cambridge University Press.
Charles, W. G. & Miller, G. A. (1989). ‘‘Contexts of antonymous adjectives’’. Applied Psycholinguistics. No. 3. Pp: 357–375.
Croft, W. (2012). Verbs: aspect and causal structure. Oxford: Oxford University Press.
Deese, J. (1964). ‘‘The associative structure of some common English adjectives’’. Journal of Verbal Learning and Verbal Behavior. No. 5. Pp: 347-357.
Divjak, D. (2010). Structuring the lexicon: A clustered model for near-synonymy. Berlin/New York: Walter de Gruyter GmbH & Co. KG.
Divjak, D. & Gries, Th. (2006). ‘‘Ways of Trying in Russian: Clustering Behavioral Profiles’’. Corpus Linguistics and Linguistic Theory. No. 2 (1). Pp: 23-60.
Dixon, R. M. W. (2004). Adjective Classes in Typological Perspective. In Dixon, R. M. W. & Aikhenvald, A. (eds.) Adjective Classes: A Cross-linguistic Typology.
Edo Marza, N. E. (2011). A comprehensive corpus-based study of the use of evaluative adjectives in promotional hotel websites. Odisea. No: 12, ISSN 1578-3820. Pp: 97-123.
Golshaie, R. (2016). ‘‘A corpus study on identification and semantic classification of light verb constructions in Persian: the case of the light verb xordan 'to eat/ collide'’’. Language Sciences. Volume 57. Pp: 21-33.
Gries, Th. S. (2010). Behavioral profiles: a fine-grained and quantitative approach in corpus-based lexical semantics. The Mental Lexicon. 5. Pp: 323-346.
Gries, Th. S. & Divjak, D. (2009). ‘‘Behavioral profiles: a corpus-based approach to cognitive semantic analysis’’. In New directions in Cognitive Linguistics. Vyvyan Evans and Stephanie Pourcel (eds). Pp: 57–75.
Gries, Th. S. & Otani, N. (2010). ‘‘Behavioral profiles: A corpus-based perspective on synonymy and antonymy’’. ICAME Journal. No. 34. Pp: 121-150.
Hanks, P. (1996). ‘‘Contextual Dependency and Lexical Sets’’. International Journal of Corpus Linguistics. 1 (1). Pp: 75–98.
Jones, S., Paradis, C., Murphy, M. L., & Willners, C. (2007). Googling for opposites: a web-based study of antonym canonicity. Corpora. 2. Pp: 129-154.
Justeson, J. S. & Katz, S. M. (1991). Co-occurrences of antonymous adjectives and their contexts. Computational Linguistics. 17 (1). Pp: 1–19.
KaviyaniFardzade, E. (2017). An investigation of the polysemy of four negative non-verbal prefixes in Persian language according to Principled Polysemy Framework and Behavioral Profiles Approach. M.A. Thesis. Shiraz University. Shiraz. [In Persian].
Krejcie, R. V. & Morgan, D.W. (1970). ‘‘Determining sample size for research activities’’. Educational and Psychological measurement. No. 30. Pp: 607-610.
Kuznetsova, J. (2015). Linguistics profiles. Berlin/Boston: Walter de Gruyter GmbH.
Lakoff, G. & Turner, M. (1989). More than Cool Reason: A Field Guide to Poetic Metaphor. Chicago: University of Chicago Press.
Lakoff, G. (1987). Women, Fire and Dangerous Things: What Categories Reveal about the Mind. Chicago: University of Chicago Press.
-----------. (1993). ‘‘The Contemporary Theory of Metaphor’’. In A. Ortony (Ed.). Metaphor and Thought. Cambridge: Cambridge University Press. Pp: 202-251.

Lakoff, G., & Johnson, M. (1980). Metaphors We Live by. Chicago and London: The University of Chicago Press.
Lee, S. (1994). Untersuchungen zur Valenz des Adjektivs in der deutschen Gegenwartssprache. Berlin: Lang.
Mahootian, Sh. (1999). The Persian grammar from typological perspective. Translated by: Mehdi Samaei. Tehran: Nashr-e Markaz. [In Persian].
McEnery, T. & Hardie, A. (2012). Corpus Linguistics: Theory, Method and Practice. Cambridge: Cambridge University Press.
Moser, Ch. (2014). Synonymy, plesionymy and sameness of meaning: A corpus-based behavioral profiles analysis of adjectives of size. Master Thesis. University of Vienna.