Assistant Professor of Linguistics, University of Isfahan, Isfahan, Iran
Abstract
This study investigated the mapping between Farsnet and Princeton English WordNet (PWN) and tried to answer these questions: Are the unmapped synsets in FarsNet with Princeton WordNet Persian specific lexical items? How can these cases be classified? Is it possible to find equivalents for some of these cases in PWN? In other words, were all of the equivalents of these lexical elements not lexicalized in English or there is another reason for that? For this purpose, a sample including 2000 synsets of FarsNet was selected and unmapped synsets were analyzed. Out of the sample, near 15 percent of the synsets had no mapping. The Categorization and analysis of the selected synsets showed that less than two third of these synsets were Persian specific words and more than one third were synsets that were unmapped due to other reasons and therefore those are not specific lexical items in Persian. By enriching and completing the information in PWN and then FarsNet a lot of these cases can be mapped with PWN synsets.
حسابی، اکبر (1388). طراحی و ایجاد شبکه واژگانی اسامی زبان فارسی. رسالۀ دکتری زبانشناسی همگانی. تهران: دانشگاه علامه طباطبایی.
روحیزاده، مسعود (1386). طبقهبندی افعال فارسی برای کاربرد در شبکه واژگانی زبانفارسی. پایاننامه کارشناسی ارشد زبانشناسی همگانی. تهران: دانشگاه علامه طباطبایی.
فامیان، علیرضا قلی (1386). بررسی و تحلیل روابط معنایی صفت برای طراحی شبکه واژگانی صفات زبان فارسی. رساله دکتری زبانشناسی همگانی. تهران: دانشگاه تربیت مدرس.
Bakken, K. (2006). “Lexicalization”. In Encyclopedia of Language and Linguistics. (pp.106-108). Second Edition.
Bentivogli, L. & E. Pianta )2000 a). “Looking for lexical gaps”. In Ulrich Heid, Stefan Evert, Egbert Lehmann & Christian Rohrer (Eds.). Proceedings of Euralexـ2000 International Congress. (pp. 663-669). Stuttgart: Institut für Maschinelle Sprachverabeitung.
Bentivogli, L.; E. Pianta & F. Pianesi (2000 b). “Coping with lexical gaps when building aligned multilingual wordnets”. In Proceedings of LREC 2000. (pp. 12-20). Athens. Greece.
Cowie, A. P. (1981). “The treatment of collocations and idioms in learner’s dictionaries”. Applied Linguistics. 2(3). pp. 223ـ235.
Hutchins, W. J. & H. L. Somers (1992). An Introduction to Machine Translation. London: Academic Press.
Keyvan, F.; H. Borjian; M. Kasheff & C. Fellbaum (2006). “Developing PersiaNet: The Persian WordNet”. In Proceedings of the 3rd Global WordNet Conference. (pp. 315ـ318). South Korea.
Lyons, J. (1977). Semantics 1. Cambridge: Cambridge University Press.
Miller, G. (1998). “Forward”. In Christiane Fellbaum (Ed.). WordNet: an Electronic Lexical Database. (pp. xvـxxii). MIT Press.
Shamsfard, M. (2008 a). “Developing Farsnet: A lexical ontology for Persian”. In A. Tanács, D. Csendes, V. Vincze, Ch. Fellbaum & P. Vossen (Eds.). Proceedings of the Fourth Global WordNet Conference. (pp. 413ـ418). Szeged, Hungary.
Shamsfard, M. (2008 b). “Towards Semi automatic construction of a lexical ontology for Persian”. LREC 2008 Proceedings. Marocco.
Shamsfard, M. et al. (2010). “Semi Automatic Development of FarsNet; the Persian WordNet”. In Proceedings of the Fifth Global WordNet Conference. (pp. 413ـ418). Mumbai. India.
Talmy, L. (1985). “Lexicalization patterns: Semantic structure in lexical forms”. In T. Shopen (Ed.). Language Typology and Syntactic Description 3: Grammatical Categories and the Lexicon. (pp. 57–149). Cambridge, UK: Cambridge University Press.
Vincze, V. & A. Almási (2014). “Non-lexicalized concepts in wordnets: A case study of English and Hungarian”. In Proceedings of 7th International Conference of the GlobalWordNet Association. (pp. 118-126). Tartu. Estonia.
Vossen, P. (2002). “Euro WordNet General Document”. In EuroWordNet Project LE2ـ4003 & LE4ـ8328 Report. University of Amsterdam.
Vossen, P. & L. Bloksma (1998). “Categories and classifications in EuroWordNet”. In Proceeding of LREC 1998. (pp. 15-23). Granada.
hesabi,A. (2015). Specific Words of Persian? Lexical Gaps in Mapping Synsets of FarsNet and Princeton WordNet. Language Related Research, 6(6), 75-103.
MLA
hesabi,A. . "Specific Words of Persian? Lexical Gaps in Mapping Synsets of FarsNet and Princeton WordNet", Language Related Research, 6, 6, 2015, 75-103.
HARVARD
hesabi,A. (2015). 'Specific Words of Persian? Lexical Gaps in Mapping Synsets of FarsNet and Princeton WordNet', Language Related Research, 6(6), pp. 75-103.
CHICAGO
A. hesabi, "Specific Words of Persian? Lexical Gaps in Mapping Synsets of FarsNet and Princeton WordNet," Language Related Research, 6 6 (2015): 75-103,
VANCOUVER
hesabi,A. Specific Words of Persian? Lexical Gaps in Mapping Synsets of FarsNet and Princeton WordNet. Language Related Research, 2015; 6(6): 75-103.