• ثبت نام
    • ورود به سامانه
    مشاهده مورد 
    •   صفحهٔ اصلی
    • نشریات انگلیسی
    • Journal of AI and Data Mining
    • Volume 7, Issue 1
    • مشاهده مورد
    •   صفحهٔ اصلی
    • نشریات انگلیسی
    • Journal of AI and Data Mining
    • Volume 7, Issue 1
    • مشاهده مورد
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Automatic Construction of Persian ICT WordNet using Princeton WordNet

    (ندگان)پدیدآور
    Ahmadi Tameh, A.Nassiri, M.Mansoorizadeh, M.
    Thumbnail
    دریافت مدرک مشاهده
    FullText
    اندازه فایل: 
    1.135 مگابایت
    نوع فايل (MIME): 
    PDF
    نوع مدرک
    Text
    Research/Original/Regular Article
    زبان مدرک
    English
    نمایش کامل رکورد
    چکیده
    WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose several automatic methods to extract Information and Communication Technology (ICT)-related data from Princeton WordNet. We, then, add these extracted data to our Persian WordNet. The advantage of automated methods is reducing the interference of human factors and accelerating the development of our bilingual ICT WordNet. In our first proposed method, based on a small subset of ICT words, we use the definition of each synset to decide whether that synset is ICT. The second mechanism is to extract synsets which are in a semantic relation with ICT synsets. We also use two similarity criteria, namely LCS and S3M, to measure the similarity between a synset definition in WordNet and definition of any word in Microsoft dictionary. Our last method is to verify the coordinate of ICT synsets. Results show that our proposed mechanisms are able to extract ICT data from Princeton WordNet at a good level of accuracy.
    کلید واژگان
    WordNet
    semantic relation
    synset
    part of speech
    Information and Communication Technology
    Document and Text Processing

    شماره نشریه
    1
    تاریخ نشر
    2019-01-01
    1397-10-11
    ناشر
    Shahrood University of Technology
    سازمان پدید آورنده
    Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.
    Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.
    Computer Department, Faculty of Engineering, Bu-Ali Sina University, Hamedan, Iran.

    شاپا
    2322-5211
    2322-4444
    URI
    https://dx.doi.org/10.22044/jadm.2018.4966.1601
    http://jad.shahroodut.ac.ir/article_1182.html
    https://iranjournals.nlai.ir/handle/123456789/294784

    مرور

    همه جای سامانهپایگاه‌ها و مجموعه‌ها بر اساس تاریخ انتشارپدیدآورانعناوینموضوع‌‌هااین مجموعه بر اساس تاریخ انتشارپدیدآورانعناوینموضوع‌‌ها

    حساب من

    ورود به سامانهثبت نام

    آمار

    مشاهده آمار استفاده

    تازه ترین ها

    تازه ترین مدارک
    © کليه حقوق اين سامانه برای سازمان اسناد و کتابخانه ملی ایران محفوظ است
    تماس با ما | ارسال بازخورد
    قدرت یافته توسطسیناوب