Building a Text Collection for Urdu Information Retrieval

Rasheed, Imran; Banka, Haider; Khan, Hamaid M.

doi:10.4218/etrij.2019-0458

Building a Text Collection for Urdu Information Retrieval

Dosyalar

Rasheed.pdf (8.94 MB)

Tarih

2021

Yazarlar

Rasheed, Imran

Banka, Haider

Khan, Hamaid M.

Yayıncı

Wiley Online Library

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

Urdu is a widely spoken language in the Indian subcontinent with over 300 million speakers worldwide. However, linguistic advancements in Urdu are rare compared to those in other European and Asian languages. Therefore, by following Text Retrieval Conference standards, we attempted to construct an extensive text collection of 85 304 documents from diverse categories covering over 52 topics with relevance judgment sets at 100 pool depth. We also present several applications to demonstrate the effectiveness of our collection. Although this collection is primarily intended for text retrieval, it can also be used for named entity recognition, text summarization, and other linguistic applications with suitable modifications. Ours is the most extensive existing collection for the Urdu language, and it will be freely available for future research and academic education.

Anahtar Kelimeler

Assessors Agreement, Relevance Judgment, Text Collection Construction and Evaluation, Urdu Corpus, Urdu Information Retrieval

Kaynak

Etri Journal

WoS Q Değeri

Q4

Scopus Q Değeri

Q2

Cilt

43

Sayı

5

Künye

RASHEED, Imran, Haider BANKA & Hamaid M. KHAN. "Building a Text Collection for Urdu Information Retrieval". Etri Journal, 43.5 (2021): 856-868.

Bağlantı

https://onlinelibrary.wiley.com/doi/full/10.4218/etrij.2019-0458
https://hdl.handle.net/11352/3793

Koleksiyon

Alüminyum Test Eğitim ve Araştırma Merkezi (ALUTEAM)
Scopus İndeksli Yayınlar Koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Building a Text Collection for Urdu Information Retrieval

Dosyalar

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon

Onay

İnceleme

Ekleyen

Referans Veren