Hamshahri Corpus
id:
hamshahri-corpus-277-5907631
title:
Hamshahri Corpus
text:
The Hamshahri Corpus is a sizable Persian corpus based on the Iranian newspaper Hamshahri, one of the first online Persian-language newspapers in Iran. It was initially collected and compiled by Ehsan Darrudi at DBRG Group of University of Tehran. Later, a team headed by Abolfazl AleAhmad built on this corpus and created the first Persian text collection suitable for information retrieval evaluation tasks. This corpus was created by crawling the online news articles from the Hamshahri's website
brand slug:
wiki
category slug:
encyclopedia
description:
original url:
https://en.wikipedia.org/wiki/Hamshahri_Corpus
date created:
date modified:
2023-07-03T03:26:11Z
main entity:
{"identifier":"Q5646402","url":"https://www.wikidata.org/entity/Q5646402"}
image:
fields total:
13
integrity:
13