Resource: Al-Mus'haf Corpus

Reference Al-Mus'haf Corpus
Date of Submission Jan. 20, 2016, 11:44 a.m.
Status accepted
ISLRN 114-868-598-820-5
Resource Type Corpus
Media Type Text
Language Arabic
Format/MIME Type text/csv,
Size 8.43 Mo
Description

Al-Mus'haf corpus is a new Quranic corpus rich in morphosyntactical information. To build such a corpus of the Quran, we used a semi-automatic technique which consists in using the morphosyntactic of Standard Arabic words "AlKhalil Morpho Sys" version 2 followed by a manual treatment in colaboration with experts in Arabic grammar. The corpus and the results we achieved can be used by researchers as baselines to test and evaluate their Arabic tools. In addition, this corpus can be used to train, optimize and evaluate existing approaches. Furthermore, the current corpus is always subject to further improvement.

Version 1.0
Creator Imad Zeroual - Mohamed First University , Abdelhak Lakhouaja - Mohammed First University
Distributor Imad Zeroual - Mohamed First University , Abdelhak Lakhouaja - Mohammed First University
Rights Holder Imad Zeroual - Mohamed First University , Abdelhak Lakhouaja - Mohammed First University
Relation new version/release of "Al-Mus'haf Corpus"