|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
HOME | Teaching | Research | Open Courseware | Biography | |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Arabic OCR | Useful References | Publications | Dataset PATS-A01 | Dataset PATS-A02 | |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Dataset PATS-A01 The first Printed Arabic Text Set A01 (PATS-A01) consists of 2766 text line images. The text of 2751 line images of this set was selected from two standard classic Arabic books. The text of the remaining 15 line images are added from our minimal Arabic script (see publications). The line images are available in eight fonts: Arial, Tahoma, Akhbar, Thuluth, Naskh, Simplified Arabic, Andalus, and Traditional Arabic. AkhbarText.txt
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Please notice that the ground truth text lines are ordered according to the numbering used in the names of the line images. |