
		<paper>
			<loc>https://jjcit.org/paper/215</loc>
			<title>ARABIC SOFT SPELLING CORRECTION WITH T5</title>
			<doi>10.5455/jjcit.71-1699768515</doi>
			<authors>Mohammed Al-Qaraghuli,Ola Arif Jaafar</authors>
			<keywords>Arabic spelling correction,Transformers,Text-to-text transfer transformer,T5,Natural-language processing</keywords>
			<citation>7</citation>
			<views>3871</views>
			<downloads>1245</downloads>
			<received_date>12-Nov.-2023</received_date>
			<revised_date>  3-Jan.-2024</revised_date>
			<accepted_date>   20-Jan.-2024</accepted_date>
			<abstract>Spelling correction is considered a challenging task for resource-scarce languages. The Arabic language is one of these resource-scarce languages, which suffers from the absence of a large spelling correction dataset, thus datasets injected with artificial errors are used to overcome this problem. In this paper, we trained the Text-to-Text Transfer Transformer (T5) model using artificial errors to correct Arabic soft spelling mistakes. Our T5 model can correct 97.8% of the artificial errors that were injected into the test set. Additionally, our T5 model achieves a character error rate (CER) of 0.77% on a set that contains real soft spelling mistakes. We achieved these results using a 4-layer T5 model trained with a 90% error injection rate, with a maximum sequence length of 300 characters.</abstract>
		</paper>


