Information Technology Journal1812-56381812-5646Asian Network for Scientific Information10.3923/itj.2008.1009.1015A. TouirAmeur MathkourHassan Al-SaneaWaleed 7200877In this study, we present an automatic technique to
help segment the Arabic texts while preserving the semantics. The technique
is based on an empirical study on the sentences and clauses connectors.
It has evolved from tedious analysis of various Arabic texts and from
observations that have been noted over a long period of time. The analysis
made it possible to realize the functionality of each connector in terms
of separating standalone segments in the Arabic texts. This has lead to
a categorization of active and passive connectors. We used the introduced
notion of active and passive connectors to develop an algorithm that respects
the semantic of the text to identify the segments of a given Arabic text.
The algorithm has been implemented and experimented with. Various Arabic
essays were segmented using the algorithm and the results were compared
to that of manual segmentations performed by linguistic experts. The performance
of the algorithm was in line with the manual segmentations that were performed
by the linguistic experts.]]>Agichtein, E. and V. Ganti, 20042004pp: 2029Al-Ansari, I. H., 20032003Al-Sanie, W., A. Touir and H. Mathkour, 20052005pp: 535542Al-Sanie, W., A. Touir and H. Mathkour, 20052005pp: 10861091Beeferman, D., A. Berger and J. Lafferty, 19971997pp: 3547Beeferman, D., A. Berger and J.D. Lafferty, 199934177210Chang, D.S. and K.S. Choi, 2005Lecture Notes in Computer Science, Vol. 3248,pp: 61-70pp: 61-70Chang, D.S. and K.S. Choi, 200642662678Cristea, D., O. Postolache and L. Pistol, 2005Lecture Notes in Computer Science, Vol. 3406,pp: 632-644pp: 632-644El-Masri, B. H.,20012001Golcher, F., 20062006pp: 4451Haouam, K., A. Touir and F. Marir, 20032003pp: 139148Lamprier, S., T. Amghar, B. Levrat and F. Saubion, 20072007pp: 16471653Le Thanh, H., G. Abeysinghe and C. Huyck, 20042004pp: 411415Mann, W.C. and S. Thompson, 19888243281Marcu, D., 19971997pp: 96103Marcu, D., 19991999pp: 123-136pp: 123-136Marcu, D., 20001st Edn.Marcu, D., 200026395448Mathkour, H., A. Touir and W. Al-Sanie, 20052005pp: 229236Mazur, P.P., 20052005pp: 4348Sebastian, N. and A. Costa, 199712883887Sparck-Jones, K.,19991999pp: 1-13pp: 1-13Utiyama, M. and H. Isahara, 20012001pp: 491498Villatoro-Tello, E., L. Villasenor-Pineda and M. Montes-Y-Gomez, 2006Lecture Notes in Computer Science, Vol. 4188,pp: 293-300pp: 293-300Wu, Z. and G. Tseng, 1995468396Yang, C.C. and K.W. Li, 20055614381447