Authors: | Popović, Branislav; Sečujski, Milan; Delić, Vlado; Janev, Marko; Stanković, Igor |
Affiliations: | Mathematical Institute of the Serbian Academy of Sciences and Arts |
Title: | Automatic morphological annotation in a text-to-speech system for Hebrew |
Journal: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Volume: | 8113 LNAI |
First page: | 78 |
Last page: | 85 |
Conference: | 15th International Conference on Speech and Computer, SPECOM 2013; Pilsen; Czech Republic; 1 September 2013 through 5 September 2013 |
Issue Date: | 10-Oct-2013 |
Rank: | M33 |
ISBN: | 978-3-319-01930-7 |
ISSN: | 0302-9743 |
DOI: | 10.1007/978-3-319-01931-4_11 |
Abstract: | The paper presents the module for automatic morphological annotation within a text synthesizer for Hebrew, based on an efficient combination of two approaches. The first approach includes the selection of lexemes from appropriate lexica, while the other approach involves automatic morphological analysis of text input using a complex expert algorithm relying on a set of transformational rules and using 6 types of scoring procedures. The module operates on a set of 30 part-of-speech tags with more than 3000 corresponding morphological categories. The paper discusses the advantages of the proposed method in the context of an extremely morphologically complex language such as Hebrew, with particular emphasis given to the relative importance of individual scoring procedures. When all 6 scoring procedures are applied, the accuracy of 99.6% is achieved on a corpus of 3093 sentences (55046 words). |
Keywords: | Hebrew; part-of-speech tagging; speech synthesis |
Publisher: | Springer Link |