Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Recognizing verb-based Croatian idiomatic MWUs (CROSBI ID 642919)

Prilog sa skupa u zborniku | izvorni znanstveni rad

Kocijan, Kristina ; Librenjak, Sara Recognizing verb-based Croatian idiomatic MWUs // Automatic processing of natural-language electronic texts with NooJ : revised selected papers / Okrut, Tatsiana ; Hetsevich, Yuras ; Silberztein, Max et al. (ur.). Springer, 2016. str. 96-106

Podaci o odgovornosti

Kocijan, Kristina ; Librenjak, Sara

engleski

Recognizing verb-based Croatian idiomatic MWUs

This paper tackles the computational problems of Croatian verbal idioms. Croatian language has very rich phraseme structure, as described in Matešić (1982), Menac (2007) and Menac- Mihalić (2007), as well as many others. This work is one of the few attempts of computational analyis of idioms in Croatian language as multi-word units. We used rule- based approach and NooJ syntactic grammars in order to recognize any verb based idiom (of the ~1500 analyzed) in any syntactic position. The Croatian Dictionary of Idioms (Menac et al. 2003) was used for the initial list, which was implemented with new additions during training phase. Grammars were tested within the corpora constructed specifically for this work, and used to calculate statistical measures of recall, precision and f-measure for our grammars. With the final results of recall < 98 %, precision < 96 % and f-measure < 97 %, we consider this a successful attempt in the recognition of verb based idioms in Croatian language.

Croatian, idioms, verbal phrases, NooJ, MWU, MWE, frozen expressions, semi-frozen expressions

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

96-106.

2016.

objavljeno

Podaci o matičnoj publikaciji

Automatic processing of natural-language electronic texts with NooJ : revised selected papers

Okrut, Tatsiana ; Hetsevich, Yuras ; Silberztein, Max ; Stanislavenka, Hanna

Springer

978-3-319-42470-5

Podaci o skupu

Nepoznat skup

predavanje

29.02.1904-29.02.2096

Povezanost rada

Filologija, Informacijske i komunikacijske znanosti

Indeksiranost