Search Results for author: Sanda Martinčić-Ipšić

Finally, since the size of texts is reflected in the network properties, our results suggest that the corpus influence can be reduced by increasing the co-occurrence window size.

Clustering

Toward Network-based Keyword Extraction from Multitopic Web Documents

no code implementations • 14 Jul 2014 • Sabina Šišović, Sanda Martinčić-Ipšić, Ana Meštrović

In this paper, we analyse the selectivity measure calculated from the complex network in the task of automatic keyword extraction.

Keyword Extraction, Sentence

Toward Selectivity Based Keyword Extraction for Croatian News

no code implementations • 17 Jul 2014 • Slobodan Beliga, Ana Meštrović, Sanda Martinčić-Ipšić

The obtained sets are evaluated against manually annotated keywords: for the set of extracted keyword candidates, the average F1 score is 24.63% and the average F2 score is 21.19%; for the extracted word-tuple candidates, the average F1 score is 25.9% and the average F2 score is 24.47%.

Keyword Extraction

Non-Standard Words as Features for Text Categorization

no code implementations • 28 Aug 2014 • Slobodan Beliga, Sanda Martinčić-Ipšić

This paper presents categorization of Croatian texts using Non-Standard Words (NSW) as features.

Lemmatization, Text Categorization

Network Motifs Analysis of Croatian Literature

no code implementations • 18 Nov 2014 • Hana Rizvić, Sanda Martinčić-Ipšić, Ana Meštrović

Firstly, we show that the triad significance profile for the Croatian language is very similar to that of other languages and that all the networks belong to the same family of networks.

Normalization of Non-Standard Words in Croatian Texts

no code implementations • 27 Mar 2015 • Slobodan Beliga, Miran Pobar, Sanda Martinčić-Ipšić

This paper presents text normalization, which is an integral part of any text-to-speech synthesis system.

General Classification, Speech Synthesis +1

Multilayer Network of Language: a Unified Framework for Structural Analysis of Linguistic Subsystems

no code implementations • 30 Jul 2015 • Domagoj Margan, Ana Meštrović, Sanda Martinčić-Ipšić

The multilayer network of language is a unified framework for modeling linguistic subsystems and their structural properties enabling the exploration of their mutual interactions.

The Influence of Feature Representation of Text on the Performance of Document Classification

no code implementations • 5 Jul 2017 • Sanda Martinčić-Ipšić, Tanja Miličić, Ljupčo Todorovski

In this study, we measure the performance of document classifiers trained with the random forests method on features generated by the three models and their variants.

Document Classification, General Classification
