Search Results for author: Piotr Szymański

Found 18 papers, 10 papers with code

SRAI: Towards Standardization of Geospatial AI

1 code implementation19 Oct 2023 Piotr Gramacki, Kacper Leśniara, Kamil Raczycki, Szymon Woźniak, Marcin Przymus, Piotr Szymański

Spatial Representations for Artificial Intelligence (srai) is a Python library for working with geospatial data.

Improved DeepFake Detection Using Whisper Features

1 code implementation2 Jun 2023 Piotr Kawa, Marcin Plata, Michał Czuba, Piotr Szymański, Piotr Syga

With a recent influx of voice generation methods, the threat introduced by audio DeepFake (DF) is ever-increasing.

Automatic Speech Recognition DeepFake Detection +3

highway2vec -- representing OpenStreetMap microregions with respect to their road network characteristics

1 code implementation26 Apr 2023 Kacper Leśniara, Piotr Szymański

Recent years brought advancements in using neural networks for representation learning of various language or visual phenomena.

Representation Learning

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

1 code implementation23 Nov 2022 Łukasz Augustyniak, Kamil Tagowski, Albert Sawczyn, Denis Janiak, Roman Bartusiak, Adrian Szymczak, Marcin Wątroba, Arkadiusz Janz, Piotr Szymański, Mikołaj Morzy, Tomasz Kajdanowicz, Maciej Piasecki

In this paper, we introduce LEPISZCZE (the Polish word for glew, the Middle English predecessor of glue), a new, comprehensive benchmark for Polish NLP with a large variety of tasks and high-quality operationalization of the benchmark.

Benchmarking

Hex2vec -- Context-Aware Embedding H3 Hexagons with OpenStreetMap Tags

1 code implementation1 Nov 2021 Szymon Woźniak, Piotr Szymański

In this paper we propose the first approach to learning vector representations of OpenStreetMap regions with respect to urban functions and land-use in a micro-region grid.

Representation Learning

gtfs2vec -- Learning GTFS Embeddings for comparing Public Transport Offer in Microregions

1 code implementation1 Nov 2021 Piotr Gramacki, Szymon Woźniak, Piotr Szymański

We selected 48 European cities and gathered their public transport timetables in the GTFS format.

Clustering

Transfer Learning Approach to Bicycle-sharing Systems' Station Location Planning using OpenStreetMap Data

1 code implementation1 Nov 2021 Kamil Raczycki, Piotr Szymański

Bicycle-sharing systems (BSS) have become a daily reality for many citizens of larger, wealthier cities in developed regions.

Decision Making Layout Design +1

Spatial Data Mining of Public Transport Incidents reported in Social Media

1 code implementation11 Oct 2021 Kamil Raczycki, Marcin Szymański, Yahor Yeliseyenka, Piotr Szymański, Tomasz Kajdanowicz

We successfully build an information type classifier for social media posts, detect stop names in posts, and relate them to GPS coordinates, obtaining a spatial understanding of long-term aggregated phenomena.

Is the Best Better? Bayesian Statistical Model Comparison for Natural Language Processing

no code implementations EMNLP 2020 Piotr Szymański, Kyle Gorman

Recent work raises concerns about the use of standard splits to compare natural language processing models.

A Network Perspective on Stratification of Multi-Label Data

no code implementations27 Apr 2017 Piotr Szymański, Tomasz Kajdanowicz

We present a new approach to stratifying multi-label data for classification purposes based on the iterative stratification approach proposed by Sechidis et.

Classification Community Detection +2

Is a Data-Driven Approach still Better than Random Choice with Naive Bayes classifiers?

no code implementations13 Feb 2017 Piotr Szymański, Tomasz Kajdanowicz

In case of F1 scores and Subset Accuracy - data driven approaches were more likely to perform better than random approaches than otherwise in the worst case.

General Classification Multi-Label Classification

A scikit-based Python environment for performing multi-label classification

2 code implementations5 Feb 2017 Piotr Szymański, Tomasz Kajdanowicz

It provides native Python implementations of popular multi-label classification methods alongside a novel framework for label space partitioning and division.

General Classification Management +1

How is a data-driven approach better than random choice in label space division for multi-label classification?

no code implementations7 Jun 2016 Piotr Szymański, Tomasz Kajdanowicz, Kristian Kersting

We show that fastgreedy and walktrap community detection methods on weighted label co-occurence graphs are 85-92% more likely to yield better F1 scores than random partitioning.

Community Detection General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.