Search Results for author: Paolo Papotti

Found 23 papers, 6 papers with code

Automatic Verification of Data Summaries

no code implementations INLG (ACL) 2021 Rayhane Rezgui, Mohammed Saeed, Paolo Papotti

We present a generic method to compute thefactual accuracy of a generated data summarywith minimal user effort.

Fact Checking

Combating Misinformation in the Arab World: Challenges & Opportunities

no code implementations5 Jun 2025 Azza Abouzied, Firoj Alam, Raian Ali, Paolo Papotti

Misinformation and disinformation pose significant risks globally, with the Arab region facing unique vulnerabilities due to geopolitical instabilities, linguistic diversity, and cultural nuances.

Diversity Fact Checking +1

RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models

no code implementations27 May 2025 Dario Satriani, Enzo Veltri, Donatello Santoro, Paolo Papotti

Current benchmarks often assess short factual answers, overlooking the critical ability to generate structured, multi-record tabular outputs from parametric knowledge.

Retrieval

Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL

no code implementations21 Apr 2025 Simone Papicchio, Simone Rossi, Luca Cagliero, Paolo Papotti

To this end, it considers the following LLM settings: (1) ZSL, including general-purpose reasoning or not; (2) SFT, with and without task-specific reasoning traces; (3) RL, exploring the use of different rewarding functions, both the established EXecution accuracy (EX) and a mix with fine-grained ones that also account the precision, recall, and cardinality of partially correct answers; (4) SFT+RL, i. e, a two-stage approach that combines SFT and RL.

Reinforcement Learning (RL) Zero-Shot Learning

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

no code implementations6 Mar 2025 Giulio Corallo, Orion Weller, Fabio Petroni, Paolo Papotti

Incorporating external knowledge in large language models (LLMs) enhances their utility across diverse applications, but existing methods have trade-offs.

RAG Retrieval-augmented Generation

An LLM-Based Approach for Insight Generation in Data Analysis

no code implementations20 Feb 2025 Alberto Sánchez Pérez, Alaa Boukhary, Paolo Papotti, Luis Castejón Lozano, Adam Elwood

Generating insightful and actionable information from databases is critical in data analysis.

Latent Abstractions in Generative Diffusion Models

no code implementations4 Oct 2024 Giulio Franzese, Mattia Martini, Giulio Corallo, Paolo Papotti, Pietro Michiardi

In this work we study how diffusion-based generative models produce high-dimensional data, such as an image, by implicitly relying on a manifestation of a low-dimensional set of latent abstractions, that guide the generative process.

Finch: Prompt-guided Key-Value Cache Compression

no code implementations31 Jul 2024 Giulio Corallo, Paolo Papotti

Only such pairs are stored in the KV cache, which, within the space constrained by the context window, ultimately contains a compressed version of the long text.

Language Modeling Language Modelling +2

Retrieve, Merge, Predict: Augmenting Tables with Data Lakes

5 code implementations9 Feb 2024 Riccardo Cappuzzo, Aimee Coelho, Felix Lefebvre, Paolo Papotti, Gael Varoquaux

Machine-learning from a disparate set of tables, a data lake, requires assembling features by merging and aggregating tables.

AutoML Benchmarking +1

Variable Selection in Maximum Mean Discrepancy for Interpretable Distribution Comparison

no code implementations2 Nov 2023 Kensuke Mitsuzawa, Motonobu Kanagawa, Stefano Bortoli, Margherita Grossi, Paolo Papotti

For this optimisation, we introduce sparse regularisation and propose two methods for dealing with the issue of selecting an appropriate regularisation parameter.

Causal Inference Relevance Detection +2

Querying Large Language Models with SQL

no code implementations2 Apr 2023 Mohammed Saeed, Nicola De Cao, Paolo Papotti

Thus, we envision the use of SQL queries to cover a broad range of data that is not captured by traditional databases by tapping the information in LLMs.

Unsupervised Matching of Data and Text

1 code implementation16 Dec 2021 Naser Ahmadi, Hansjorg Sand, Paolo Papotti

Our method builds a fine-grained graph over the content of the corpora and derives word embeddings to represent the objects to match in a low dimensional space.

Entity Resolution Question Answering +2

RuleBert: Teaching Soft Rules to Pre-trained Language Models

1 code implementation EMNLP 2021 Mohammed Saeed, Naser Ahmadi, Preslav Nakov, Paolo Papotti

While pre-trained language models (PLMs) are the go-to solution to tackle many natural language processing problems, they are still very limited in their ability to capture and to use common-sense knowledge.

Common Sense Reasoning

Automated Fact-Checking for Assisting Human Fact-Checkers

no code implementations13 Mar 2021 Preslav Nakov, David Corney, Maram Hasanain, Firoj Alam, Tamer Elsayed, Alberto Barrón-Cedeño, Paolo Papotti, Shaden Shaar, Giovanni Da San Martino

The reporting and the analysis of current events around the globe has expanded from professional, editor-lead journalism all the way to citizen journalism.

Fact Checking

LIBRE: Learning Interpretable Boolean Rule Ensembles

no code implementations15 Nov 2019 Graziano Mita, Paolo Papotti, Maurizio Filippone, Pietro Michiardi

We present a novel method - LIBRE - to learn an interpretable classifier, which materializes as a set of Boolean rules.

Query-limited Black-box Attacks to Classifiers

1 code implementation23 Dec 2017 Fnu Suya, Yuan Tian, David Evans, Paolo Papotti

Specifically, we consider the problem of attacking machine learning classifiers subject to a budget of feature modification cost while minimizing the number of queries, where each query returns only a class and confidence score.

Bayesian Optimization BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.