1 code implementation • ParlaCLARIN (LREC) 2022 • Tommaso Agnoloni, Carlo Marchetti, Roberto Battistoni, Giuseppe Briotti
In this paper we describe an experiment applying text clustering techniques to dossiers of amendments to proposed legislation discussed in the Italian Senate.
no code implementations • ParlaCLARIN (LREC) 2022 • Tommaso Agnoloni, Roberto Bartolini, Francesca Frontini, Simonetta Montemagni, Carlo Marchetti, Valeria Quochi, Manuela Ruisi, Giulia Venturi
The corpus contains 1199 sessions and 79,373 speeches, for a total of about 31 million words, and was encoded according to the ParlaCLARIN TEI XML format, as well as in CoNLL-UD format.