CFP: ParlaCLARIN II: LREC2020 workshop on creating, using and linking parliamentary corpora with other types of political discourse

Date: 12 May 2020
Venue: Palais du Pharo, Marseille, France
Submission Deadline: 20 January 2019
Submission page: will be communicated by 20 December 2019

Workshop Description

Parliamentary data is a major source of socially relevant content. It is available in ever larger quantities, is multilingual and accompanied by rich metadata, and has a distinguishing characteristic: it is spoken language produced in controlled circumstances, which has traditionally been transcribed but is now increasingly also released in audio and video formats. All these factors require solutions related to structuring, synchronization, visualization, querying and analysis of parliamentary corpora. Furthermore, approaches that exploit parliamentary corpora to their full extent also have to take into account the needs of researchers from vastly different fields in the Humanities and Social Sciences, such as political science, sociology, history, and psychology.

A successful first edition of the ParlaCLARIN scientific workshop held at LREC 2018 and a follow-up developmental ParlaFormat workshop held by CLARIN ERIC in 2019 (see links below) resulted in a good overview of the many existing parliamentary resources worldwide, as well as tangible first steps towards better harmonization, interoperability and comparability of the resources and tools relevant for the study of parliamentary discussions and decisions.

The second ParlaCLARIN workshop therefore aims to bring together developers, curators and researchers of regional, national and international parliamentary debates that are suitable for research in disciplines in the Humanities and Social Sciences. We invite unpublished original work focusing on the compilation, annotation, visualisation and utilisation of parliamentary records as well as linking or comparing parliamentary records with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. Apart from dissemination of the results, the workshop also aims to address the identified obstacles, discuss open issues and coordinate future efforts in this increasingly trans-national and cross-disciplinary community.


Due to the Freedom of Information Acts that are supported by the United Nations and in place in over 100 countries worldwide, parliamentary debates are increasingly easy to obtain. They have always been of interest to researchers from a wide range of fields in the Humanities and Social Sciences, both for the potential influence of their content and for the specificities of the formalized, often persuasive and emotional language used in this context. As a consequence, there are many initiatives, at the national and international levels, that aim at compiling and analysing parliamentary data. The recent CLARIN-PLUS survey on parliamentary data has identified over 20 corpora of parliamentary records, with over half of them being available within the CLARIN infrastructure (see link below).

Given the maturity, variety, and potential of this type of language data, as well as the rich metadata that complements it, it is urgent to bring together three groups of researchers: those who produce parliamentary corpora and make them available; those who use them for linguistic, historical, political, sociological and other research; and those who link or compare them with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. The aim is to share methods and approaches for compiling, annotating and exploring parliamentary and other political language data, to harmonize the compiled resources, to ensure current and future comparability of research on national datasets, and to promote transnational analyses.

Topics of interest

Topics include but are not limited to:
- Creation and annotation of parliamentary data in textual, spoken and video format
- Annotation standards and best practices for parliamentary corpora
- Accessibility, querying and visualisation of parliamentary data
- Text analytics, semantic processing and linking of parliamentary and other datasets of political language data
- Parliamentary corpora and multilinguality
- Studies based on parliamentary corpora
- Studies comparing parliamentary corpora with other types of political discourse

Submission & Publication

We accept submission of long papers (up to 8 pages), short papers (up to 4 pages) and demo papers (up to 4 pages) to be presented as a long or short oral presentation at the workshop. The papers of the workshop will be published in online proceedings.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or that result from it. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and the replicability of experiments (including evaluation ones). For contact data, stylesheets, up-to-date details on submission and the workshop itself, please consult the workshop website.

Submission page: will be communicated by 20 December 2019

Important Dates

- Paper submission deadline: 14 February 2020
- Notification of acceptance: 13 March 2020 
- Camera-ready paper: 2 April 2020 
- Workshop date: 12 May 2020

Organizing Committee

- Darja Fišer, University of Ljubljana and Jožef Stefan Institute
- Franciska de Jong, CLARIN ERIC
- Maria Eskevich, CLARIN ERIC

The workshop is supported by the CLARIN research infrastructure.
To contact the organizers, please mail clarin@clarin.eu (Subject: [ParlaCLARIN@LREC2020]).

Programme Committee (in alphabetical order)

Bente Maegaard, University of Copenhagen, Denmark
Francesca Frontini, Université Paul Valéry - Montpellier, France
Henk van den Heuvel, Radboud University, The Netherlands
Jan Odijk, Utrecht University, The Netherlands
Kaspar Beelen, The Alan Turing Institute, UK
Klaus Illmayer, Austrian Academy of Sciences, Austria
Laura Morales, Sciences Po, France
Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences, Poland
Maria Gavriilidou, ILSP/Athena RC, Greece
Maria Pontiki, ILSP/Athena RC, Greece
Monica Monachini, National Research Council of Italy, Italy
Petya Osenova, IICT-BAS and Sofia University "St. Kl. Ohridski", Bulgaria
Sara Tonelli, Fondazione Bruno Kessler, Italy
Simone Paolo Ponzetto, University of Mannheim, Germany
Stelios Piperidis, ILSP/Athena RC, Greece
Tamás Váradi, Hungarian Academy of Sciences, Hungary
Tanja Wissik, Austrian Academy of Sciences, Austria
Tomaž Erjavec, Jožef Stefan Institute, Slovenia

Identify, Describe and Share your LRs!

Describing your LRs in the LRE Map is now standard practice in the submission procedure of LREC (introduced in 2010 and adopted by other conferences). To continue the efforts initiated at LREC 2014 on “Sharing LRs” (data, tools, web-services, etc.), authors will have the possibility, when submitting a paper, to upload LRs to a special LREC repository. This effort of sharing LRs, linked to the LRE Map for their description, may become a new “regular” feature for conferences in our field, thus contributing to creating a common repository where everyone can deposit and share data.

As scientific work requires accurate citations of referenced work so as to allow the community to understand the whole context and also replicate the experiments conducted by other researchers, LREC 2020 endorses the need to uniquely identify LRs through the use of the International Standard Language Resource Number (ISLRN, www.islrn.org), a Persistent Unique Identifier to be assigned to each Language Resource. The assignment of ISLRNs to LRs cited in LREC papers will be offered at submission time.