Conetxt & Scope

This International Workshop & Panel is organized in the context of the collaboration that links since 2014 the University Sidi Mohamed Ben Abdellah and the the Istituto di Linguistica Computazionale "A. Zampolli" (ILC-CNR), Pisa, Italy. The scope is to extend CLARIN to the university centers in Morocco, federate and share existing and new data, expand collaborations and leverage resources for collaborative research.

This panel intends therefore to open up a discussion on how to make Language Resources produced in Morocco more visible and accessible to a broader research community, and how the experience and resources for the setting up of various CLARIN data centres and competence centres could be beneficial to this purpose.

CLARIN is the Language Resources Infrastructure for Social Sciences and Humanities. More precisely, it is a European Research Infrastructure Consortium, or ERIC, which is a specific legal form that facilitates the establishment and operation of Research Infrastructures with European interest. CLARIN was established in 2010 from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences and beyond. Its mission is to create and maintain an infrastructure to support the sharing, use and sustainability of language data and tools for research. 

Currently CLARIN provides easy and sustainable access to digital language data (in written, spoken, or multimodal form) for scholars. CLARIN also offers advanced tools to discover, explore, exploit, annotate, analyse or combine such data sets, wherever they are located. This is enabled through a networked federation of centres: language data repositories, service centres and knowledge centres, with single sign-on access for all members of the academic community in all participating countries. Tools and data from different centres are interoperable, so that data collections can be combined and tools from different sources can be chained to perform complex operations to support researchers in their work.

Although CLARIN is a European research infrastructure, it is not limited to European Languages or Resources. Non-European languages are made available by several European CLARIN centres, and the South African Centre for Digital Language Resources (SADiLaR) has recently joined CLARIN. CLARIN has also partnerships with centres in the USA, and its deposit, metadata and single sign on framework can represent a viable solution for anyone wishing to easily set up a Language Resources Repository. Language Resources for Arabic and its varieties can be found via the CLARIN Virtual Language Observatory (VLO), the meta-catalogue which harvests all metadata from CLARIN centres and makes them searchable from a single access point. These can be oral recordings, written corpora, or lexicons. However many important corpora and lexical resources are currently not represented. Moreover, the CLARIN Language Resources Switchboard, a tool that helps to find language processing Web applications, currently lacks any NLP tool for Arabic.

This panel will consist of short presentations by ILC-CNR and USMBA researchers aimed at presenting CLARIN ERIC and its various aspects. CLARIN ERIC and its technical and scientific infrastructure will be introduced, showing how the latter is compliant with the internationally recognised FAIR principles, which recommend that data is Findable, Interoperable, Accessible and Reusable. Moreover, examples of resources and tools from various national consortia, notably CLARIN-IT, will be presented, as well as user involvement activities and currently on going projects (such as ParlaMint). The presentations will be followed by a panel discussion.


Dr. Simonetta Montemagni
Dr. Simonetta MontemagniILC-CNR - Director
Dr. Ouafae Nahli
Dr. Ouafae NahliILC-CNR
Making an Arabic Language Resources FAIR with ILC4CLARIN
Dr. Monica Monachini
Dr. Monica MonachiniILC-CNR - National coordinator of CLARIN-IT
Clarin National Consortium: The Example Of Clarin-it And The Ilc4clarin Centre
Pr. Maha El Biadi
Pr. Maha El BiadiLinguistics and English Studies, USMBA, Fez, Morocco
Data Types Linguists collect for their Language Studies
Dr. Francesca Frontini
Dr. Francesca FrontiniILC-CNR and CLARIN ERIC Board of Directors
CLARIN ERIC and what it can do for you