SLDR - 共享语言数据 -
English version
Versión en español
Version française
Read our single-page presentation
http://sldr.org/doc/show/LeSLDR_en.pdf and detailed slide-show
http://sldr.org/doc/show/SldrPresentation-en.pdf
The CRDO project (a Resource Center for the Description of Oral) was initiated in 2006 under the banner of CNRS, the National Center for Scientific Research in France. Now renamed SLDR, this Trusted Data Repository is offering labs and independent scholars a free-of-charge service for the storage, sharing and long-term preservation of sound/video recordings and their related material in compliance with the OAIS model.
This site is dedicated to SLDR (identity:
http://sldr.org/oai-pmh.php?verb=Identify).
这里可划分为三类 :
Oral and video corpora口语和视频语料库以及预演 ;
Tools是给语言学研究专用的 ;
Resources 确切地说是一些词汇,参考指标,系统代表性、文法, ...
Collections binding together several items of the defined types.
SLDR (
sldr.org) is hosted by Aix-Marseille University. Currently, packages are distributed via the TGE-Adonis grid hosted by CC-IN2P3 and preserved on the platform of CINES, an institutional archive beneficiary of the Data Seal of Approval.
Sharing linguistic resources is a comprehensive response to the challenge of bringing together knowledge that has so far been scattered over such diverse domains as descriptive, formal and computational linguistics, literature, translation science, neuroscience and psycholinguistics.
Read SLDR guidelines for the sharing and long-term preservation of oral resources
SLDR: Current state of the art
- Read this page
CRDO report to CLARIN (26 January 2011)
CRDO-Aix renamed SLDR (CLARIN News)
Features specific to SLDR
- SLDR makes it possible to share documents as current data (on the submission site), as a medium-term archive (via the development platform of CC-IN2P3) or as a long-term archive (via the production platform of CC-IN2P3 associated with the archival platform of CINES). Procedures for accessing documents are identical in these three cases. Thus, producers may easily modify the archival status of any item in compliance with the research program from which it originates.
All situations have been taken care of with respect to access rights:
- Items in open access (optionally under Creative Commons licence);
- Items in restricted access (as per user groups);
- Items in restricted access with some file in open access;
- Items in privileged access to some users (downloading, source files, versions, metadata editing).
- Any document (datastream) in a given item may be assigned a private/public status. Public status allows for open access (skipping user's authentication).
- The private/public status of any document in any version of an item can be modified without uploading a new version of the item.
- Additional descriptive documents (free consent forms, licences, notes…) may be added/deleted/modified without uploading a new version of the item.
- Multi-language access to data: navigation is possible in four languages (English/Spanish/French/Chinese) ; descriptions, keywords and tables of contents may be entered in an optional language in addition to navigation languages.
- SLDR is at the service of producers of speech/linguistic resources, i.e. laboratories (see
current list) and individual producers irrespective of institutional/geographical boundaries. Its client-server architecture allows producer laboratories to install their own web services for the display/streaming/analysis of their data distributed via SLDR. - The range of contributions is as wide as possible, from experimental linguistics (laboratory data) to contact linguistics (field data).
- Contributors do their best to supplement data with resources and tools that will facilitate their processing. The aim is to offer a whole set of devices, from primary acoustic signals to the editing and processing of these signals. This service should give access to information, tools and methods allowing data analysis as well as annotations produced by these tools.
- Each item may be put in
relation with publications, teams and research programs. - An item may be stored with SLDR as a personal deposit or in connection with the institution(s) which its author(s) was/were affiliated with at the time of its production. Each institution may declare a path from which its information system will be able to deal with instructions from SLDR triggering processes associated with a particular item. (See details and the
list of institutions). - Downloadings invoking the SLDR non-commercial licence are traced and followed up. Users commit themselves (1) to mention their use of downloaded items in every publication and (2) to enter on the SLDR website the references of
publications based on this use. In this way, the relevance and utility of each corpus, tool or resource distributed by SLDR may be assessed by the speech research community. - Users of any item distributed by SLDR are granted access to the names, professional affiliation and fields of interest of all users who downloaded the same item. This feature is complementary to the sharing of
publications. Its aim is to promote the emergence of communities of producers and users (Web 2.0 approach) collaborating on research projects making an optimal use of available resources.
- Recent additions to the SLDR site (from the RSS feed)
- Pilot project CINES/CC-IN2P3/TGE-Adonis
Project Preservation Description Information (PPDI)- Legal aspects
- OLAC
- OAIS
- Data format
- Work groups
- Links
Included from reference-documents
Documents de référence / Reference documents
- Projet pilote TGE-Adonis/CINES/CC-IN2P3/CRDO d'archivage pérenne et de mutualisation des données orales
Pilot project for the storage, long-term preservation and sharing of oral resources TGE-Adonis/CINES/CC-IN2P3/CRDO) - Archive du projet pilote :
ark:/87895/1.4-187408 - Consortium Corpus oraux et multimodaux (IRCOM) de la TGIR-CORPUS
Consortium on oral and multimodal corpora (IRCOM) of TGIR-CORPUS
Lettre de mission du CRDO (15 février 2006)- Rapports d'activités :
CRDO Aix/Paris (juin 2006) et
CRDO-Paris (décembre 2006)
Hosting of IT services and data for Human and Social Sciences in France (Olof BÄRRING, 31/1/2008)
Mutualisation de la pérennisation et de l'accès aux données - Projet pilote sur les données orales version 0.7 (TGE-Adonis, 30/6/2008)
Mutualisation de la pérennisation et de l'accès aux données en SHS : bilan du projet pilote sur les données orales (Claude HUC, 12 mars 2009)
Rapport d'avancement du projet pilote sur les données orales (Claude HUC, 2 avril 2009)
TGE Adonis – Projet d’archivage des données produites en France par les SHS / Projet pilote sur les donneées orales, novembre 2008 – avril 2009
Rapport d’expertise sur la version préliminaire du résumé opérationnel (Yves MARCOUX, 28 mai 2009)
Évaluation du projet pilote 22 juin 2009 (TGE-Adonis)
Présentation à la Direction des Archives de France du projet pilote d'archivage pérenne des données orales, 23 octobre 2009- Lettre d'intention Lacito/LPL (18 mai 2010) :
texte et
annexe Convention régissant un service de préservation à long terme de documents numériques - entre le CINES et le CNRS au nom et pour le compte du TGE-Adonis, 25 mai 2010
Annexe 0 : Liste des services versants habilités par le service commanditaire à verser des documents électroniques au service d'archives
- Mise au point avant le passage en production de l'archivage pérenne, 18 juin 2010
- Bilan du projet pilote (janvier 2011)
Vers un CRDO « élargi » : rapport (mensonger) de Mathilde Schmitt, mai 2011
Lettre à la direction de TGE-Adonis (Direction du LPL, 15 juin 2011) => remerciements + communication sur CRDO-Aix
Lettre à la direction de TGE-Adonis (Direction du LPL, 29 juin 2011) => cadre juridique (rappel le 2 mars 2012, documents reçus le 19 mars)- Les services versants CRDO-Aix et CRDO-Paris : caractéristiques techniques
CRDO-Aix renamed SLDR (CLARIN News)
SLDR presentation in CLARIN-D tutorial (7 September 2011)
Présentation de CLARIN à la réunion du Consortium Corpus oraux et multimodaux (IRCOM) de la TGIR-CORPUS, 5 octobre 2011
SldrWiki
This wiki space is dedicated to the production and sharing of information about:
- projects related to corpora, tools and resources distributed or documented by SLDR;
- teams? working on these projects;
- scholars taking part in these teams;
- documentation on corpora, tools and resources distributed by SLDR — direct links may be retrieved from records stored on the
SLDR site.
