SLDR


Visible documents : 190
Members : 304 (42 countries)
Spoken languages : 159
 

Jhove
[Valid RSS]   [Valid Atom 1.0]

Speech & Language Data Repository

http://sldr.org

TGE-Adonis CLARIN OAI
Open archives (OAI-PMH)


-   [Sign up]   /   [Login]   - 
--- --- --- --- --- --- --- --- --- --- --- --- 
/ 中文 /  English / español / français / 

Speech & Language Data Repository (SLDR)

SLDR (pronounce ‘SpLanDR’) is replacing CRDO-Aix at the term of its experimental phase. Acronyms CRDO, CRDO-Aix and CRDO-Paris are therefore out of date and should gradually be replaced in all documents. Nonetheless, we are maintaining redirections from ‘crdo.fr’ for the sake of accessibility via old identifiers.

Speech & Language Data Repository (SLDR) is a Trusted Data Repository offering labs and scholars a free-of-charge service for sharing their oral/linguistic data and archiving it with the help of procedures compliant with the OAIS model for long-term preservation. Its entire storage is referenced in international repositories such as OLAC (Open Language Archives Community) and Virtual Language Observatory. Currently, packages are distributed via the TGE-Adonis grid hosted by CC-IN2P3 and preserved on the platform of CINES, an institutional archive beneficiary of the Data Seal of Approval.

Items of four kinds are available on this site:

  • Primary data : sound/video/image/text corpora and any language-related signal ;
  • Resources : annotations of corpora, lexicons, reference databases, systems of representation, grammars etc. ;
  • Tools for linguistic research ;
  • Collections of items as defined above.

Read our flyer, our guidelines for the sharing and long-term preservation of oral resources, and visit our slideshow!

The latest deposits (88) >> morepage 1  >>
[tempARK] Primary data (corpus) sldr000786 MARC-Fr (Brigitte BIGI, Pauline PÉRI)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)
Corpus français manuellement phonétisé et aligné d'une durée de 7 minutes. Composé de 3 sous-corpus : CID, AixOx et Grenelle.

>> Collection Multimodalité et débats à l'Assemblée nationale - Multimodality and debates in the National Assembly sldr000729
2012-05-13
Version 2
medium-term preservation
Primary data (corpus) swedia-000788 Swedia 2000 (Anders ERIKSSON)
Göteborgs universitet (GU, Göteborg SE)
Department of Language Studies, Umeå university (Umeå SE)
Department of linguistics, Stockholm University (SU, Stockholm SE)
Linguistics and Phonetics, Lund University (LU, Lund SE)
This research database consists of recordings of a little more than 1300 speakers representing 107 Swedish dialects. Each recording consists of two major parts. One part consisting of controlled material where specific aspects of Swedish phonology are elicited and one part containing spontaneous speech in the form of informal interviews or dialogues between two speakers of the dialect.
picto
2012-04-24
Version 1
source data
Primary data (corpus) sldr000787 Valjouffrey-Valbonnais 2012 (Médéric GASQUET-CYRUS)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)
Département de linguistique et phonétique générales, Université d'Aix-Marseille (Aix-en-Provence FR)
Audio & video recordings in the context of project “Mémoires et pratiques linguistiques en zone de transition entre francoprovençal et occitan : Valjouffrey et Valbonnais”

>> Collection Valjouffrey valjouffrey-000007
2012-04-22
Version 1
source data
Google earth
OpenStreetMap
• Conseil Régional Rhône-Alpes
[ARK] Primary data (corpus) sldr000033 Aix-MARSEC database (Daniel HIRST, Céline DE LOOZE, Cyril AURAN, Caroline BOUZON)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR) -> source
Aix-MARSEC is an evolutive database of spoken British English.
It is composed of over 5 hours of speech data together with annotations at several linguistic levels.
These annotations currently include: phonemes, syllables, syllable constituents, rhythm units, stress feet, words, intonation units, together with the output of the automatic MOMEL modelling and INTSINT symbolic coding algorithms.
The annotation is available both in the form of Praat TextGrids and in tabular form as an ascii text file.
-Aix-Marsec_Read_Me.pdf




2012-03-06
Version 2
long-term preservation

This material is Open Data
Primary data (corpus) sldr000785 Corpus of Kattu Nayaka/Jenu Kurumba interviews 2010 (Oriana REID-COLLINS)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)
Corpus of four semi-structured bilingual interviews (English and Kattu Nayaka/Jenu Kurumba) on the social representations of the participants.
Oriana Reid-Collins conducted the interviews in Gudalur, the Nilgiris, Tamil Nadu, India, between March and May 2010.
2012-02-22
Version 1
source data
Google earth
OpenStreetMap
• Kurumba Languages DOBES Project
[tempARK] Primary data (corpus) sldr000784 AixOx (Sophie HERMENT, Anastassia LOUKINA, Anne TORTEL)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR) -> source
A corpus of read speech: 40 one-minute passages (EUROM 1 corpus) in French and in English. The French passages are read by native speakers and English-speaking learners and the English passages are read by natives and French learners.
2012-02-06
Version 1
medium-term preservation
[tempARK] Primary data (corpus) sldr000780 MULTIPHONIA (MULTImodal database of PHONetics teaching methods in classroom InterActions) (Charlotte ALAZARD, Corine ASTESANO, Michel BILLIÈRES)
Unité de Recherche Interdisciplinaire Octogone - EA4156 (Toulouse FR)
Corpus résultant d'enregistrements longitudinaux de cours de correction phonétique en Français Langue Étrangère (FLE) entre avril et juin 2011.
Corpus constitué de 96 heures de cours avec des apprenants anglophones de niveaux débutant et intermédiaire, selon deux méthodes de correction phonétique (Méthode Verbo-Tonale et Méthode Articulatoire).
Les enregistrements ont été réalisés dans le studio vidéo de la Direction des Technologies de l'Information et de la Communication pour l'Enseignement (DTICE) à l'Université Toulouse II.
La durée de chaque cours est de 90 minutes environ.
picto
2012-02-05
Version 2
medium-term preservation
Primary data (corpus) sldr000783 La langue de Renivier, Vanuatu, Malekula - Renivier's language, Vanuatu, Malekula (Jocelyn AZNAR)
Centre de Recherche et de Documentation sur l'Océanie - UMR 7308 (CREDO, Marseille FR)
Traitement automatique du langage écrit et parlé, Laboratoire d'informatique fondamentale (TALEP, Marseille FR)
This corpus contains many stories and songs in Ronivier language and translated to Bislama. It also contains many translated grammatical features of Ronivier language. In addition many pictures of trees and birds are supplied with their names translated to the language.
picto


2012-01-26
Version 1
source data
[tempARK] Primary data (corpus) sldr000782 DySpoLec (Muriel LALAIN)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR) -> source
The DySpoLec corpus is an audio corpus featuring recordings of 19 normal readers and dyslexic children (10-11-year-old). Intended for the study of prosodic characteristics of dyslexia, this corpus proposes the audio recording of each subject during a narrative production task (spontaneous speech) and during a reading task (read speech). The total duration of the corpus is about 40 minutes; the durations of productions may vary from 24 seconds to 2.30 minutes depending on subjects.
picto
Documentation_DySpoLec.pdf

control - reading

control - spontaneous

dyslexia - reading

dyslexia - spontaneous
2012-01-10
Version 1
medium-term preservation
[tempARK] Primary data (corpus) sldr000781 REVEL, corpus 2011 (Mathilde SPINI)
Département de sciences du langage, Université d'Aix-Marseille (Aix-en-Provence FR)
Enquête sociolinguistique réalisée au cours de l'année 2011, autour d'une locutrice du "patois" et de son entourage, visant à :
- rassembler différents éléments de description de la langue et de ses usages dans le village aujourd'hui
- construire un corpus audio d'interactions spontanées en patois et en français, et de discours autour de la langue, de ses limites géographiques, de la culture paysanne et des diverses mémoires des locuteurs.
picto

cinq sous ils ont coûté les sabots

ero bouno la confituro

ils disent lou au lieu de dire lé

parler patois ça fait régresser
2011-12-09
Version 1
medium-term preservation
[ARK] Secondary data (resource) sldr000777 Intonation of Conversational English (Educated Southern British) by Wiktor Jassem (1952) (Wiktor JASSEM)
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)
Book.
Author: Jassem, Wiktor.
Date: 1952.
Title: Intonation of Colloquial English (Educated Southern British).
Publisher: Wrocław, Nakładem Wrocławskiego Towarzystwa Naukowego; Skład Glówny: Dom Ksiązki.
Series: Prace Wrockławskiego Towarzystwa Naukowego. Travaux de la Société des Sciences et de Lettres de Wrockław Seria A.
Number: 45.
Pages: 122
picto
2011-11-24
Version 1
long-term preservation

This material is Open Data
[ARK] Primary data (corpus) sldr000736 Journées « Patois »
Laboratoire parole et langage - UMR 7309 (LPL, Aix-en-Provence FR)
Département de linguistique et phonétique générales, Université d'Aix-Marseille (Aix-en-Provence FR)
These encounters between speakers of many regional languages have been initiated by Ms. Marcelle Bernard Brunel, spouse of René Péry, as an extension of research work on the patois (dialects) of Valbonnais and neighbouring areas.

>> Collection Code-switching sldr000762
>> Collection Valjouffrey valjouffrey-000007
picto
2011-10-31
Version 2
long-term preservation
Google earth
OpenStreetMap
• Délégation générale à la langue française et aux langues de France (DGLFLF)

This material is Open Data
page 1  >>

The 8 most frequent downloadings under SLDR licence
Primary data (corpus) Videos of CIDDownloaded 51 time(s) (?)
Secondary data (resource) Annotations of CID (Roxane BERTRAND)Downloaded 47 time(s) (?)
Primary data (corpus) Aix-MARSEC database (Daniel HIRST, Céline DE LOOZE, Cyril AURAN, Caroline BOUZON)Downloaded 35 time(s) (?)
Secondary data (resource) VfrLPL (Stéphane RAUZY)Downloaded 27 time(s) (?)
Secondary data (resource) Grammar of French language (GP) (Marie-Laure GUéNOT)Downloaded 20 time(s) (?)
Primary data (corpus) ANGLISH (Anne TORTEL, Daniel HIRST)Downloaded 14 time(s) (?)
Primary data (corpus) Valjouffrey - corpus 2010-2011 (Médéric GASQUET-CYRUS)Downloaded 11 time(s) (?)
Primary data (corpus) EUROM1_frDownloaded 11 time(s) (?)

This site has been declared to Commission Nationale de l’Informatique et des Libertés (CNIL) under agreement Nr.1222972 on 26 March 2008. As per French Law, any person cited by name is granted access to, modification, correction and suppression of data relative to him/her (art. 34 of the « Informatique et Libertés » act of 6 January 1978). To exert your right, send a message to webmaster(at)sldr.org.

This site is optimized for FireFox or any browser with the 'tabs' option set.