SLDR


Visible documents : 184
Members : 279 (40 countries)
Spoken languages : 155
 

Jhove
[Valid RSS]   [Valid Atom 1.0]

Speech & Language Data Repository

http://sldr.org

TGE-Adonis CLARIN OAI
Open archives (OAI-PMH)


-   [Sign up]   /   [Login]   - 
--- --- --- --- --- --- --- --- --- --- --- --- 
/ 中文 /  English / español / français / 

Speech & Language Data Repository (SLDR)

SLDR (pronounce ‘SpLanDR’) is replacing CRDO-Aix at the term of its experimental phase. Acronyms CRDO, CRDO-Aix and CRDO-Paris are therefore out of date and should gradually be replaced in all documents. Nonetheless, we are maintaining redirections from ‘crdo.fr’ for the sake of accessibility via old identifiers.

Speech & Language Data Repository (SLDR) is offering labs and scholars a free-of-charge service for sharing their oral/linguistic data and archiving it with the help of procedures compliant with the OAIS model for long-term preservation. Its entire storage is referenced in international repositories such as OLAC (Open Language Archives Community) and Virtual Language Observatory. Items of four kinds are available on this site:

  • Primary data : sound/video/image/text corpora and any language-related signal ;
  • Resources : annotations of corpora, lexicons, reference databases, systems of representation, grammars etc. ;
  • Tools for linguistic research ;
  • Collections of items as defined above.

Read our flyer, our guidelines for the sharing and long-term preservation of oral resources, and visit our slideshow!

The latest deposits (81) >> morepage 1  >>
Primary data (corpus) sldr000783 La langue de Renivier, Vanuatu, Malekula - Renivier's language, Vanuatu, Malekula (Jocelyn AZNAR)
Département de linguistique et phonétique générales, Université d’Aix-Marseille (Aix-en-Provence FR)
This corpus contains many stories and songs in Ronivier language and translated to Bislama. It also contains many translated grammatical features of Ronivier language. In addition many pictures of trees and birds are supplied with their names translated to the language.
picto


2012-01-26
Version 1
source data
Primary data (corpus) sldr000782 DySpoLec (Muriel LALAIN)
Laboratoire parole et langage (LPL, Aix-en-Provence FR)
The DySpoLec corpus is an audio corpus featuring recordings of 19 normal readers and dyslexic children (10-11-year-old). Intended for the study of prosodic characteristics of dyslexia, this corpus proposes the audio recording of each subject during a narrative production task (spontaneous speech) and during a reading task (read speech). The total duration of the corpus is about 40 minutes; the durations of productions may vary from 24 seconds to 2.30 minutes depending on subjects.
2012-01-10
Version 1
source data
[tempARK] Primary data (corpus) sldr000781 REVEL, corpus 2011 (Mathilde SPINI)
Département de sciences du langage, Université d’Aix-Marseille (Aix-en-Provence FR)
Enquête sociolinguistique réalisée au cours de l'année 2011, autour d'une locutrice du "patois" et de son entourage, visant à :
- rassembler différents éléments de description de la langue et de ses usages dans le village aujourd'hui
- construire un corpus audio d'interactions spontanées en patois et en français, et de discours autour de la langue, de ses limites géographiques, de la culture paysanne et des diverses mémoires des locuteurs.
picto

cinq sous ils ont coûté les sabots

ero bouno la confituro

ils disent lou au lieu de dire lé

parler patois ça fait régresser
2011-12-09
Version 1
dissemination
Primary data (corpus) sldr000780 MULTIPHONIA (MULTImodal database of PHONetics teaching methods in classroom InterActions) (Charlotte ALAZARD, Corine ASTESANO, Michel BILLIÈRES)
Unité de Recherche Interdisciplinaire Octogone (EA4156, Toulouse FR)
Corpus résultant d'enregistrements longitudinaux de cours de correction phonétique en Français Langue Étrangère (FLE) entre avril et juin 2011.
Corpus constitué de 81 heures de cours avec des apprenants anglophones de niveaux débutant et intermédiaire, selon deux méthodes de correction phonétique (Méthode Verbo-Tonale et Méthode Articulatoire).
Les enregistrements ont été réalisés dans le studio vidéo de la Direction des Technologies de l'Information et de la Communication pour l'Enseignement (DTICE) à l'Université Toulouse II.
La durée de chaque cours est de 90 minutes environ.
picto
2011-12-08
Version 1
source data
[ARK] Secondary data (resource) sldr000777 Intonation of Conversational English (Educated Southern British) by Wiktor Jassem (1952) (Daniel HIRST)
Laboratoire parole et langage (LPL, Aix-en-Provence FR)
Book.
Author: Jassem, Wiktor.
Date: 1952.
Title: Intonation of Colloquial English (Educated Southern British).
Publisher: Wrocław, Nakładem Wrocławskiego Towarzystwa Naukowego; Skład Glówny: Dom Ksiązki.
Series: Prace Wrockławskiego Towarzystwa Naukowego. Travaux de la Société des Sciences et de Lettres de Wrockław Seria A.
Number: 45.
Pages: 122
picto
2011-11-24
Version 1
long-term preservation

This material is Open Data
[ARK] Primary data (corpus) sldr000776 Buckeye Corpus of Conversational Speech (Bernard BEL)
Department of Psychology, Ohio State University (Columbus US)
This corpus contains high-quality recordings from 40 speakers in Columbus OH conversing freely with an interviewer. The speech has been orthographically transcribed and phonetically labeled. The audio and text files, together with time-aligned phonetic labels, are stored in a format for use with speech analysis software (Xwaves and Wavesurfer). Software for searching the transcription files is included.
Current documentation is available from http://buckeyecorpus.osu.edu.
picto
2011-10-24
Version 2
long-term preservation
(Publications)
• National Institute on Deafness and other Communication Disorders
• Office of Research at Ohio State University
[ARK] Primary data (corpus) sldr000774 Happy Birthday corpus (Pauline LARROUY)
School of Psychology, Liège University (Liège BE)
Popular song sung by 166 french occasionnal singers. The participants sang the French version of the popular tune “Happy Birthday”, without a compulsory tonality, after production of two glissendi (a continuous glide from a low note to a high note and vice versa). The aim of these glissendi was to warm up the vocal organs, to verify the vocal capacity of the subjects and to encourage a lack of inhibition in front of the experimenter and the recording equipment.
picto



2011-09-21
Version 1
long-term preservation
[ARK] Primary data (corpus) sldr000019 Corpus Représentations linguistiques Marseille 2007 - A corpus of linguistic interactions in Marseille, 2007 (Cécile PETITJEAN)
Département de sciences du langage, Université d’Aix-Marseille (Aix-en-Provence FR)
Laboratoire parole et langage (LPL, Aix-en-Provence FR) -> source
This corpus is based on fieldwork in Marseille. It consists of 10 semi-structured interviews conducted with informants born in Marseille, between January and November 2007.
All informants are French native, born and living in Marseille. Other criteria considered in the sample were gender, age and occupational status.
The recordings took place in informants' homes or at their workplace.
The duration of each interview varied between 12 and 30 minutes.
Related corpus: see http://crdo.fr/crdo000020
picto picto2





2011-08-10
Version 4
long-term preservation
(Publications)
[ARK] Primary data (corpus) sldr000020 Corpus Représentations linguistiques Lausanne 2007 - A corpus of linguistic interactions in Lausanne, 2007 (Cécile PETITJEAN)
Centre de linguistique appliquée (CLA, Neuchâtel CH)
Département de sciences du langage, Université d’Aix-Marseille (Aix-en-Provence FR)
Laboratoire parole et langage (LPL, Aix-en-Provence FR) -> source
This corpus is based on fieldwork in Lausanne (Canton de Vaud, Switzerland) in 2007. It consists of 10 semi-structured interviews conducted with informants living in Lausanne.
All informants are native speakers of French, born in Lausanne. Other criteria considered in the sample were gender, age and occupational status.
The recordings took place in informants' homes, at their workplace or in public spaces.
The duration of each interview varied between 12 and 43 minutes.
Related corpus: see http://crdo.fr/crdo000019
picto





2011-08-10
Version 3
long-term preservation
[tempARK] Primary data (corpus) sldr000764 Valjouffrey - corpus 2010-2011 (Médéric GASQUET-CYRUS)
Laboratoire parole et langage (LPL, Aix-en-Provence FR) -> source
Département de linguistique et phonétique générales, Université d’Aix-Marseille (Aix-en-Provence FR)
1) The analytical documentation of an almost extinct language: Valjouffrey's dialect/patois;
2) The construction of a multi-speaker corpus of spontaneous speech in response to specific requirements of research on prosody, gestures, language/communication interactions, and comparison of languages;
3) A sociolinguistic, cultural and historical enquiry on the Valjouffrey valley.

>> Collection Valjouffrey sldr000007
>> Collection Code-switching sldr000762

1-TexteDuProjet.pdf

Steps on the snow and the music of chairs...

“Tous des Roumains” (J. Gaillard)

“Both of us will speak patois… Well, people will say we're crazy!” (H. Balmet & J. Gaillard)

Snowplough (R. Bois, J. Gaillard, H. Balmet)
2011-07-15
Version 4
dissemination
(Publications)
Google earth
OpenStreetMap
• Délégation générale à la langue française et aux langues de France (DGLFLF)
• Fédération de Recherche Typologie et Universaux Linguistiques (TUL)
• Institut de Linguistique Française (ILF)
[tempARK] Primary data (corpus) sldr000773 Pilote-carambouille (Laurent PREVOT)
Laboratoire parole et langage (LPL, Aix-en-Provence FR)
Petit multilogue (4 personnes) enregistré en chambre sourde avec micro-casque. La situation est une partie du jeu de négociation "Carambouille".
picto picto2
2011-07-05
Version 1
dissemination
[tempARK] Tool sldr000009 Phonedit SIGNAIX (Alain GHIO, Robert ESPESSER)
Laboratoire parole et langage (LPL, Aix-en-Provence FR)
PHONEDIT Signaix is a software for the analysis of sound, aerodynamic, articulatory and electro-physiological signals developped by the "Parole et Langage" Laboratory, Aix-en-Provence, France. It provides a complete environment for the recording, the playback, the display, the analysis, the labeling of multiparametric data.
Current version (2009) authorises the control of EVA2 workstation. You can directly record aerodynamics data of EVA2 with Phonedit. (Use the "Tools" menu to select the recording device.)
PHONEDIT Signaix plugins can be used with Linux/Cygwin environment to customize processes with bash script. It runs with recent Windows operating system on PC.
PHONEDIT Signaix is free of charge and can be downloaded.

>> Collection LPL tools sldr000763
picto
manuel-installationFRA.txt
ManuelPhonedit2011-03-21.pdf
2011-06-17
Version 1
dissemination
(Publications)

This material is Open Data
page 1  >>

The 8 most frequent downloadings under SLDR licence
Primary data (corpus) Videos of CID (Roxane BERTRAND)Downloaded 50 time(s) (?)
Secondary data (resource) Annotations of CID (Roxane BERTRAND)Downloaded 42 time(s) (?)
Secondary data (resource) VfrLPL (Stéphane RAUZY)Downloaded 27 time(s) (?)
Secondary data (resource) Grammar of French language (GP) (Marie-Laure GUéNOT)Downloaded 20 time(s) (?)
Primary data (corpus) ANGLISH (Anne TORTEL, Daniel HIRST)Downloaded 14 time(s) (?)
Primary data (corpus) Valjouffrey - corpus 2010-2011 (Médéric GASQUET-CYRUS)Downloaded 11 time(s) (?)
Primary data (corpus) EUROM1_fr (Daniel HIRST)Downloaded 11 time(s) (?)
Primary data (corpus) MAPTASK-AIX (Ellen Bard, Corine Astésano, Cheryl Frenck-Mestre, Mariapaola D'Imperio, Alice Turk, Noël Nguyen)Downloaded 8 time(s) (?)

This site has been declared to Commission Nationale de l’Informatique et des Libertés (CNIL) under agreement Nr.1222972 on 26 March 2008. As per French Law, any person cited by name is granted access to, modification, correction and suppression of data relative to him/her (art. 34 of the « Informatique et Libertés » act of 6 January 1978). To exert your right, send a message to webmaster(at)sldr.org.

This site is optimized for FireFox or any browser with the 'tabs' option set.