Aix-MARSEC is an evolutive database of spoken British English. It is composed of over 5 hours of speech data together with annotations at several linguistic levels. These annotations currently include: phonemes, syllables, syllable constituents, rhythm units, stress feet, words, intonation units, together with the output of the automatic MOMEL modelling and INTSINT symbolic coding algorithms. The annotation is available both in the form of Praat TextGrids and in tabular form as an ascii text file.
(hasPart primary data (corpus) sldr000751 Forms and prosodic functions in English)
Approximate file(s) size (Mb)
508
Approximate duration
332 minutes
Format
audio --> WAV
Labelling/tagging
The annotation component currently comprises a set of Praat TextGrids for the recordings and an ascii text file containing tabular data extracted from the database. The following tiers are included in the TextGrid files: • Phonemes - SAMPA labels for the phonemes • Syllables - defined according to the Maximal Onset Hypothesis. • Abercrombie – feet (labelled F, pauses are labelled P) begin after an intonation boundary or with a stressed syllable and continue until the next boundary or stressed syllable. • Jassem – Rhythm Units are of two kinds: Narrow rhythm units (NRU), which begin with an accented syllable and end at the following word boundary, and Anacruses (labelled ANA) containing any syllables not in a NRU. • Text - word labels including tonetic stress marks (TSM. • UI - Intonation Units as delimited by minor (|) and major (||) intonation boundaries. • Intsint - tonal labels using the INTSINT alphabet obtained automatically from the Valeurs de F0 tier. • Valeurs de F0 - "target pitch" values output by the automatic F0 modelling algorithm Momel.
Derogation to the principle of open access to public archives (see documentation)
AR042 (25 years) - Documents developed under a contract for the provision of services performed on behalf of one or more specific persons. (Code du Patrimoine, art. L. 213-2, I, 1, b) Starting on 2008-06-01
1 segment(s) in current version ark:/87895/1.4-126690
Archived with CINES on
Tue, 27 Jul 2010 19:49:14 GMT Version #1
New identifiantDocPac
Metadata : 145515, 167603, 174929, 204376
Last modification of metadata
Sun, 14 Aug 2011 11:42:07 GMT par Bernard BEL
This site has been declared to Commission Nationale de l’Informatique et des Libertés (CNIL) under agreement Nr.1222972 on 26 March 2008. As per French Law, any person cited by name is granted access to, modification, correction and suppression of data relative to him/her (art. 34 of the « Informatique et Libertés » act of 6 January 1978). To exert your right, send a message to webmaster(at)sldr.org.