|
||
Transcription of American Norwegian The recordings are transcribed like the Norwegian recordings in the Nordic Dialect Corpus. For each recording, there is one phonetic transcription and one orthographic transcription.
Phonetic transcription The phonetic transcription method is based upon Papazian and Helleland's Norsk talemål. Lokal og sosial variasjon (2005), but we use no special characters, only the Norwegian alphabet. Also, the transcription is quite broad. There are two reasons for these choices. First, we want the transcriptions to be easy to read for most people. Second, there are many transcribers involved in the work, and transcriptions that are too detailed would lead to an extension of training time and greater risk for individual differences between the transcriptions.
Orthographic transcription An orthographic transcription is important because it is a generalization above all the variation. This way one can do general searches, and use automated methods, such as tagging. Doing the orthographic transcription is much faster than doing the phonetic one, because we use a semi-automatic dialect transliterator which translates from the phonetic transcription to bokmål orthography - and for American Norwegian - English where it is neeed. This transliterator was developed by the Text Laboratory specifically for the NorDiaSyn Project. We translate word for word so that we retain the syntax of the phonetic transcription in addition to words that are not standardized in Norwegian or English.
Example of the two transcriptions: Orthographic: 'det er hard to finne' Phonetic: d e haRd tu finn Orthographic: 'vi solgte noe av det og renta ut resten' Phonetic: vi sellt nå tå ri å rennta ut resst'n
Transcription Guidelines (for Nordic Dialect Corpus, in Norwegian) Transcription Guidelines for Norwegian in America (in Norwegian) Translation to orthographic transcription - Guidelines (for Nordic Dialect Corpus, in Norwegian)
|
||