NoTa-Oslo: Norwegian Speech Corpus - the Oslo part


NoTa-Oslo is a speech corpus of interviews and conversations with 166 informants born and raised in Oslo and the Oslo area. The informants were carefully selected with respect to sociolinguistic variables and are therefore representative in terms of age, gender, place of residence and education. NoTa-Oslo consists of approximately 900,000 words that are orthographically transcribed and morphologically tagged. The corpus can be searched through a specially designed search interface, and the transcriptions are linked to audio and video files.

The NoTa-Oslo corpus was built from 2004 to 2006 and is available for research. Please contact the Text Laboratory if you need more information or want to use the corpus.

See a NoTa-Oslo mini-demo (the user name and password are both "demo").
