NorDiaSyn - Home

NorDiaSyn (2007 - 2013) was one of several subprojects belonging to the joint Nordic project Scandinavian Dialect Syntax - ScanDiaSyn (2004 - 2009). The ScanDiaSyn network explored syntactic variation in the Scandinavian dialects, and simultaneously collected large amounts of material from speakers across the Scandinavian dialect continuum. The speech material is made available for research through the Nordic Dialect Corpus and Nordic Syntax Database.

The Norwegian Dialect Corpus and Syntax Database is the Norwegian part of the Nordic Dialect Corpus and Syntax Database. The corpus and the database are developed in collaboration with our partners in ScanDiaSyn and Nordic Center of Excellence in Microcomparative Syntax (NORMS). The Norwegian Dialect Corpus is the part of the corpus that includes recordings of Norwegian speech. The Nordic Syntax Database contains data from questionnaires that are designed to map the grammatical variation in Nordic dialects.

In the Norwegian Dialect Corpus, one can find nearly 2 million words from Norwegian dialects. Version 4.0 of the corpus contains recordings from 111 different measuring points in Norway, evenly distributed across the country. (See map of the measurement points.) Data has been collected in the period 2006-2010.
Read more about the contents of the corpus under the tab About the Data Collection.

Version 3 of the corpus includes recordings from the Dialect Archive at the Department of Linguistics and Scandinavian Studies at the University of Oslo. These transcriptions are now moved to LIA Norwegian - Corpus of Old Dialect Recordings. The transcriptions from the dialect archive are financed by Norsk ordbok 2014 (NO2014).

Through the search interface Glossa (also developed by the Text Laboratory, see the Tools tab at the top of the page) you can search the corpus through words, grammatical affixes, grammatical categories, etc. The search results come up as concordances, connected directly to audio and video.

The data collection in NorDiaSyn was led by Janne Bondi Johannessen, at the Text Laboratory, UiO, in close cooperation with Tor A. Åfarli, NTNU, and Øystein A. Vangsnes, UiT. The technical development is done by the Text Laboratory. For further information, see the Project Info tab.

The Nordic Atlas of Language Structures (NALS) Online is a collection of more than 50 articles on syntactic phenomena in the North Germanic languages. The articles are illustrated with dialect maps showing both traditional and newly discovered isoglosses. The dialect atlas is based on information found in the Nordic Dialect Corpus and the Nordic syntax database. Go to NALS Online.

Refer to the corpus as follows:
Johannessen, Janne Bondi, Joel Priestley, Kristin Hagen, Tor Anders Åfarli, and Øystein Alexander Vangsnes. 2009. "The Nordic dialect Corpus - an Advanced Research Tool". In Jokinen, Kristiina and Eckhard Bick (eds.): Proceedings of the 17th Nordic Conference of Computational Linguistics NODALIDA 2009. NEALT Proceedings Series Volume 4. (Read the paper)
Please also add the corpus handle:
Nordic Dialect Corpus: https://hdl.handle.net/11538/0000-0005-E7C7-6

Refer to the syntax database as follows:
Lindstad, Arne Martinus; Nøklestad, Anders; Johannessen, Janne Bondi; Vangsnes, Øystein Alexander. 2009. The Nordic Dialect Database: Mapping Microsyntactic Variation in the Scandinavian Languages. In Jokinen, Kristiina and Eckhard Bick (eds.): NEALT Proceedings Series;Volum 4. (Read the paper).
Please also add the database handle:
The Nordic Syntax Database: https://hdl.handle.net/11538/0000-0005-E7C8-5