CMDI 1.1. Metadata
Header
MdCreator: Tekstlaboratoriet
MdCreationDate: 2021-06-29
MdSelfLink:
MdProfile: clarin.eu:cr1:p_1407745711925
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘lia-norsk-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: http://tekstlab.uio.no/LIA/norsk/index.html
ResourceProxy [id=‘lia-treebank’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: http://tekstlab.uio.no/LIA/norsk/index.html
ResourceProxy [id=‘lia-corpus’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://tekstlab.uio.no/glossa2/lia_norsk
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: treebank
Res1 [ref=‘lia-corpus’]:
Res2 [ref=‘lia-treebank’]:
IsPartOfList:
Components
corpusProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’] [ref=‘lia-treebank’]:
resourceType [ref=‘lia-treebank’]: corpus
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’]:
resourceName [ref=‘lia-treebank’] [xml:lang=‘nb’]: LIA-trebanken
resourceName [ref=‘lia-treebank’] [xml:lang=‘en’]: The LIA Treebank
description [ref=‘lia-treebank’] [xml:lang=‘en’]: The LIA Treebank includes 5250 speech segments and 55 410 tokens from the speech corpus LIA Norwegian. The treebank is annotated with morphological and dependency-style syntactic analysis and manually corrected. The treebank is available in both conllx-format and conllu-format.

LIA Norwegian is a speech corpus with old recordings (1939 - 1996) from four Norwegian universities: NTNU, UoB, UoO and UoT.
description [ref=‘lia-treebank’] [xml:lang=‘nb’]: LIA-trebanken består av 5250 talemålssegment og 55 410 ord/token frå talespråkskorpuset LIA norsk. Trebanken er annotert morfologisk og syntaktisk og manuelt korrigert. Trebanken er tilgjengelig både i conllx-format og conllu-format.

LIA norsk er et talespråkskorpus med gamle opptak (1939 - 1996) fra fire norske universitet: NTNU, UiB, UiO og UiT.
resourceShortName [ref=‘lia-treebank’] [xml:lang=‘en’]: The LIA Treebank
resourceShortName [ref=‘lia-treebank’] [xml:lang=‘nb’]: LIA-trebanken
url: http://tekstlab.uio.no/LIA/norsk/index.html
url: http://tekstlab.uio.no/LIA/norsk/index_english.html
PID: http://hdl.handle.net/11538/0000-000C-368B-B
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation: http://tekstlab.uio.no/LIA/trebank.html
downloadLocation: http://tekstlab.uio.no/LIA/norsk/index_english.html
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’] [ref=‘lia-treebank’]:
licenceFamily [ref=‘lia-treebank’]: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: http://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
contact:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’]:
metadataCreationDate: 2018-09-26
metadataLastDateUpdated: 2021-06-29
metadataCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’]:
version: First version (June 2021)
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’]:
validated: true
validationType: content
validationMode: manual
validationModeDetails: The treebank is manually corrected by at least one person
validationExtent: partial
validator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The LIA project
organizationShortName: LIA
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:
role: documentation
documentUnstructured: http://tekstlab.uio.no/LIA/trebank.html
(In Norwegian)
documentationStructured [ComponentId=‘clarin.eu:cr1:c_1361876010648’]:
role: documentation
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: other
title: The LIA Treebank of Spoken Norwegian Dialects
author: Lilja Øvrelid, Andre Kåsen, Kristin Hagen, Anders Nøklestad, Per Erik Solberg and Janne Bondi Johannessen
editor: Nicoletta Calzolari et al
year: 2018
bookTitle: Proceedings of the Eleventh International Conference on Language Resources and Evaluation
url: http://www.lrec-conf.org/proceedings/lrec2018/summaries/642.html
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’]:
creationStartDate: 2014-04-01
creationEndDate: 2019-12-31
resourceCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The LIA project
(Project participants and employees in the LIA project)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://tekstlab.uio.no/LIA/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName [xml:lang=‘nb’]: LIA (Language Infrastructure made Accessible)
projectShortName: LIA
projectID: 22 59 41
url: http://tekstlab.uio.no/LIA/
url: https://www.hf.uio.no/iln/english/research/projects/language-infrastructure-made-accessible/index.html
fundingType: nationalFunds
funder: The Research Council of Norway
fundingCountry: Norway
projectStartDate: 2014-01-04
projectEndDate: 2019-12-31
corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’] [ref=‘lia-treebank’]:
corpusType [ref=‘lia-treebank’]: Treebank
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’] [ref=‘lia-treebank’]:
mediaType: text
corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:
textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’] [ref=‘lia-treebank’]:
mimeType: Downloadable in two formats: conllx-format and conllu-format
sizePerTextFormat [ComponentId=‘clarin.eu:cr1:c_1447674760342’]:
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 55 410
sizeUnit: tokens
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 5250 spoken segments
sizeUnit: utterances
characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:
characterEncoding: utf-8
corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’] [ref=‘lia-treebank’]:
personSourceSetInfo [ComponentId=‘clarin.eu:cr1:c_1360931019775’]:
ageOfPersons: teenager
ageOfPersons: adult
ageOfPersons: elderly
sexOfPersons: mixed
originOfPersons: native
dialectAccentOfPersons: Dialects from 17 places in Norway
geographicDistributionOfPersons: All over Norway
lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:
lingualityType: monolingual
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: No
languageName: Norwegian
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: Nn
languageName: Norwegian Nynorsk
modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:
modalityType: spokenLanguage
modalityTypeDetails: Norwegian dialects. Orthographic transcription
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 55 410
sizeUnit: tokens
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 5250 spoken segments
sizeUnit: utterances
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: speechAnnotation-orthographicTranscription
annotationType: morphosyntacticAnnotation-posTagging
annotationType: syntacticAnnotation-treebanks
annotationDescription: Original version in conllx-format,annotated with morphological and dependency-style syntactic analysis. The treebank has also been automatically converted to the UD scheme and is available in conllu-format.
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘en’]: NDT Guidelines for Morphological and Syntactic Annotation
author: Kari Kinn, Per Erik Solberg og Pål Kristian Eriksen.
Translated from Norwegian to English by Per Erik Solberg
year: 2013
url: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-10/
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘bm’]: Retningslinjer for syntaktisk annotasjon i LIA
author: Andre Kåsen, Kristin Hagen, Lilja Øvrelid, Signe Laake, Håvard Østli
year: 2019
url: http://tekstlab.uio.no/LIA/pdf/parseretningslinjer-lia12042019.pdf
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: https://ufal.mff.cuni.cz/tred/
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: Read about the annotation process in Norwegian: http://tekstlab.uio.no/LIA/verktoy.html
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: informal
unstandardisedGenre: conversations and informal interviews
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: semi formal
unstandardisedGenre: interviews
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: 1939 - 1995
geographicCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760357’]:
geographicCoverage: 17 places from all over Norway