CMDI 1.1 Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2024-01-08
MdProfile: clarin.eu:cr1:p_1422885449331
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘ndc-parser-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: https://tekstlab.uio.no/nota/scandiasyn/treebank.html
ResourceProxy [id=‘ndc-parser’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/ndc
ResourceProxy [id=‘ndc-treebank’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://github.com/textlab/spoken_norwegian_resources/tree/master/treebanks/Norwegian-BokmaalNDC
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: partOF
Res1 [ref=‘ndc-parser-lp’]:
Res2 [ref=‘ndc-parser’]:
ResourceRelation:
RelationType: trainedOn
Res1 [ref=‘ndc-parser’]:
Res2 [ref=‘ndc-treebank’]:
IsPartOfList:
Components
toolProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’] [ref=‘ndc-parser-lp’]:
resourceType: toolService
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’] [ref=‘ndc-parser-lp’]:
resourceName [cmd=‘lia-parser-lp’] [xml:lang=‘en’]: The NDC parser
resourceName [xml:lang=‘no’]: NDC-parseren
description [xml:lang=‘en’]: The NDC parser is a dependency parser for spoken Norwegian dialects trancribed to Bokmål. The parser is trained on the NDC Treebank.
The NDC parser is a so-called transition-based dependency parser, UUParser, developed at Uppsala University.
description [xml:lang=‘no’]: NDC-parseren er en dependensparser for transkripsjoner av norske dialekter på bokmål. Parseren er trent på NDC-trebanken. NDC-parseren er en såkalt transition-based dependensparser, UUparser, utviklet ved Uppsala Universitet.
resourceShortName: NDC parser
url: https://tekstlab.uio.no/nota/scandiasyn/treebank.html
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’] [ref=‘ndc-parser-lp’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’] [ref=‘ndc-parser-lp’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation: https://github.com/textlab/spoken_norwegian_resources/tree/master/parsers/clarino/ndc
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’]:
licenceFamily: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: https://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/english/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
iprHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’] [ref=‘ndc-parser-lp’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
contact [ref=‘ndc-parser-lp’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’] [ref=‘ndc-parser-lp’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’] [ref=‘ndc-parser-lp’]:
metadataCreationDate: 2024-04-08
metadataLastDateUpdated: 2024-01-11
metadataCreator [ref=‘ndc-parser-lp’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’] [ref=‘ndc-parser-lp’]:
validated: true
validationModeDetails: In order to quantify the parsability i.e. the quality that can be induced by a parser based on the annotations of the treebank; we partitioned the treebank in n folds and performed a n-fold cross validation with n=5 (given the size of the treebank):

UAS (unlabelled attachment score): 84.11
LAS (labelled attachment score): 78.43
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’] [ref=‘ndc-parser-lp’]:
creationStartDate: 2019
creationEndDate: 2024
resourceCreator [ref=‘ndc-parser’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Kåsen
givenName: Andre
affiliation:
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: Nasjonalbiblioteket
organizationShortName: NB
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: andre.kaasen@gmail.com
url: https://www.nb.no/sprakbanken/
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName: Common Language Resources and Technology Infrastructure Norway +
projectShortName: CLARINO +
projectID: 295700
url: http://clarin.b.uib.no/
fundingType: nationalFunds
funder: the Research Council of Norway
fundingCountry: Norway
projectStartDate: 2020-03-01
projectEndDate: 2023-12-31
toolInfo [ComponentId=‘clarin.eu:cr1:c_1422885449327’]:
description: The NDC parser is a dependency parser trained on the NDC Treebank. The parser is a so-called transition-based dependency parser, UUParser (https://github.com/UppsalaNLP/uuparser), developed at Uppsala University.
inputInfo [ComponentId=‘clarin.eu:cr1:c_1360931019804’]:
mediaType: text
resourceType: corpus
modalityType: spokenLanguage
languageName: Norwegian
languageName: Norwegian Bokmål
languageId: No
languageId: Nb
mimeType: txt, xml
characterEncoding: utf-8
annotationType: syntacticAnnotation-treebanks
tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
segmentationLevel: word
segmentationLevel: utterance
outputInfo [ComponentId=‘clarin.eu:cr1:c_1360931019824’]:
mediaType: text
resourceType: corpus
modalityType: spokenLanguage
languageName: Norwegian
languageName: Norwegian Bokmål
languageId: No
languageId: Nb
mimeType: txt, xml
characterEncoding: utf-8
tagset: http://www.tekstlab.uio.no/obt-ny/english/tagset.html
segmentationLevel: utterance
segmentationLevel: word