CMDI 1.1. Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2021-04-29
MdSelfLink:
MdProfile: clarin.eu:cr1:p_1407745711925
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘taus-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: http://www.tekstlab.uio.no/nota/taus/index.html
ResourceProxy [id=‘taus-transcriptions’]:
ResourceType: Resource
ResourceRef: http://www.tekstlab.uio.no/nota/taus/index.html
ResourceProxy [id=‘taus-corpus’]:
ResourceType: Resource
ResourceRef: https://tekstlab.uio.no/glossa2/taus3
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: transcriptions
Res1 [ref=‘taus-corpus’]:
Res2 [ref=‘taus-transcriptions’]:
IsPartOfList:
Components
corpusProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’]:
resourceType: corpus
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’] [ref=‘taus-transcriptions’]:
resourceName [xml:lang=‘nb’]: TAUS - nedlastbare transkripsjoner
resourceName [xml:lang=‘en’]: TAUS - downloadable transcriptions
description [xml:lang=‘en’]: TAUS (The spoken language investigation in Oslo) v.3 is a speech corpus with 86 speakers and 387 551 tokens. The downloadable version of the corpus contains the transcriptions, approx. 387 500 tokens, all of them orthographically transcribed. Some of the interviews are also transcribed phonetically.

The material from TAUS is based on informal interviews with people from Oslo. The interviews were made in 1971-73. The informants are mainly from two eastern districts (Vålerenga and Kampen) and a western (Frogner), and have a social background that can be considered representative with respect to education, occupation and place of adolescence. The informants fall into three groups based on age: youth (15 - 17 years), young adults (20 - 30) and adults (34 - 75).

The topics for the interviews are experiences and descriptions from childhood and adolescence. The interviews were conducted at home with an unceremoniously and informal tone, so that the linguistic style can be described as informal vernacular.

In 2006 - 2007 the TAUS-tapes from the A and B series were digitized, and all the interviews were transcribed orthographically and linked to the digital audio files. The transcriptions are now searchable via the search interface tool Glossa.

In 2014 - 2019 the tapes from the B-series were digitized and transcribed during the LIA-project.

In January 2020 TAUS v.3 was published with all available material from the A, B og C series.
description [xml:lang=‘nb’]: TAUS (Talemålsundersøkelsen i Oslo) v.3 er et talespråkskorpus med 86 talere og 387 551 tokens.

Denne nedlastbare versjoner inneholder transkripsjonene, cirka 44 300 tokens. Alle transkripsjonene er ortografisk transkribert, mange har også en talemålsnær transkripsjon.

Materialet fra (TAUS) er basert på uformelle intervjuer med folk fra Oslo, som ble gjort i 1971-73. Informantene er hovedsakelig fra to østlige bydeler (Vålerenga og Kampen) og en vestlig (Frogner), og har en sosial bakgrunn som kan anses representative med hensyn til utdanning og yrke, og oppvekstmiljø. Personene faller i tre grupper ut fra alder: ungdom (15 - 17 år), unge voksne (20 - 30) og voksne (34 - 75).

Temaene for intervjuene er opplevelser og beskrivelser fra barndom og oppvekst, og det er flere innslag av muntlige fortellinger. Samtalene har foregått hjemme hos de enkelte og i en uhøytidelig og uformell tone, slik at den språklige stilen kan betegnes som uformell dagligtale.

I 2006 - 2007 er A- og C-serien av TAUS-lydbåndene digitalisert, og alle intervjuene er transkribert ortografisk. Transkripsjonene er dessuten koplet sammen med de digitaliserte lydfilene. Hele materialet er søkbart via søkeverktøyet Glossa. Det er mulig å søke både i de originale, fonetiske TAUS-transkripsjonene og i de ortografiske. Vær oppmerksom på at noen av de originale TAUS-lydbåndene har gått tapt. Disse intervjuene mangler derfor i dette søkbare materialet. Les mer om dette under fanen Informanter.

I 2014 - 2019 er B-serien digitalisert og transkribert gjennom LIA-prosjektet.

I januar 2020 ble TAUS v.3 publisert med alt tilgjengelig materiale fra A-, B- og C-serien.
resourceShortName: TAUS
url: http://www.tekstlab.uio.no/nota/taus/index.html
PID: http://hdl.handle.net/11538/0000-0005-E7C2-B
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’] [ref=‘taus-transcriptions’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’] [ref=‘taus-transcriptions’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation: http://www.tekstlab.uio.no/nota/taus/index.html
executionLocation: http://www.tekstlab.uio.no/nota/taus/index.html
executionLocation: http://www.tekstlab.uio.no/nota/taus/english.html
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’] [ref=‘taus-transcriptions’]:
licenceFamily: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: http://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
nonStandardConditionsOfUse: The corpus has audio and video recordings classified as personal data. In agreement with NSD, the Data Protection Official in Norway, the video and audio files are accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory.

Please note that every individual researcher is responsible for treating the participants in the corpus with respect and sincerity. Furthermore, the participants must be kept anonymous in every published paper or other output.
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/english/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
contact:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’]:
metadataCreationDate: 2015-07-31
metadataLastDateUpdated: 2021-05-04
metadataCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’]:
version: Transcriptions from the third version of TAUS
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’]:
validated: true
validationType: content
validationMode: manual
validationModeDetails: The transcriptions are proof read against the audio files.
validationExtent: full
validator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’]:
documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:
role: documentation
documentUnstructured: http://www.tekstlab.uio.no/nota/taus/index.html
documentationStructured [ComponentId=‘clarin.eu:cr1:c_1361876010648’]:
role: documentation
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: book
title: Oslomål. TAUS skrift nr. 6. (Hovedrapport.)
author: E. Hanssen, Th. Hoel, E. H. Jahr, O. Rekdal, G. Wiggen.
year: 1978
documentationStructured [ComponentId=‘clarin.eu:cr1:c_1361876010648’]:
role: documentation
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: mastersThesis
title: Sosio-syntaktisk undersøking av talemålet til utvalgte grupper Oslo-ungdom.
author: Wiggen, Geirr
year: 1974
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’]:
creationStartDate: 1970-01-01
creationEndDate: 2020-01-15
resourceCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
role: Står som førsteforfatter av prosjektrapporten. TAUS var ellers et gruppearbeid.
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hanssen
givenName: Eskil
sex: male
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: Prosjektet Talemålsundersøkelsen i Oslo (1971-1976)
departmentName: Tidligere Institutt for Nordisk språk og litteratur ved UiO.
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: eskil.hanssen@iln.uio.no
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName: Talemålsundersøkelsen i Oslo
projectShortName: TAUS
fundingType: nationalFunds
funder: NAVF, Norges almennvitenskaplige forskningsråd
fundingCountry: Norge
projectStartDate: 1971-01-01
projectEndDate: 1976-12-31
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName: Digitalisering og retranskribering av TAUS
fundingType: nationalFunds
funder: Utstyrsmidler fra Humanistisk fakultet, Universitetet i Oslo
funder: Professor Didrik Arup Seips fond
fundingCountry: Norge
projectStartDate: 2006-01-01
projectEndDate: 2007-12-31
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName: LIA (Language Infrastructure made Accessible)
projectShortName: LIA
projectID: 22 59 41
url: http://tekstlab.uio.no/LIA/
url: https://www.hf.uio.no/iln/english/research/projects/language-infrastructure-made-accessible/index.html
fundingType: nationalFunds
funder: The Research Council of Norway
fundingCountry: Norway
projectStartDate: 2014-01-04
projectEndDate: 2019-12-31
corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’] [ref=‘taus-transcriptions’]:
corpusType [ref=‘taus-transcriptions’]: Written Corpus
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’] [ref=‘taus-transcriptions’]:
mediaType: text
corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:
textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’]:
mimeType: Downloadable transcriptions in txt format
sizePerTextFormat [ComponentId=‘clarin.eu:cr1:c_1447674760342’]:
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 387 551
sizeUnit: tokens
characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:
characterEncoding: Unicode
corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’] [ref=‘taus-transcriptions’]:
personSourceSetInfo [ComponentId=‘clarin.eu:cr1:c_1360931019775’]:
numberOfPersons: 86
ageOfPersons: teenager
ageOfPersons: adult
ageOfPersons: elderly
ageRangeStart: 15
ageRangeEnd: 75
sexOfPersons: mixed
originOfPersons: native
dialectAccentOfPersons: Oslo dialect: from Kampen, Vålerenga (Oslo east) and Frogner (Oslo west)
lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:
lingualityType: monolingual
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: No
languageName: Norwegian
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: Nb
languageName: Norwegian Bokmål
modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:
modalityType: spokenLanguage
modalityTypeDetails: Orthographic transcription. Some of the interviews in the A series also have the original phonetic TAUS transcription linked to the orthographic transcription. The B series transcriptions have phonetic transcriptions following the LIA guidelines together with orthographic transcriptions.
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 387 551
sizeUnit: tokens
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: speechAnnotation-orthographicTranscription
annotationType: speechAnnotation-phoneticTranscription
annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:
role: annotationManual
documentUnstructured: Orthographic transcription,cf Bokmålsordboka (Wangensteen 2004)
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nb’]: Transkripsjonsveiledning for NoTa-Oslo
author: Kristin Hagen
year: 2008
url: http://www.tekstlab.uio.no/nota/oslo/transkripsjon/NoTa-transkripsjonsveil22.pdf
annotationManualStructured [ComponentId=‘clarin.eu:cr1:c_1361876010647’]:
role: annotationManual
documentInfo [ComponentId=‘clarin.eu:cr1:c_1353678848788’]:
documentType: manual
title [xml:lang=‘nb’]: Transkripsjonsrettleiing for LIA
author: Kristin Hagen and Live Håberg and Eirik Olsen and Åshild Søfteland
year: 2018
url: http://tekstlab.uio.no/LIA/pdf/transkripsjonsrettleiing_lia.pdf
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: Transcriber (http://trans.sourceforge.net/en/presentation.php
)
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: ELAN: https://tla.mpi.nl/tools/tla-tools/elan/
(for the B series)
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: https://www.hf.uio.no/iln/english/about/organization/text-laboratory/services/oslo-transliterator/index.html
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: semi formal
unstandardisedGenre: interviews
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: informal
unstandardisedGenre: B series: Conversations between interviewer and informants. Some of them are friends, some of them are pretending to be friends as a part of the task.
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: 1971 - 1976
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: In 2006 - 2007 the TAUS-tapes were digitized, and all the interviews were transcribed orthographically and linked to the digital audio files.
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: In 2014 - 2019 the tapes from the B series were digitalized and transcribed. In 2020 the new TAUS v.3 corpus was published
geographicCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760357’]:
geographicCoverage: Oslo (Vålerenga, Kampen and Oslo. In the B series there are also some other locations in Oslo)
recordingInfo [ComponentId=‘clarin.eu:cr1:c_1426673949970’]:
recordingDeviceType: other
recordingEnvironment: other
recorderActor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hanssen
givenName: Eskil
sex: male
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: Prosjektet Talemålsundersøkelsen i Oslo (1971-1976)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: eskil.hanssen@iln.uio.no