CMDI 1.1. Metadata
Header
MdCreator: Kristin Hagen
MdCreationDate: 2021-03-09
MdSelfLink:
MdProfile: clarin.eu:cr1:p_1407745711925
MdCollectionDisplayName: Clarino - Textlab
Resources
ResourceProxyList:
ResourceProxy [id=‘bigbrother-lp’]:
ResourceType [mimetype=‘’]: LandingPage
ResourceRef: http://www.tekstlab.uio.no/nota/bigbrother/index.html
ResourceProxy [id=‘bb-transcriptions’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: http://www.tekstlab.uio.no/nota/bigbrother/index.html
ResourceProxy [id=‘bb-corpus’]:
ResourceType [mimetype=‘’]: Resource
ResourceRef: https://tekstlab.uio.no/glossa2/bb
JournalFileProxyList:
ResourceRelationList:
ResourceRelation:
RelationType: transcriptions
Res1 [ref=‘bb-corpus’]:
Res2 [ref=‘bb-transcriptions’]:
IsPartOfList:
Components
corpusProfile:
resourceCommonInfo [ComponentId=‘clarin.eu:cr1:c_1396012485126’]:
resourceType: corpus
identificationInfo [ComponentId=‘clarin.eu:cr1:c_1396012485125’]:
resourceName [ref=‘bb-transcriptions’] [xml:lang=‘en’]: The BigBrother Corpus - downloadable transcriptions
resourceName [ref=‘bb-transcriptions’] [xml:lang=‘nb’]: BigBrother-korpuset - nedlastbare transkripsjoner
description [ref=‘bb-transcriptions’] [xml:lang=‘nb’]: BigBrother-korpuset er et talespråkskorpus som består av den første sesongen av realityserien BigBrother som ble sendt på TVNorge våren 2001. Deltakerne i BigBrother er i alderen 23-36 år og snakker ulike dialekter.

BigBrother-korpuset inneholder lyd- og videoopptak av nesten alle de 100 sendingene som ble vist på tv. Denne nedlastbare versjonen inneholder transkripsjonene, cirka 440 300 tokens. Materialet er ortografisk transkribert.

BigBrother-korpuset er et unikt talespråkskorpus der deltakerne arbeider sammen, diskuterer, argumenterer, krangler, gråter, ler, roper og elsker. I motsetning til kontrollerte talespråksinnspillinger som ofte er begrenset til intervjuer og dialog, har BigBrother-materialet samtaler om alle mulige temaer og innen ulike genre. Noen ganger er sterke følelser i sving, og dette kan tenkes å ha innvirkning på språket.
description [ref=‘bb-transcriptions’] [xml:lang=‘en’]: The BigBrother Corpus is a speech corpus with recordings from the first season of the BigBrother show, broadcast on Norwegian television by TVNorge in the first half of 2001. The participants in BigBrother speak different dialects, but most of them come from eastern Norway. They are aged 23-36 years.

The BigBrother Corpus contains audio and video recordings of almost all of the 100 broadcasts that were shown on television. The downloadable version of the corpus contains approx. 440 300 tokens, orthographically transcribed.

The BigBrother Corpus is a unique speech corpus in which the participants work together, discuss, argue, quarrel, cry, laugh, shout, make love, etc. In contrast to controlled recordings, which are often limited to interviews and dialogue, the BigBrother material has conversations about all kinds of topics and in different genres. Sometimes strong emotions are in play, which may conceivably affect the language.
resourceShortName [ref=‘bb-transcriptions’]: BigBrother - transcriptions
url [ref=‘bb-transcriptions bb-corpus’]: http://www.tekstlab.uio.no/nota/bigbrother/index.html
PID [ref=‘bb-transcriptions bb-corpus’]: http://hdl.handle.net/11538/0000-0005-E7C1-C
distributionInfo [ComponentId=‘clarin.eu:cr1:c_1396012485124’]:
licenceInfo [ComponentId=‘clarin.eu:cr1:c_1396012485158’] [ref=‘bb-transcriptions’]:
userCategory: Public
distributionAccessMedium: downloadable
downloadLocation: http://www.tekstlab.uio.no/nota/bigbrother/index.html#transkripsjon
licence [ComponentId=‘clarin.eu:cr1:c_1447674760330’]:
licenceFamily: Creative Commons (CC)
licenceName: Creative_Commons-BY-NC-SA (CC-BY-NC-SA)
licenceURL: http://creativecommons.org/licenses/by-nc-sa/4.0/
conditionsOfUse: BY
conditionsOfUse: NC
conditionsOfUse: SA
nonStandardConditionsOfUse: The corpus has audio and video recordings classified as personal data. The production company Nordic Entertainment has generously given their consent to the usage of the videos as a speech corpus, but the audio and video files are accessible only through Glossa, a search and post-processing tool developed by the Text Laboratory.

Every individual researcher is responsible for treating the participants with respect and sincerity. Furthermore, the informants in the corpora should be anonymized, e.g. by changing their names, in every published paper or other output.
licensor:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/english/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
distributionRightsHolder:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: University of Oslo
organizationName [xml:lang=‘no’]: Universitetet i Oslo
organizationShortName [xml:lang=‘no’]: UiO
organizationShortName [xml:lang=‘en’]: UoO
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies
departmentName [xml:lang=‘no’]: Institutt for lingvistiske og nordiske studier (ILN)
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/english/about/organization/text-laboratory/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
iprHolder [ref=‘bb-transcriptions bb-corpus’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: Nordic Entertainment (ipr holder of the videos)
contact [ref=‘bb-transcriptions bb-corpus’]:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName [xml:lang=‘en’]: The Text Laboratory
organizationShortName [xml:lang=‘en’]: Textlab
departmentName [xml:lang=‘en’]: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
metadataInfo [ComponentId=‘clarin.eu:cr1:c_1407745711922’] [ref=‘bb-transcriptions bb-corpus’]:
metadataCreationDate: 2015-02-24
metadataLastDateUpdated: 2021-04-06
metadataCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: person
personInfo [ComponentId=‘clarin.eu:cr1:c_1396012485192’]:
surname: Hagen
givenName: Kristin
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: kristin.hagen@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
versionInfo [ComponentId=‘clarin.eu:cr1:c_1430905751648’] [ref=‘bb-transcriptions bb-corpus’]:
version: Second version
validationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711923’] [ref=‘bb-transcriptions bb-corpus’]:
validated: true
validationType: content
validationMode: manual
validationModeDetails: The transcriptions are proofread against the audio files.
validationExtent: full
validator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
resourceDocumentationInfo [ComponentId=‘clarin.eu:cr1:c_1355150532301’] [ref=‘bb-transcriptions bb-corpus’]:
documentationUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532302’]:
role: documentation
documentUnstructured: http://www.tekstlab.uio.no/nota/bigbrother/index.html
resourceCreationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711921’] [ref=‘bb-transcriptions bb-corpus’]:
creationStartDate: 2007-08-01
creationEndDate: 2009-12-31
resourceCreator:
actorInfo [ComponentId=‘clarin.eu:cr1:c_1396012485194’]:
actorType: organization
organizationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711883’]:
organizationName: The Text Laboratory
organizationShortName: Textlab
departmentName: Department of Linguistics and Scandinavian Studies, University of Oslo
communicationInfo [ComponentId=‘clarin.eu:cr1:c_1352813745460’]:
email: tekstlab-post@iln.uio.no
url: http://www.hf.uio.no/iln/om/organisasjon/tekstlab/
address: Box 1102 Blindern
zipCode: 0317
city: OSLO
country: Norway
fundingProject:
projectInfo [ComponentId=‘clarin.eu:cr1:c_1430905751647’]:
projectName: Developing and completing language resources: The Big Brother show as a modern speech corpus
url: http://www.tekstlab.uio.no/nota/bigbrother/index.html
fundingType: nationalFunds
funder: The Research Council of Norway, the KUNSTI program (Kunnskapsutvikling for norsk språkteknologi).
fundingCountry: Norway
projectStartDate: 2007-08-31
projectEndDate: 2007-12-31
corpusInfo [ComponentId=‘clarin.eu:cr1:c_1407745711878’]:
corpusType [ref=‘bb-transcriptions’]: Written Corpus
corpusPartInfo [ComponentId=‘clarin.eu:cr1:c_1407745711885’] [ref=‘bb-transcriptions’]:
mediaType: text
corpusTextInfo [ComponentId=‘clarin.eu:cr1:c_1396012485188’]:
textFormatInfo [ComponentId=‘clarin.eu:cr1:c_1427452477072’]:
mimeType: Downloadable transcriptions in txt and html format
sizePerTextFormat [ComponentId=‘clarin.eu:cr1:c_1447674760342’]:
sizeInfo [ComponentId=‘clarin.eu:cr1:c_1353678848785’]:
size: 440 338
sizeUnit: tokens
characterEncodingInfo [ComponentId=‘clarin.eu:cr1:c_1447674760355’]:
characterEncoding: utf-8
corpusPartGeneralInfo [ComponentId=‘clarin.eu:cr1:c_1407745711882’] [ref=‘bigbrother-lp bb-transcriptions’]:
personSourceSetInfo [ComponentId=‘clarin.eu:cr1:c_1360931019775’]:
numberOfPersons: 12
ageOfPersons: adult
ageRangeStart: 23
ageRangeEnd: 36
sexOfPersons: mixed
originOfPersons: native
dialectAccentOfPersons: Some dialects represented, all of them from Southern Norway.
lingualityInfo [ComponentId=‘clarin.eu:cr1:c_1355150532313’]:
lingualityType: monolingual
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: no
languageName: Norwegian
languageInfo [ComponentId=‘clarin.eu:cr1:c_1428388179423’]:
languageId: nb
languageName: Norwegian Bokmål
modalityInfo [ComponentId=‘clarin.eu:cr1:c_1447674760356’]:
modalityType: spokenLanguage
modalityTypeDetails: Informal language from all settings in the BigBrother house.
annotationInfo [ComponentId=‘clarin.eu:cr1:c_1407745711924’]:
annotationType: speechAnnotation-orthographicTranscription
annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:
role: annotationManual
documentUnstructured: Orthographic transcription, cf. Bokmålsordboka (Wangensteen 2004)
annotationManualUnstructured [ComponentId=‘clarin.eu:cr1:c_1355150532325’]:
role: annotationManual
documentUnstructured: http://www.tekstlab.uio.no/nota/bigbrother/
annotationTool [ComponentId=‘clarin.eu:cr1:c_1355150532326’]:
targetResourceNameURI: Transcriber (http://trans.sourceforge.net/en/presentation.php)
classificationInfo [ComponentId=‘clarin.eu:cr1:c_1403588862809’]:
genreInfo [ComponentId=‘clarin.eu:cr1:c_1407745711877’]:
genreType: speechGenre
genre: informal
unstandardisedGenre: All kinds of situations in the BigBrother house. The participants prepare dinner, eat, sleep, make love, discuss, work together, etc. Lots of emotions.
timeCoverageInfo [ComponentId=‘clarin.eu:cr1:c_1447674760358’]:
timeCoverage: 2001
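
Note on ref integrity: every ref attribute in this record is expected to point at a ResourceProxy id declared in the ResourceProxyList. The following is a minimal sketch of how that could be checked, not part of the record or of any CLARIN tooling. The function name dangling_refs and the sample lists are assumptions made for illustration; the three ids are the ones declared in the ResourceProxyList above, and "bb-audio" is a hypothetical undeclared id.

```python
def dangling_refs(proxy_ids, ref_values):
    """Return referenced ids that match no declared ResourceProxy id."""
    declared = set(proxy_ids)
    missing = []
    for value in ref_values:
        # A single ref attribute may hold several space-separated ids.
        for ref in value.split():
            if ref not in declared and ref not in missing:
                missing.append(ref)
    return missing


# ResourceProxy ids declared in the ResourceProxyList of this record.
PROXY_IDS = ["bigbrother-lp", "bb-transcriptions", "bb-corpus"]

# A sample of ref attribute values of the kind used in the Components section.
REFS = [
    "bb-transcriptions",
    "bb-transcriptions bb-corpus",
    "bigbrother-lp bb-transcriptions",
]

print(dangling_refs(PROXY_IDS, REFS))

# "bb-audio" is a hypothetical id that no ResourceProxy declares.
print(dangling_refs(PROXY_IDS, ["bb-corpus bb-audio"]))
```

An empty list means every sampled ref resolves; any id returned would need either a matching ResourceProxy or a corrected spelling.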