DanDIGI

– udvikling af et korpus med dansk digitalt medieret interaktion

Forfattere

  • Philip Diderichsen
  • Torben Juel Jensen

Resumé

.

Referencer

Baumgartner, Jason, Zannettou, Savvas, Keegan, Brian, Squire, Megan & Blackburn, Jeremy (2020). The Pushshift Reddit Dataset. Proceedings of the international AAAI conference on web and social media, 14, 830-839.

Beißwenger, Michael, Ermakova, Maria, Geyken, Alexander, Lemnitzer, Lothar & Storrer, Angelika (2012). A TEI Schema for the Representation of Computermediated Communication. Journal of the Text Encoding Initiative, 3. https://doi.org/10.4000/jtei.476.

Beißwenger, Michael & Lüngen, Harald (2020). CMC-core: a schema for the representation of CMC corpora in TEI. Corpus, 20. https://doi.org/10.4000/corpus.4553.

Borin, Lars, Forsberg, Markus & Roxendal, Johan Korp – the corpus infrastructure of Språkbanken Proceedings of LREC 2012 (s. 474-478). ELRA.

Derczynski, Leon, Ciosici, Manuel R., Baglini, Rebekah, Christiansen, Morten H., Dalsgaard, Jacob Aarup, Fusaroli, Riccardo, Henrichsen, Peter Juel, Hvingelby, Rasmus, Kirkedal, Andreas, Kjeldsen, Alex Speed, Ladefoged, Claus, Nielsen, Finn Årup, Madsen, Jens, Petersen, Malte Lau, Rystrøm, Jonathan Hvithamar & Varab, Daniel (2021). The Danish Gigaword Corpus. I Simon Dobnik & Lilja Øvrelid (red.), Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa) (s. 413-421). Linköping University Electronic Press.

Diderichsen, Philip & Jensen, Torben Juel (2023). Samtaler i korpusformat: Repræsentation af talesprog i LANCHARTs korpus-infrastruktur. Nordlyd, 47(2), 77-89. https://doi.org/10.7557/12.7084.

DSL (u.å.). KorpusDK. Det Danske Sprog- og Litteraturselskab. Hentet 03.01.2025 fra https://ordnet.dk/korpusdk.

EU (2016). EUROPA-PARLAMENTETS OG RÅDETS FORORDNING (EU) 2016/679 af 27. april 2016 om beskyttelse af fysiske personer i forbindelse med behandling af personoplysninger og om fri udveksling af sådanne oplysninger og om ophævelse af direktiv 95/46/EF (generel forordning om databeskyttelse). https://eur-lex.europa.eu/legal-content/DA/TXT/?uri=CELEX:32016R0679

Frey, Jennifer-Carmen, König, Alexander, Stemle, Egon, Falaise, Achille, Fišer, Darja & Lüngen, Harald (2020). The FAIR Index of CMC Corpora. I Julien Longhi & Claudia Marinica (red.), CMC Corpora through the prism of digital humanities (s. 127-144). Paris: L’Harmattan.

Gibson, James Jerome (1977). The theory of affordances. I Robert Shaw & Bransford (red.), Perceiving, acting, and knowing: toward an ecological psychology (s. 67-82). Hillsdale, N. J.: Lawrence Erlbaum.

Hansen, Marianne Haugaard & Stæhr, Andreas Candefors (2021). Sproglige generationsforskelle på de sociale medier. NyS, Nydanske Sprogstudier, 59, 113-156. https://doi.org/10.7146/nys.v1i59.121811.

Hutchby, I. (2001). Technologies, Texts and Affordances. Sociology, 35(2), 441-446.

Jensen, Klaus Bruhn (2013). Computermedieret kommunikation. I Gunhild Agger, Agnete Nørgård Kristensen, Per Jauert & Kim Schrøder (red.), Medie- og kommunikationsleksikon. Frederiksberg: Samfundslitteratur. https://medieogkommunikationsleksikon.dk/computermedieret-kommunikation-2/.

Lo, Henry Z. & Cohen, Joseph Paul (2016). Academic Torrents: Scalable Data Distribution Neural Information Processing Systems 2015 (s. 1-2).

Lomborg, Stine (2025). Sociale medier. I Gunhild Agger, Agnete Nørgård Kristensen, Per Jauert & Kim Schrøder (red.), Medie- og kommunikationsleksikon. Frederiksberg: Samfundslitteratur. https://medieogkommunikationsleksikon.dk/sociale-medier-2/. Madsen, Lian Malai, Karrebæk, Martha Sif & Møller, Janus Spindler (red.) (2016). Everyday Languaging: Collaborative Research on the Language Use of Children and Youth. Berlin, München, Boston: De Gruyter Mouton. DOI: 10.1515/9781614514800.

Maegaard, Marie, Monka, Malene, Køhler Mortensen, Kristine & Candefors Stæhr, Andreas (2020) Standardization as Sociolinguistic Change: A Transversal Study of Three Traditional Dialect Areas. Milton: Routledge. DOI: 10.4324/9780429467486.

McEnery, Tony & Hardie, Andrew (2012) Corpus linguistics: method, theory and practice. Cambridge: Cambridge University Press.

Sprogforandringscentret (u.å.). LANCHART-korpusset. Sprogforandringscentret. Hentet 03.01.2025 fra https://dgcss.hum.ku.dk/online-ressourcer/lanchart-korpusset/.

TEI-Consortium (2025, 24.01.2025). TEI P5: Guidelines for Electronic Text Encoding and Interchange. Version 4.9.0. TEI Consortium. Hentet 01.04.2025 fra http://www.tei-c.org/Guidelines/P5/.

W3C (2008, 26.11.2008). Extensible Markup Language (XML) 1.0 (Fifth Edition).

World Wide Web Consortium. Hentet 02.01.2025 fra https://www.w3.org/TR/REC-xml/.

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., Gonzalez-Beltran, A., Gray, A. J., Groth, P., Goble, C., Grethe, J. S., Heringa, J., t Hoen, P. A., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S. J., Martone, M. E., Mons, A., Packer, A. L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S. A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M. A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J. & Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018. https://doi.org/10.1038/sdata.2016.18.

Downloads

Publiceret

18.08.2025

Citation/Eksport

Diderichsen, P., & Jensen, T. J. (2025). DanDIGI: – udvikling af et korpus med dansk digitalt medieret interaktion. Møderne Om Udforskningen Af Dansk Sprog (MUDS), (20), 125–143. Hentet fra https://tidsskrift.dk/muds/article/view/162902