Natural Language Identification using Corpus-Based Models

Authors

  • Clive Souter et al.

DOI:

https://doi.org/10.7146/hjlcb.v7i13.25083

Abstract

This paper describes three approaches to the task of automatically identifying the language a text is written in. We conducted experiments to compare the success of each approach in identifying languages from a set of texts in Dutch/Friesian, English, French, Gaelic (Irish), German, Italian, Portuguese, Serbo-Croat and Spanish.....

Downloads

Published

2017-01-04

How to Cite

Souter et al., C. (2017). Natural Language Identification using Corpus-Based Models. HERMES - Journal of Language and Communication in Business, 7(13), 183–203. https://doi.org/10.7146/hjlcb.v7i13.25083

Issue

Section

Thematic Articles