Pièges méthodologiques des corpus parallèles et comment les éviter

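In this article, we present methodological problems in the use of parallel corpora, i.e. corpora composed of translations and their respective originals, and propose principles and rules that help to avoid them. We address in turn the size and composition of parallel corpora (section 2), technical factors such as alignment or bilingual POS-tagging (section 3), the tricky question of the equivalence of the parallel versions (4.1), the importance of metadata about both the originals and the translations (4.2), and the particularities of the language of translation (4.3). The methodological principles are illustrated by specific studies on the parallel corpus InterCorp.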

Bibliographic Details
Main Author: Olga Nádvorníková
Format: Article
Language: English
Published: Cercle linguistique du Centre et de l'Ouest - CerLICO 2017-02-01
Series: Corela
Subjects: parallel corpus, methodology, contrastive linguistics, translation, punctuation, gerund
Online Access: http://journals.openedition.org/corela/4810
id doaj-85b7a1187e634410bd5fee897553b2b0
record_format Article
spelling doaj-85b7a1187e634410bd5fee897553b2b0; 2020-11-25T00:12:52Z; eng; Cercle linguistique du Centre et de l'Ouest - CerLICO; Corela; 1638-573X; 2017-02-01; 10.4000/corela.4810; Pièges méthodologiques des corpus parallèles et comment les éviter; Olga Nádvorníková; http://journals.openedition.org/corela/4810; parallel corpus; methodology; contrastive linguistics; translation; punctuation; gerund
collection DOAJ
language English
format Article
sources DOAJ
author Olga Nádvorníková
spellingShingle Olga Nádvorníková
Pièges méthodologiques des corpus parallèles et comment les éviter
Corela
parallel corpus
methodology
contrastive linguistics
translation
punctuation
gerund
author_facet Olga Nádvorníková
author_sort Olga Nádvorníková
title Pièges méthodologiques des corpus parallèles et comment les éviter
title_short Pièges méthodologiques des corpus parallèles et comment les éviter
title_full Pièges méthodologiques des corpus parallèles et comment les éviter
title_fullStr Pièges méthodologiques des corpus parallèles et comment les éviter
title_full_unstemmed Pièges méthodologiques des corpus parallèles et comment les éviter
title_sort pièges méthodologiques des corpus parallèles et comment les éviter
publisher Cercle linguistique du Centre et de l'Ouest - CerLICO
series Corela
issn 1638-573X
publishDate 2017-02-01
description In this article, we present methodological problems in the use of parallel corpora, i.e. corpora composed of translations and their respective originals, and propose principles and rules that help to avoid them. We address in turn the size and composition of parallel corpora (section 2), technical factors such as alignment or bilingual POS-tagging (section 3), the tricky question of the equivalence of the parallel versions (4.1), the importance of metadata about both the originals and the translations (4.2), and the particularities of the language of translation (4.3). The methodological principles are illustrated by specific studies on the parallel corpus InterCorp.
topic parallel corpus
methodology
contrastive linguistics
translation
punctuation
gerund
url http://journals.openedition.org/corela/4810
work_keys_str_mv AT olganadvornikova piegesmethodologiquesdescorpusparallelesetcommentleseviter
_version_ 1725397082261422080