Pièges méthodologiques des corpus parallèles et comment les éviter
In this article, we present methodological problems that arise in the exploitation of parallel corpora, i.e. corpora composed of translations and their respective originals, and we suggest principles and rules for avoiding them. We discuss in turn the size and c...
Main Author: | Olga Nádvorníková |
---|---|
Format: | Article |
Language: | English |
Published: | Cercle linguistique du Centre et de l'Ouest - CerLICO, 2017-02-01 |
Series: | Corela |
Subjects: | parallel corpus, methodology, contrastive linguistics, translation, punctuation, gerund |
Online Access: | http://journals.openedition.org/corela/4810 |
id | doaj-85b7a1187e634410bd5fee897553b2b0 |
---|---|
record_format | Article |
spelling | doaj-85b7a1187e634410bd5fee897553b2b0; 2020-11-25T00:12:52Z; eng; Cercle linguistique du Centre et de l'Ouest - CerLICO; Corela; 1638-573X; 2017-02-01; 10.4000/corela.4810; Pièges méthodologiques des corpus parallèles et comment les éviter; Olga Nádvorníková; In this article, we present methodological problems that arise in the exploitation of parallel corpora, i.e. corpora composed of translations and their respective originals, and we suggest principles and rules for avoiding them. We discuss in turn the size and composition of parallel corpora (section 2), technical factors such as alignment or bilingual POS-tagging (section 3), the tricky question of the equivalence of the parallel versions (4.1), the importance of metadata about both the originals and the translations (4.2), and the particularities of the language of translation (4.3). The methodological principles are illustrated by specific studies on the parallel corpus InterCorp.; http://journals.openedition.org/corela/4810; parallel corpus; methodology; contrastive linguistics; translation; punctuation; gerund |
collection | DOAJ |
language | English |
format | Article |
sources | DOAJ |
author | Olga Nádvorníková |
spellingShingle | Olga Nádvorníková Pièges méthodologiques des corpus parallèles et comment les éviter Corela parallel corpus methodology contrastive linguistics translation punctuation gerund |
author_facet | Olga Nádvorníková |
author_sort | Olga Nádvorníková |
title | Pièges méthodologiques des corpus parallèles et comment les éviter |
title_short | Pièges méthodologiques des corpus parallèles et comment les éviter |
title_full | Pièges méthodologiques des corpus parallèles et comment les éviter |
title_fullStr | Pièges méthodologiques des corpus parallèles et comment les éviter |
title_full_unstemmed | Pièges méthodologiques des corpus parallèles et comment les éviter |
title_sort | pièges méthodologiques des corpus parallèles et comment les éviter |
publisher | Cercle linguistique du Centre et de l'Ouest - CerLICO |
series | Corela |
issn | 1638-573X |
publishDate | 2017-02-01 |
description | In this article, we present methodological problems that arise in the exploitation of parallel corpora, i.e. corpora composed of translations and their respective originals, and we suggest principles and rules for avoiding them. We discuss in turn the size and composition of parallel corpora (section 2), technical factors such as alignment or bilingual POS-tagging (section 3), the tricky question of the equivalence of the parallel versions (4.1), the importance of metadata about both the originals and the translations (4.2), and the particularities of the language of translation (4.3). The methodological principles are illustrated by specific studies on the parallel corpus InterCorp. |
topic | parallel corpus, methodology, contrastive linguistics, translation, punctuation, gerund |
url | http://journals.openedition.org/corela/4810 |
work_keys_str_mv | AT olganadvornikova piegesmethodologiquesdescorpusparallelesetcommentleseviter |
_version_ | 1725397082261422080 |