Corpus Analysis Tool
The corpus analysis tool used in our experiment was WordSmith Tools, developed by Mike Scott. WordSmith provides the following features which we felt could be of use to translators:
a) a concordancer, which finds and displays, in an easy-to-read format (e.g. KWIC — key word in context) all occurrences of a search term (and minor varia- tions thereof); b) a collocation viewer, which allows users to see which words "go together;" c) frequency operations, which provide statistical information about the central- ity of a pattern (i.e. whether it is just one author's idiosyncratic usage or an ac- cepted pattern in expert discourse).
The students had been given training on how to use the tool during their corpus linguistics class, so they were familiar with these basic features; however, they had never previously tried to apply the tool to a translation situation.
8 Meta, XLIII, 4, 1998
Table 1 An evaluation of Computer Select with regard to the criteria required for a corpus that is a balanced and representative source for specialized translation
NATIVE-LANGUAGE CORPORA AS A TRANSLATION RESOURCE 9
Table 2 The five basic types of publications found in Computer Select are classified according to the type of information they are intended to impart
Table 3 The nineteen genres of texts identified by the publishers of Computer Select are classified according to the type of information they are intended to impart
10 Meta, XLIII, 4, 1998
Table 4 Total number of words and bytes in the scanner corpus
Experiment For the purpose of this experiment, all students were translating out of their for- eign language (French) into their native language (English). Students were presented with two different extracts from an article on optical scanners. They were asked to translate one of the texts using the following resources as necessary: a selection of gen- eral bilingual dictionaries and a selection of monolingual specialized lexicographic and non-lexicographic resources.15 They were then asked to translate the second text using the following resources as necessary: a selection of general bilingual dictionaries and a specialized monolingual native-language corpus coupled with the WordSmith Tools corpus analysis tool. In an attempt to compensate for 1) any potential text-specific dif- ficulties, and 2) differences in student ability, students 1-7 were asked to translate text i using the conventional resources and text ii using the corpus, whereas students 8-14 were asked to translate text ii using conventional resources and text i using the corpus. The students were given two hours to translate each text. They were also asked to comment on the usefulness of the monolingual resources they had at their disposal (whether it be the conventional resources or the corpus).
Data Analysis The aim of the pilot study was to determine whether a specialized monolingual native-language corpus would help translators to produce improved quality transla- tions. Because our sample size was small in a number of respects (14 students, 2 texts, 1 subject field), we felt that we would not be able to make any definitive conclusions, but could only reasonably measure general trends. Therefore, translations were assessed for the following broad categories of errors: 1) comprehension errors, specifically errors resulting from a lack of comprehension of the subject field; 2) production errors including incorrect choice of term, non-idiomatic constructions, grammatical errors, and incorrect register.
3.7.1. Improved subject comprehension: an example The following example shows how the corpus has the potential to help students to acquire an increased understanding of the subject field. In text i, most of the students had difficulty grasping one of the concepts in the opening sentence of the text.
quelle que soit leur sensibilité aux nuances, leur rapidité, leur précision,...
which should logically be translated along the following lines:
Regardless of such characteristics as colour-recognition capability, speed, precision,...
NATIVE-LANGUAGE CORPORA AS A TRANSLATION RESOURCE 11
The specialized dictionaries treated the concept scanner in general, but did not discuss the colour-recognition capability of scanners. The user manual made a passing reference to "black-and-white vs. colour scanners." The desktop publishing monograph contained a half-page discussion on the difference between "non-greyscale scanners, greyscale scanners, and colour scanners," but this section did not feature specifically in the table of contents and was therefore not easy to locate (though there were index ref- erences to each of the terms individually). The journal article treated the issue of colour recognition in some depth, but this discussion was towards the end of the article and was not visibly set off from the rest of the text. In other words, the necessary informa- tion was there, but it was not easy to find. Not surprisingly, none of the students using the conventional resources came close to rendering the concept correctly; in fact, some of them proposed rather peculiar translations which showed a definite lack of subject field understanding. The corpus users, however, were able to go directly to those areas of the text that dealt with this subject. Even if students did not know the correct translation for nuances, a collocation search on the word form "sensitiv*" (i.e. the translation of sensi- ble) revealed that the following words were among those that appeared in its vicinity: colour (5), greyscale (4), shade* (122), shading (8).16Students were able to read these particular contexts and achieve a somewhat better understanding of the subject field. As shown in table 5, three of these students came quite close to expressing the correct idea by referring to shades or shading, and another student actually referred to colour. Admittedly, none of them came up with a particularly elegant rendering for that spe- cific concept, however, they did at least seem to have a better understanding of the idea that was being referred to in the source text.
Table 5 Proposed translations of the expression quelle que soit leur sensibilité aux nuances... The plus symbol (+) indicates which students came up with a rendering that seems to show a reasonable understanding of the concept
12 Meta, XLIII, 4, 1998
3.7.2. Improved term choice: an example The following example shows how the corpus has the potential to help students find and use the correct terms. In text i, the term vitre is most properly translated as either glass platen or scan bed. The specialized dictionaries provided did not contain the term glass platen, and the translations given in the general bilingual dictionaries included glass, pane of glass, and window. The term glass platen did appear in the user guide, but was not in the index and was therefore difficult to find. It did not appear (to the best of my knowledge) in the desktop publishing monograph or the journal article. None of the students using the conventional resources used either of these terms, whereas three of the students using the corpus correctly used the term glass platen, which appeared in the corpus 41 times and ranked highly as one of the collocates of glass.
Table 6 Proposed equivalents for the term vitre. The plus symbol (+) indicates which students used one of the preferred terms
A similar example occurred with the term scanner à plat, which is properly trans- lated as flatbed scanner. Of the students using the conventional resources, three trans- lated the term improperly as flat scanner, even though the term flatbed scanner appeared several times in the monograph on desktop publishing. All of the students using the corpus came up with the correct term, and student (14) even made the fol- lowing comment: "I was unsure of which spelling to use — flatbed or flat-bed — because I had seen both in the corpus. I looked up both terms in the frequency list and saw that flatbed occurred 1508 times and flat-bed only occurred 92 times, so I went with flatbed."
3.7.3. Improved idiomatic construction: an example The following example shows how the corpus has the potential to help students create more idiomatic constructions. One of the phrases appearing in text i is: photo-
NATIVE-LANGUAGE CORPORA AS A TRANSLATION RESOURCE 13
diodes sensibles à la lumière. Some of the specialized dictionaries contained the term photodiode and the desktop publishing monograph referred to light-sensitive elements (though not in the index, thereby making it difficult to locate). The journal article did not make any reference to photodiodes. I hypothesize that the majority of the students using the conventional resources verified in the dictionaries that photodiode was a term and then simply followed the syntax of the source text to produce a construction that, while grammatically correct, is not idiomatic according to the expert discourse. There were no instances in the corpus where this concept was expressed using the syntax pat- tern photodiodes sensitive to the light, and this may explain why all the students who had access to the corpus used one of the two more idiomatic constructions which appeared there: light-sensitive photodiodes (which appeared twice) or photosensitive diodes (which also appeared twice).
Table 7 Proposed translations of the expression photodiodes sensibles à la lumière. The X indicates a non-idiomatic construction
A similar example occurred with the expression la tête de numérisation du scanner, which is best translated as scan head, but could also be translated as scanning head or scanner head. Of the students using the conventional resources, four of them used an idiomatic construction, but three of them followed the French syntax and rendered the phrase as head of the scanner. There were no instances in the corpus where this concept was expressed using the syntax pattern head of the scanner, and all of the students using the corpus employed one of the idiomatic constructions.
©2015 arhivinfo.ru Все права принадлежат авторам размещенных материалов.
|