Introduction
Russian-English Computer Security Parallel Text was developed by The
MITRE Corporation. It consists of parallel sentences from a set of computer
security reports published in Russian and translated into English by translators
with particular expertise in the technical area. Translators were instructed
to err on the side of literal translation if required, but to maintain the technical
writing style of the source and to make the resulting English as natural as
possible. The translators followed specific guidelines for translation, and
those are included in this distribution.
Data
There are 6,276 lines of parallel Russian and English, with a total of 60,059
words of Russian and 76,437 words of English, presented in a separate UTF-8
plain text file for each language. The sentences were translated in sequential
order and presented in a scrambled order, such that parallel sentences at identical
line numbers are translations. For example, the 31st line of the English file
is a translation of the 31st line of the Russian file. The original line sequence
is not provided. 1,694 untranslated lines (such as code snippets) are included
as a separate file
Samples
For a sample of the data please view this
English text sample and
Russian text sample. Here is the Russian sample as an image.
Updates
None at this time.
Content Copyright
Portions © 2012 The MITRE Corporation, © 2012 Trustees of the University of
Pennsylvania
|