Parallel Corpus

The ACTRES Parallel Corpus (P-ACTRES 2.0) is a bidirectional English-Spanish corpus consisting of original texts in one language and their translation into the other. P-ACTRES 2.0 contains over 4 million words considering both directions together, i.e., from original English texts to their Spanish translations (the former P-ACTRES 1.0) and from original Spanish texts to their English translations. The table below shows the composition:

English Spanish Total
EN→ES
P-ACTRES 2.0 Total
ES→EN
Spanish English
396.462 421.065 817.527 Books – fiction
2.374.226
1.556.699 766.796 789.903
481.056 538.813 1.019.869 Books – nonfiction
1.048.142
28.273 15.068 13.205
115.502 137.202 252.704 Newspaper articles
252.704
169.840 174.314 344.154 Magazine articles
415.006
70.852 37.027 33.825
40.178 49.026 89.204 Miscellaneous
89.204
2.523.458 words TOTAL Corpus
4.179.282 words
1.655.824 words
P-ACTRES EN→ES

Contains slightly more than 2.5 million words distributed into five sub-corpora, comprising different text-types: books fiction, books non-fiction, newspaper articles, magazine articles and miscellaneous texts. Regarding the first two sub-categories, excerpts of around 15,000 words have been extracted from a variety of books. As for the other three sub-corpora, full articles or texts have been included.

P-ACTRES ES→EN

Is still under construction, mirroring the compilation of P-ACTRES EN→ES. At present, it contains ca. 1.7 million words belonging mainly to the sub-corpora of books-fiction, while pairs of non-fiction books are being aligned presently.

P-ACTRES 2.0
Has been compiled as a tool to carry out corpus-based contrastive studies and translation studies either independently or jointly. It has proved to be a useful tool for studies at both lexico-grammatical and rhetorical level. It is searched with a browser originally developed by Knut Hofland (University of Bergen) on the basis of CWB (Corpus Web Bench) for P-ACTRES 1.0. The browser has later been modified to house both repositories, by Hugo Sanjurjo-González (University of León) in collaboration with Knut Hofland.
P-Annual-Accounts-EN-ES
Is a specialized parallel corpus of Spanish original texts and their translations into English. It consists of approximately 2,000,000 words in English and 2,000,000 in Spanish and it includes annual reports and university manuals of marketing, macroeconomics, microeconomics and organization. This corpus provides material for studies comparing original English with translated English in a terminological, grammatical and textual level.
P-Business-News-EN-ES

Is a specialized parallel corpus of English original texts and their translations into Spanish. It consists of 49,421 words in English and 32,334 words in Spanish and includes magazine articles from The Economist published in Actualidad Económica. It is used in contrastive rhetorical studies comparing original Spanish with translated Spanish.