Parallel Corpora

The ACTRES Parallel Corpus (P-ACTRES 2.0) is a bidirectional English-Spanish corpus consisting of original texts in one language and their translation into the other. P-ACTRES 2.0 contains over 6 million words considering both directions together, i.e., from original English texts to their Spanish translations (the former P-ACTRES 1.0) and from original Spanish texts to their English translations. The table below shows the composition:

English Spanish Total
EN→ES
P-ACTRES 2.0 Total
ES→EN
Spanish English
1.276.791 1.357.296 2.634.087 Books – fiction
4.191.056
1.556.969 766.796 790.173
514.786 573.523 1.088.309 Books – nonfiction
1.116.582
28.273 15.068 13.205
118.665 129.810 248.475 Newspaper articles
248.475
114.634 122.024 236.658 Magazine articles
307.510
70.852 37.027 33.825
40.178 49.026 89.204 Miscellaneous
89.204
4.296.733 words TOTAL Corpus
5.952.827 words
1.656.094 words