Els llenguatges controlats i la seva aplicació a la traducció automàtica: 20 anys després

Autors/ores

Resum

Aquest estudi revisa l’ús de lenguatges controlats com a estratègia de preedició per millorar la qualitat de la traducció automàtica. Tracem un marc teòric del qual es desprenen variables lingüístiques que serveixen per dissenyar un caso pràctic. A partir d’una avaluació quantitativa i qualitativa, analitzem la productivitat de la preedició i proposem línies d’investigació futures.

Paraules clau

llenguatges controlats, traducció automàtica, preedició, traductibilitat, qualitat de la traducció automàtica

Referències

ASD Aerospace and Defence Industries Association of Europe (2013). Simplified Technical English. Specification ASD-STE100, v. 6.

Bohnet, Bernd (2009). Efficient Parsing of Syntactic and Semantic Dependency Structures. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task. Association for Computational Linguistics, pp. 67–72. <https://aclanthology.org/W09-1210/>. [Accessed: 20250428].

Bourland, David (1965). A linguistic note: Writing in E-prime. General Semantics Bulletin, v. 32, n. 3, pp. 111–114.

Bustos Gisbert, José M. (2017). Naturaleza sintáctica de los enunciados textuales en el discurso escrito. ELUA, v. 31, pp. 67–95. <https://doi.org/10.14198/ELUA2017.31.04>. [Accessed: 20250428].

Chomsky, Noam (1956). Three models for the description of language. IRE Transactions on Information Theory, v. 2, n. 3, pp. 113–124. <https://doi.org/10.1109/TIT.1956.1056813>. [Accessed: 20250428].

Cregan, Anne; Schwitter, Rolf; Meyer, Thomas (2007). Sydney OWL Syntax—towards a controlled natural language syntax for OWL 1.1. In: Proceedings of OWLED 2007. <https://ceur-ws.org/Vol-258/paper36.pdf>. [Accessed: 20250428].

Da Cunha, Iria (2022). Lenguaje claro y tecnología en la Administración. Granada: Comares. <https://doi.org/10.55323/edc.2023.52>. [Accessed: 20250428].

Da Cunha, Iria (2024). Inteligencia artificial y lenguaje claro en español. In: Retegui, Andrea (ed.). Lenguaje claro en Iberoamérica. Principios y prácticas. Buenos Aires: Thomson Reuters, pp. 417–444.

De Vecchi, Dardo Mario (2018). Company-speak, organisation-speak. In: Humbley, John; Budin, Gerhard; Laurén, Christer (eds.). Languages for Special Purposes: An International Handbook. Basel: De Gruyter Mouton, pp. 279–288. <https://doi.org/10.1515/9783110228014-014>. [Accessed: 20250428].

DeepL (2024). About DeepL language models. <https://support.deepl.com/hc/en-us/articles/14241705319580-About-DeepL-language-models>. [Accessed: 20250428].

Feng, Zhaopeng et al. (2024). TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement. arXiv. <https://doi.org/10.48550/arXiv.2402.16379>. [Accessed: 20251216].

Fernández Vivanco, Andrea; Bustos Gisbert, José M. (2025). Lenguajes controlados aplicados a la traducción automática: corpus y evaluación [Data set]. Zenodo. <https://doi.org/10.5281/zenodo.17521360>. [Accessed: 20250428].

Fernández Vivanco, Andrea; Sanfilippo, Vincenzo (2025). La lecturabilidad de textos escritos en lengua española: revisión crítica de métricas y propuesta de nuevas variables. ELUA, v. 44, pp. 29–56. <https://doi.org/10.14198/ELUA.29266>. [Accessed: 20250428].

Funk, Adam et al. (2007). CLOnE: Controlled language for ontology editing. In: Proceedings of ISWC 2007 + ASWC 2007, pp. 142–155. <https://doi.org/10.1007/978-3-540-76298-0_11>. [Accessed: 20251216].

García Muñoz, Óscar (2012). Lectura fácil: métodos de redacción y evaluación. Madrid: Real Patronato sobre Discapacidad. <https://www.plenainclusion.org/wp-content/uploads/2021/03/lectura-facil-metodos.pdf>. [Accessed: 20251216].

Guilou, Liane; Hardmeier, Christian (2016). PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), pp. 636–643. <https://aclanthology.org/L16-1100/>. [Accessed: 20251216].

Huijsen, Willem Olaf (1998). Controlled Language – An Introduction. In: Proceedings of CLAW 98, pp. 1–15.

Intento (2025). The State of Translation Automation 2025. <https://inten.to/the-state-of-translation-automation-2025/>. [Accessed: 20250428].

Jiménez Hurtado, Catalina; Medina Reguera, Ana (2022). Metodología de la traducción a lectura fácil: Retos de investigación. In: Castillo Bernal, María P. (ed.). Translation, Mediation and Accessibility for Linguistic Minorities. Berlin: Frank & Timme, pp. 205–222. <https://doi.org/10.26530/20.500.12657/54058>. [Accessed: 20250428].

Kittredge, Richard (2014). Sublanguages and Controlled Languages. In: Mitkov, Ruslan (ed.). The Oxford Handbook of Computational Linguistics. 2nd ed. Oxford: Oxford Academic, pp. 454–472. <https://doi.org/10.1093/oxfordhb/9780199573691.001.0001> [Accessed: 20250428].

Kittredge, Richard; Lehrberger, John (1982). Sublanguages: Studies of Language in Restricted Semantic Domains. Berlin / New York: De Gruyter. <https://doi.org/10.1515/9783110844818>. [Accessed: 20250428].

Kuhn, Tobias (2014). A Survey and Classification of Controlled Natural Languages. Computational Linguistics, v. 40, n. 1, pp. 121–171. <https://doi.org/10.1162/COLI_a_00168>. [Accessed: 20250428].

Kuhn, Tobias B.; Barbano, Paolo E.; Nagy, Mate Levente.; Krauthammer, Michael (2013). Broadening the scope of nanopublications. In: Proceedings of ESWC 2013, pp. 487–501. <https://doi.org/10.1007/978-3-642-38288-8_33>. [Accessed: 20250428].

Lohar, Pintu; Xie, Guangyi; Gallagher, Daniel; Way, Andy (2023). Building Neural Machine Translation Systems for Multilingual Participatory Spaces. Analytics, v. 2, n. 2, pp. 393–409. <https://doi.org/10.3390/analytics2020022>. [Accessed: 20250428].

Montolío Durán, Estrella (2023). Guía de redacción judicial clara. Madrid: Ministerio de la Presidencia, Justicia y Relaciones con las Cortes.

Núñez Ladevéze, Luis (1993). Teoría y práctica de la construcción del texto. Barcelona: Ariel.

Nyberg, Eric; Mitamura, Teruko; Huijsen, Willem Olaf (2003). Controlled language for authoring and translation. In: Sommers, Harold (ed.). Computers and Translation: A Translator’s Guide. Amsterdam: John Benjamins, pp. 245–281. <https://doi.org/10.1075/btl.35.17nyb>. [Accessed: 20250428].

O’Brien, Sharon (2003). Controlling controlled English. In: EAMT Workshop: Improving MT through Other Language Technology Tools. Budapest: European Association for Machine Translation. <https://aclanthology.org/2003.eamt-1.12/>. [Accessed: 20251216].

O’Brien, Sharon (2006). Eye-tracking and translation memory matches. Perspectives: Studies in Translation Theory and Practice, v. 14, n. 3, pp. 185–205. <https://doi.org/10.1080/09076760708669037>. [Accessed: 20250428].

O’Brien, Sharon (2010). Controlled Language and Readability. In: Shreve, Gregory; Angelone, Erik (eds.). Translation and Cognition. Amsterdam: John Benjamins, pp. 143–165. <https://doi.org/10.1075/ata.xv>. [Accessed: 20251216].

O’Brien, Sharon (2019). Controlled Language and Writing for an International Audience. In: Maylath, Bruce; St.Amant, Kirk (eds.). Translation and Localization: A Guide for Technical and Professional Communicators. New York: Routledge, pp. 95–119. <https://doi.org/10.1080/10572252.2021.1915064>. [Accessed: 20251216].

Pease, Adam; Li, John (2010). Controlled English to logic translation. In: Poli, Roberto; Healy, Michael; Kameas, Achilles (eds.). Theory and Applications of Ontology: Computer Applications. Amsterdam: Springer, pp. 245–258. <https://doi.org/10.1007/978-90-481-8847-5_11>. [Accessed: 20250428].

Pellicer, Lluis (2024). España se enfrenta a otra dana solo dos semanas después de la catástrofe de Valencia. El País. <https://elpais.com/espana/2024-11-12/espana-se-prepara-para-otra-dana-solo-dos-semanas-despues-de-la-catastrofe-de-valencia.html>. [Accessed: 20240912].

Pistola Grille, Sara; Viñuales-Ferreiro, Susana (2021). Una clasificación actualizada de los géneros textuales de la Administración pública española. Revista de Llengua i Dret, v. 75, pp. 181–203. <https://doi.org/10.2436/rld.i75.2021.3587>. [Accessed: 20250428].

Pratt-Hartmann, Ian (2003). A two-variable fragment of English. Journal of Logic, Language and Information, pp. 13–45. <https://doi.org/10.1023/A:1021149027971>. [Accessed: 20250428].

Ramírez Polo, Laura (2012a). Los lenguajes controlados y la documentación técnica: mejorando la traducibilidad. Tradumàtica, v. 10, pp. 192–204. <https://doi.org/10.5565/rev/tradumatica.25>. [Accessed: 20250428].

Ramírez Polo, Laura (2012b). Use and Evaluation of Controlled Languages in Industrial Environments and Feasibility Study for the Implementation of Machine Translation [Tesis doctoral]. Universitat de València.

Real Academia Española (2024). Guía panhispánica de lenguaje claro y accesible. Madrid: Espasa.

Roturier, Johann (2006). An Investigation into the Impact of Controlled English Rules on the Comprehensibility, Usefulness and Acceptability of Machine-Translated Technical Documentation for French and German Users [Tesis doctoral]. Dublin City University.

Ruiz Cascales, Remedios; Sutcliffe, Richard F. (2003). A specification and validating parser for simplified technical Spanish. In: EAMT Workshop: Improving MT through Other Language Technology Tools. Budapest: European Association for Machine Translation. <https://aclanthology.org/2003.eamt-1.15/>. [Accessed: 20251216].

Saggion, Horacio et al. (2015). Making It Simplext: Implementation and Evaluation of a Text Simplification System for Spanish. ACM Transactions on Accessible Computing, v. 6, n. 7, pp. 1–36. <https://doi.org/10.1145/2738046>. [Accessed: 20250428].

Sánchez-Gijón, Pilar; Kenny, Dorothy (2022). Selecting and preparing texts for machine translation: Pre-editing and writing for a global audience. In: Kenny, Dorothy (ed.). Machine Translation for Everyone. Berlin: Language Science Press, pp. 81–103.

Schieferdecker, Ina (2024). Next-Gen Software Engineering. Big Models for AI-Augmented Model-Driven Software Engineering. arXiv. <https://doi.org/10.48550/arXiv.2409.18048>. [Accessed: 20250428].

Schwitter, Rolf (2002). English as a formal specification language. In: Proceedings of DEXA ’02, pp. 228–232. <https://doi.org/10.1109/DEXA.2002.1045903>. [Accessed: 20250428].

Seibel, Claudia; Carlucci, Laura (2021). El lenguaje controlado: punto de partida hacia la Lectura fácil. HIKMA, v. 20, n. 2, pp. 331–357. <https://doi.org/10.21071/hikma.v20i2.13394>. [Accessed: 20250428].

Sowa, John (2000). Ontology, metadata, and semiotics. In: Proceedings of ICCS 2000, pp. 55–81. <https://doi.org/10.1007/10722280_5>. [Accessed: 20250428].

Temnikova, Irina (2012). Text Complexity and Text Simplification [Tesis doctoral]. University of Wolverhampton.

Williams, Christopher (2023). The Impact of Plain Language on Legal English in the United Kingdom. New York: Routledge.

Wu, Yonghui et al. (2016). Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv. <https://doi.org/10.48550/arXiv.1609.08144>. [Accessed: 20250428].

Publicades

22-12-2025

Com citar

(1)
Fernández Vivanco, A.; Bustos Gisbert, J. M. Els Llenguatges Controlats I La Seva Aplicació a La Traducció automàtica: 20 Anys després. tradumatica 2025, 36-64.

Descàrregues

Les dades de descàrrega encara no estan disponibles.

Funding data

  • Junta de Castilla y León
  • European Social Fund Plus
    Grant numbers Ayudas destinadas a financiar la contratación predoctoral de personal investigador (cofinanciado por la Junta de Castilla y León y el Fondo Social Europeo Plus, Orden EDU/1009/2024)
  • European Regional Development Fund
    Grant numbers COMPLETEXT: Complejidad textual y lecturabilidad. Estudio aplicado a la IA generativa y la didáctica de las lenguas (proyecto financiado por la JCyL, FEDER-UE, SA067P24)