Ítem
Acceso Abierto

Aprendizaje por refuerzo de un parser semántico óptimo en DRT
dc.contributor.advisor | Andrade Lotero, Édgar José | |
dc.creator | Piza Londoño, Jessenia | |
dc.creator.degree | Magíster en Matemáticas Aplicadas y Ciencias de la Computación | |
dc.creator.degreetype | Full time | |
dc.date.accessioned | 2024-08-13T20:32:42Z | |
dc.date.available | 2024-08-13T20:32:42Z | |
dc.date.created | 2024-08-12 | |
dc.description | Este documento se trata del procesamiento de lenguaje natural (NLP, por sus siglas en inglés), que se enfoca en desarrollar sistemas de comunicación efectivos entre computadoras y humanos. Aunque los mayores avances en esta área se han logrado mediante grandes modelos de lenguaje (LLMs, por sus siglas en inglés), estos suelen ser imprecisos en dominios regidos por reglas, como las relaciones espaciales o las normas legales. Para abordar estos dominios, se utilizan parsers semánticos que asignan representaciones lógicas a los textos a través del análisis de su estructura sintáctica y la interpretación semántica. Sin embargo, estos parsers son complejos y su diseño es complicado debido a la implementación manual de reglas específicas. Este estudio propone un enfoque innovador que utiliza el aprendizaje por refuerzo profundo para desarrollar un parser semántico que pueda aprender y adaptarse automáticamente. El agente, a través de recompensas, optimizará su comportamiento con el tiempo, lo que podría tener un impacto significativo en el avance del procesamiento de lenguaje natural. | |
dc.description.abstract | This document is about natural language processing (NLP), which focuses on developing effective communication systems between computers and humans. While the most significant advances in this area have been achieved through large language models (LLMs), these models often lack precision in rule-governed domains, such as spatial relations or legal norms. To address these domains, semantic parsers are used to assign logical representations to texts by analyzing their syntactic structure and semantic interpretation. However, these parsers are complex, and their design is challenging due to the manual implementation of specific rules. This study proposes an innovative approach using deep reinforcement learning to develop a semantic parser that can learn and adapt automatically. Through rewards, the agent will optimize its behavior over time, which could have a significant impact on the advancement of natural language processing. | |
dc.format.extent | 44 PP | |
dc.format.mimetype | application/pdf | |
dc.identifier.doi | https://doi.org/10.48713/10336_43268 | |
dc.identifier.uri | https://repository.urosario.edu.co/handle/10336/43268 | |
dc.language.iso | spa | |
dc.publisher | Universidad del Rosario | spa |
dc.publisher.department | Escuela de Ingeniería, Ciencia y Tecnología | spa |
dc.publisher.program | Maestría en Matemáticas Aplicadas y Ciencias de la Computación | spa |
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International | * |
dc.rights.accesRights | info:eu-repo/semantics/openAccess | |
dc.rights.acceso | Abierto (Texto Completo) | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | * |
dc.source.bibliographicCitation | Eisenstein, Jacob (2019) Introduction to Natural Language Processing. : MIT Press; | |
dc.source.bibliographicCitation | Davidson, Donald (2001) Essays on Actions and Events: Philosophical Essays Volume 1. Oxford, GB: Clarendon Press; | |
dc.source.bibliographicCitation | Kamp, H; Reyle, U (1993) From Discourse to Logic. Dordrecht: Kluwer; | |
dc.source.bibliographicCitation | Geurts, Bart; Beaver, David I; Maier, Emar (2020) Discourse Representation Theory. : Metaphysics Research Lab, Stanford University; Disponible en: https://plato.stanford.edu/archives/spr2020/entries/discourse-representation-theory/. | |
dc.source.bibliographicCitation | van Noord, Rik; Abzianidze, Lasha; Toral, Antonio; Bos, Johan (2018) Exploring Neural Methods for Parsing Discourse Representation Structures. En: Transactions of the Association for Computational Linguistics. Vol. 6; pp. 619-633 Cambridge, MA: MIT Press; Disponible en: https://aclanthology.org/Q18-1043; http://dx.doi.org/10.1162/tacl_a_00241. Disponible en: 10.1162/tacl_a_00241. | |
dc.source.bibliographicCitation | Bos, Johan (2008) Wide-Coverage Semantic Analysis with Boxer. En: Semantics in Text Processing. STEP 2008 Conference Proceedings. pp. 277-286 : College Publications; Disponible en: https://aclanthology.org/W08-2222. | |
dc.source.bibliographicCitation | van Lambalgen, Michiel; Hamm, Fritz (2008) The Proper Treatment of Events. En: Explorations in Semantics.: Wiley; 9780470759226; | |
dc.source.bibliographicCitation | Andrade-Lotero, Edgar (2006) Meaning and Form in Event Calculus. MSc. Thesis. : ILLC, Universiteit van Amsterdam; | |
dc.source.bibliographicCitation | Jurafsky, Daniel; Martin, James (2008) Speech and Language Processing. : Prentice Hall; | |
dc.source.bibliographicCitation | van Harmelen, Frank; Lifschitz, Vladimir; Porter, Bruce (2008) Handbook of Knowledge Representation. Amsterdam: Elsevier; | |
dc.source.bibliographicCitation | Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, Lukasz; Polosukhin, Illia (2023) Attention Is All You Need. En: arXiv [cs.CL]. Disponible en: http://arxiv.org/abs/1706.03762. | |
dc.source.bibliographicCitation | Traylor, Aaron; Feiman, Roman; Pavlick, Ellie; Zong, Chengqing; Xia, Fei; Li, Wenjie; Navigli, Roberto (2021) AND does not mean OR: Using Formal Languages to Study Language Models'. En: Proceedings of the 59th Annual Meeting of the Association for. pp. 158-167 : Association for Computational Linguistics; | |
dc.source.bibliographicCitation | Kassner, Nora; Krojer, Benno; Schütze, Hinrich; Fernández, Raquel; Linzen, Tal (2020) Are Pretrained Language Models Symbolic Reasoners over Knowledge?. En: Proceedings of the 24th Conference on Computational Natural Language. pp. 552-564 : Association for Computational Linguistics; | |
dc.source.bibliographicCitation | Basmov, Victoria; Goldberg, Yoav; Tsarfaty, Reut (2024) Simple Linguistic Inferences of Large Language Models (LLMs): Blind Spots. En: arXiv [cs.CL]. Disponible en: http://arxiv.org/abs/2305.14785. | |
dc.source.bibliographicCitation | Minsky, Marvin; Winston, P H (1975) A framework for representing knowledge. En: The Psychology of Computer Vision.: McGraw-Hill; | |
dc.source.bibliographicCitation | McDermott, D V (1987) A critique of pure reason. En: Computational Intelligence. Vol. 3; pp. 151-160 | |
dc.source.bibliographicCitation | Gamut, L T F (1991) Logic, Language and Meaning Vol. 2. : University of Chicago Press; | |
dc.source.bibliographicCitation | Mueller, Erik T (2006) Common Sense Reasoning. : Elsevier; | |
dc.source.bibliographicCitation | Sutton, Richard S; Barto, Andrew G (2018) Reinforcement Learning. : MIT Press; | |
dc.source.bibliographicCitation | Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Graves, Alex; Antonoglou, Ioannis; Wierstra, Daan; Riedmiller, Martin A (2013) Playing Atari with Deep Reinforcement Learning. En: CoRR. Vol. abs/1312.5602; | |
dc.source.bibliographicCitation | Abzianidze, Lasha; Bjerva, Johannes; Evang, Kilian; Haagsma, Hessel; van Noord, Rik; Ludmann, Pierre; Nguyen, Duc-Duy; Bos, Johan (2017) The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations. En: Proceedings of the 15th Conference of the European Chapter of the. pp. 242-247 : Association for Computational Linguistics; Disponible en: https://aclanthology.org/E17-2039. | |
dc.source.bibliographicCitation | Zai, Alexander; Brown, Brandon (2020) Deep Reinforcement Learning in Action. : Manning Publications; | |
dc.source.bibliographicCitation | Ozdemir, Sinan (2023) Quick Start Guide to Large Language Models: Strategies and Best Practices. : Addison-Wesley Professional; | |
dc.source.bibliographicCitation | Montague, Richard (1974) Formal Philosophy: Selected Papers of Richard Montague. : New Haven: Yale University Press; | |
dc.source.bibliographicCitation | Hugging Face (2024) sentence-transformers/distiluse-base-multilingual-cased-v1. Disponible en: https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v1. | |
dc.source.bibliographicCitation | Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Others, (2015) Human-level control through deep reinforcement learning. En: Nature. Vol. 518; No. 7540; pp. 529-533 : Nature Publishing Group; | |
dc.source.bibliographicCitation | Binz, Marcel; Schulz, Eric (2023) Using cognitive psychology to understand GPT-3. En: Proceedings of the National Academy of Sciences. Vol. 120; No. 6; Proceedings of the National Academy of Sciences; 1091-6490; Disponible en: http://dx.doi.org/10.1073/pnas.2218523120. Disponible en: 10.1073/pnas.2218523120. | |
dc.source.bibliographicCitation | Uc-Cetina, Víctor; Navarro-Guerrero, Nicolás; Martin-Gonzalez, Anabel; Weber, Cornelius; Wermter, Stefan (2022) Survey on reinforcement learning for language processing. En: Artificial Intelligence Review. Vol. 56; No. 2; pp. 1543–1575 : Springer Science and Business Media LLC; 1573-7462; Disponible en: http://dx.doi.org/10.1007/s10462-022-10205-5. Disponible en: 10.1007/s10462-022-10205-5. | |
dc.source.bibliographicCitation | Schulman, John; Wolski, Filip; Dhariwal, Prafulla; Radford, Alec; Klimov, Oleg (2017) Proximal Policy Optimization Algorithms. En: arXiv [cs.LG]. Disponible en: http://arxiv.org/abs/1707.06347. | |
dc.source.bibliographicCitation | Starc, Janez; Mladenić, Dunja (2016) Joint learning of ontology and semantic parser from text. En: arXiv [cs.AI]. Disponible en: http://arxiv.org/abs/1601.00901. | |
dc.source.bibliographicCitation | Yang, Zhilin; Qi, Peng; Zhang, Saizheng; Bengio, Yoshua; Cohen, William W; Salakhutdinov, Ruslan; Manning, Christopher D (2018) HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering. En: arXiv [cs.CL]. Disponible en: http://arxiv.org/abs/1809.09600. | |
dc.source.bibliographicCitation | Geva, Mor; Khashabi, Daniel; Segal, Elad; Khot, Tushar; Roth, Dan; Berant, Jonathan (2021) Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit. En: Transactions of the Association for Computational Linguistics. Vol. 9; pp. 346-361 Disponible en: https://api.semanticscholar.org/CorpusID:230799347. | |
dc.source.bibliographicCitation | Ho, Xanh; Duong Nguyen, Anh-Khoa; Sugawara, Saku; Aizawa, Akiko; Scott, Donia; Bel, Nuria; Zong, Chengqing (2020) Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of. En: Proceedings of the 28th International Conference on Computational. pp. 6609-6625 : International Committee on Computational Linguistics; Disponible en: https://aclanthology.org/2020.coling-main.580; http://dx.doi.org/10.18653/v1/2020.coling-main.580. Disponible en: 10.18653/v1/2020.coling-main.580. | |
dc.source.instname | instname:Universidad del Rosario | |
dc.source.reponame | reponame:Repositorio Institucional EdocUR | spa |
dc.subject | Representación formal del lenguaje | |
dc.subject | Razonamiento automático | |
dc.subject | Inferencia Lógica | |
dc.subject | Teoría de la Representación del Discurso | |
dc.subject | Procesamiento de Lenguaje Natural | |
dc.subject.keyword | Formal representation of language | |
dc.subject.keyword | Automatic reasoning | |
dc.subject.keyword | Logical inference | |
dc.subject.keyword | Discourse representation theory | |
dc.subject.keyword | Natural language processing | |
dc.title | Aprendizaje por refuerzo de un parser semántico óptimo en DRT | |
dc.title.TranslatedTitle | Reinforcement Learning of an Optimal Semantic Parser in DRT | |
dc.type | bachelorThesis | |
dc.type.document | Trabajo de grado | |
dc.type.spa | Trabajo de grado | |
local.department.report | Escuela de Ingeniería, Ciencia y Tecnología | |
local.regiones | Bogotá |