Comparación de modelos de aprendizaje automático para la predicción de células cancerígenas a partir del complejo MHC I

Navas Luquez, Mateo

doi:https://doi.org/10.48713/10336_24401

Ítem

Acceso Abierto

Comparación de modelos de aprendizaje automático para la predicción de células cancerígenas a partir del complejo MHC I

Mostrar el registro sencillo de la publicación

dc.contributor.advisor	Orjuela Cañón, Alvaro David
dc.creator	Navas Luquez, Mateo
dc.creator.degree	Ingeniero Biomédico	spa
dc.creator.degreetype	Full time	spa
dc.date.accessioned	2020-05-27T21:38:23Z
dc.date.available	2020-05-27T21:38:23Z
dc.date.created	2020-05-22
dc.description	El presente trabajo propone una comparación de modelos de aprendizaje automático para la detección de células cancerígenas a partir de los antígenos del complejo MHC I. Utilizando protocolos de extracción de características físico-químicas de las proteínas y un proceso comparativo de las medidas de desempeño en la fase de validación y prueba de los modelos. Con este procedimiento se pretende determinar cuál modelo de aprendizaje automático presenta el mejor desempeño en la predicción de antígenos cancerígenos, utilizando propiedades fisicoquímicas como marcadores de entrada.	spa
dc.description.abstract	The present work proposes a comparison of machine learning models for the detection of cancer cells from the MHC I complex antigens. Using protocols for the extraction of physical-chemical characteristics of proteins and a comparative process of performance measurements in the model validation and testing phase. This procedure aims to determine the machine learning model presenting the best performance in the prediction of carcinogenic antigens, using physicochemical properties as input markers.	spa
dc.format.mimetype	application/pdf
dc.identifier.doi	https://doi.org/10.48713/10336_24401
dc.identifier.uri	https://repository.urosario.edu.co/handle/10336/24401
dc.language.iso	spa	spa
dc.publisher	Universidad del Rosario	spa
dc.publisher.department	Escuela de Medicina y Ciencias de la Salud	spa
dc.publisher.program	Ingeniería Biomédica	spa
dc.rights	Atribución 2.5 Colombia	spa
dc.rights.accesRights	info:eu-repo/semantics/openAccess
dc.rights.acceso	Abierto (Texto Completo)	spa
dc.rights.licencia	EL AUTOR, manifiesta que la obra objeto de la presente autorización es original y la realizó sin violar o usurpar derechos de autor de terceros, por lo tanto la obra es de exclusiva autoría y tiene la titularidad sobre la misma.	spa
dc.rights.uri	http://creativecommons.org/licenses/by/2.5/co/
dc.source.bibliographicCitation	OMS, «Cáncer,» 12 septiembre 2018. [En línea]. Available: https://www.who.int/es/news-room/fact-sheets/detail/cancerH	spa
dc.source.bibliographicCitation	ASCO, «American Society of Clinical Oncology,» Journal of Clinical Oncology, pp. 212-222, 20 Febrero 2017	spa
dc.source.bibliographicCitation	DANE, «Boletín Técnico,» 20 Diciembre 2019. [En línea]. Available: https://www.dane.gov.co/files/investigaciones/poblacion/bt_estadisticasvitales_IIItr im_2019pr-20-diciembre-2019.pdf	spa
dc.source.bibliographicCitation	MinSalud, «PLAN DECENAL PARA EL CONTROL EN COLOMBIA,» 17 Marzo 2012. [En línea]. Available: https://www.minsalud.gov.co/sites/rid/Lists/BibliotecaDigital/RIDE/IA/INCA/plannacional-control-cancer.pdf	spa
dc.source.bibliographicCitation	MinSalud, «Cáncer,» Enero 25 2020. [En línea]. Available: https://www.minsalud.gov.co/salud/publica/PENT/Paginas/Prevenciondelcancer.aspx	spa
dc.source.bibliographicCitation	A. L. L. T. G. Óscar, «Costos directos de la atención del cáncer,» Cancerol, vol. XX, nº 2, pp. 52-60, 2016	spa
dc.source.bibliographicCitation	J. Greening, «The peptidome comes of age: Mass spectrometry-based characterization of the circulating cancer peptidome,» Enzymes, vol. 42, pp. 27-64, 2017	spa
dc.source.bibliographicCitation	S. Lou, «Automated detection of radiology reports that require follow-up imaging using natural language processing feature engineering and machine learning classification,» Journal of Digital Imaging, vol. 33, nº 1, pp. 131-136, 2020	spa
dc.source.bibliographicCitation	G. M. H. R. E. &. W. N. Cooper, La célula: Geoffrey M. Cooper y Robert E. Hausman, Madrid: Madrid: Marbán, 2014	spa
dc.source.bibliographicCitation	B. S. D. Lodish, Biología Celular y Molecular, 5 ed., Madrid: Panamericana, 2016, pp. 590-630	spa
dc.source.bibliographicCitation	S. I. Fox, FISIOLOGÍA HUMANA, vol. XII, New York: Pierce College, 2011, pp. 50- 90	spa
dc.source.bibliographicCitation	V. M. Saikumar P., Apoptosis and Cell Death. In: Allen T., vol. II, Boston: Molecular Pathology Library, 2009	spa
dc.source.bibliographicCitation	C. W. K. Murphy, ImmunBiology, 9 ed., vol. I, Bostom: Garlad Science, 2017, pp. 3-35	spa
dc.source.bibliographicCitation	W. R. Hanahan D, «Review: Hallmarks of Cancer,» 11 Junio 2011. [En línea]. Available: http://ez.urosario.edu.co/login?url=http://search.ebscohost.com/login.aspx?direct= true&db=ed selp&AN=S0092867411001279&lang=es&site=eds-live&scope=site	spa
dc.source.bibliographicCitation	S. Gortzak-Uzan, «A Proteome Resource of Ovarian Cancer Ascites: Integrated Proteomic and Bioinformatic Analyses To Identify Putative Biomarkers,» Journal of Proteome Research, vol. VII, nº 1, p. 339–351, 2013	spa
dc.source.bibliographicCitation	H. Mattsson, «Improved pan-specific prediction of MHC class I peptide binding using a novel receptor clustering data partitioning strategy,» HLA Inmune Response Genetics, pp. 1-6, 2016	spa
dc.source.bibliographicCitation	R. J. A. N. M. N. H. V. I. Jurtz, «An introduction to Deep learning on biological sequence data – Examples and solutions,» Oxford University Press, 2017	spa
dc.source.bibliographicCitation	J. &. V. A. &. C. S. &. D. N. Gauthier, «A Brief History of Bioinformatics. Briefings in Bioinformatics,» pp. 11-34, 2018	spa
dc.source.bibliographicCitation	B. C. R. S. C. B. J. G. I. I. Pedro Larrañaga, «Machine learning in bioinformatics,» p. 86–112, 2006	spa
dc.source.bibliographicCitation	S. R. J. Y. H. J. L. H. J. L. E. J. L. Kim, «Bioinformatic and metabolomic analysis reveals miR-155 regulates thiamine level in breast cancer,» Cancer Letters, vol. II, nº 357, p. 488–497, 2015	spa
dc.source.bibliographicCitation	G. B. V. C. D. F. &. S. S. Musumarra, «A Bioinformatic Approach to the Identification of Candidate Genes for the Development of New Cancer Diagnostics,» Biological Chemistry, vol. II, nº 384, pp. 391-398, 2004	spa
dc.source.bibliographicCitation	C. G. B. T. Y. C. S. Zhou. J, «Genetic and bioinformatic analyses of the expression and function of PI3K regulatory subunit PIK3R3 in an Asian patient gastric cancer library,» Medical Genomic, vol. V, nº 1, pp. 5-12, 2012	spa
dc.source.bibliographicCitation	J. F. H. B. L. &. F. J. G. Beltrán Lissabet, «TTAgP 1.0: A computational tool for the specific prediction of tumor T cell antigens,» Computational Biology and Chemistry, nº 83, 2019	spa
dc.source.bibliographicCitation	D. Chicco, «Ten quick tips for machine learning in computational biology,» BioMed Central, vol. I, nº 10, pp. 1-17, 2017	spa
dc.source.bibliographicCitation	C. Bishop, PATTERN RECOGNITION AND MACHINE LEARNING, Primera ed., 2006, pp. 5-15	spa
dc.source.bibliographicCitation	O. Theobald, Machine Learning for Absolute Beginners, vol. 2, Independently Published, 2018, pp. 10-12	spa
dc.source.bibliographicCitation	A. M. D. A. A. L. Swan, «Application of Machine Learning to Proteomics Data: Classification and Biomarker Identification in Postgenomics Biology,» OMICS : a Journal of Integrative Biology, vol. XVII, nº 12, pp. 595-610, 2010	spa
dc.source.bibliographicCitation	J. Ulintz, «Improved Classification of Mass Spectrometry Database Search Results Using Newer Machine Learning Approaches,» Molecular & Cellular Proteomics, vol. V, nº 3, pp. 97-509, 2005	spa
dc.source.bibliographicCitation	S. Jeet, «Machine Learning Biogeographic Processes from Biotic Patterns: A New Trait-Dependent Dispersal and Diversification Model with Model Choice By Simulation-Trained Discriminant Analysis,» Systematic Biology, vol. 65, nº 3, pp. 525-55, 2016	spa
dc.source.bibliographicCitation	S. Zhang, «PromPDD, a web-based tool for the prediction, deciphering and design of promiscuous peptides that bind to HLA class I molecules,» Journal of Immunological Methods, pp. 476-489, 2020	spa
dc.source.bibliographicCitation	K. D. R. N. Tomar, «Immunoinformatics: an integrated scenario,» British Society of Immunology, vol. 131, nº 2, p. 153–168, 2010	spa
dc.source.bibliographicCitation	K. D. H. Youngmahn, «Deep convolutional neural networks for pan-specific peptideMHC class I binding prediction,» BMC Bioinformatics, 2017	spa
dc.source.bibliographicCitation	E. Loan, «Building MHC Class II Epitope Predictor Using Machine Learning Approaches,» Springer, vol. 1268, pp. 67-73, 2015	spa
dc.source.bibliographicCitation	T. Alvarez, «NNAlign-MA; MHC peptidome deconvolution for accurate MHC binding motif characterization and improved t-cell epitode,» Molecular and Cellular Proteomic, vol. 18, nº 12, pp. 2459-2477, 2019	spa
dc.source.bibliographicCitation	M. M. H. &. B. M. Nosrati, «Introducing of an integrated artificial neural network and chou's pseudo amino acid composition approach for computational epitopemapping of crimean-congo haemorrhagic fever virus antigens,» International Immunopharmacology, nº 78, 2020	spa
dc.source.bibliographicCitation	S. X. Z. Weilong, «Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes,» Computacional Biology, pp. 1-28, 2018	spa
dc.source.bibliographicCitation	R. D. A.D. Irini, «VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines,» BMC Bioinformatics, pp. 22-26, 2007	spa
dc.source.bibliographicCitation	F. F. W. J. T. Elias, «TIminer: NGS data mining pipeline for cancer immunology and immunotherapy,» Bioinformatics, vol. 33, nº 19, p. 3140–3141, 2017	spa
dc.source.bibliographicCitation	L. T. S. L. H. Olsen, «TANTIGEN: a comprehensive database of tumor T cell antigens,» Cancer Immunol Immunother, nº 66, 2017	spa
dc.source.bibliographicCitation	C. L. S. S. R. L. G. a. L. Pommié, «IMGT standardized criteria for statistical analysis of immunoglobulin V‐REGION amino acid properties,» IMGT, nº 17, pp. 17-32, 2004	spa
dc.source.bibliographicCitation	J.-L. C. M. K. L. B. V. A. &. P. V. FAUCHÈRE, «Amino acid side chain parameters for correlation studies in biology and pharmacology,» International Journal of Peptide and Protein Research, nº 32(4), p. 269–278, 2009	spa
dc.source.bibliographicCitation	ImMunoGeneTics, «Amino acids,» 20 04 2004. [En línea]. Available: http://www.imgt.org/IMGTeducation/Aidememoire/_UK/aminoacids/IMGTclasses.html	spa
dc.source.bibliographicCitation	Q. B. A. G. K. Maricel, «Optimization of a new score function for the detection of remote homologs,» Proteins: Structure, Function, and Bioinformatics, vol. XLI, nº 4, pp. 498-503, 2000	spa
dc.source.bibliographicCitation	k. S.-H. T. H. D. G. Eleni, «Verification and validation of bioinformatics software without a gold standard: a case study of BWA and Bowtie,» BMC Bioinformatics, vol. XV, nº 16, 2014	spa
dc.source.bibliographicCitation	R. T. J. F. T. Hastie, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Segunda ed., Stanford, California: Springer, 2008	spa
dc.source.bibliographicCitation	G. Thomas, «Machine Learning Bias, Statistical Bias, and Statistical Variance of Decision Tree Algorithms,» 5 1995. [En línea]. Available: http://www.cems.uwe.ac.uk/~irjohnso/coursenotes/uqc832/tr-bias.pdf	spa
dc.source.bibliographicCitation	SciKit Learn, «sklearn.ensemble.RandomForestClassifier,» 2019. [En línea]. Available: https://scikitlearn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.ht ml	spa
dc.source.bibliographicCitation	S. Haykin, Neural Network and Learning Machines, Tercera ed., Boston: Person, 2009	spa
dc.source.bibliographicCitation	F. J. &. P.-M. C. Valverde-Albacete, «100% classification accuracy considered harmful: the normalized information transfer factor explains the accuracy paradox.,» Plos One, vol. IX, nº 1, 2014	spa
dc.source.bibliographicCitation	Scikit Learn, «GridSearchCV,» 2 Enero 2020. [En línea]. Available: https://scikitlearn.org/stable/modules/generated/sklearn.model_selection.GridSearchCV.html# sklearn.model_selection.GridSearchCV	spa
dc.source.bibliographicCitation	C. C. H. S. M. S. Ciampi A., «Recursive Partition: A Versatile Method for Exploratory-Data Analysis in Biostatistics,» Springer, vol. XXXVII, 1999	spa
dc.source.bibliographicCitation	B. Y. B. James, «Random Search for Hyper-Parameter Optimization,» Journal of Machine Learning Research, vol. XIII, pp. 281-305, 2012	spa
dc.source.bibliographicCitation	T. S. H. T. A. ENDO, «Comparison of Seven Algorithms to Predict Breast Cancer Survival,» International Journal of Biomedical Soft Computing and Human Sciences: the official journal of the Biomedical Fuzzy Systems Association, vol. XIII, nº 2, pp. 11-16, 2008	spa
dc.source.bibliographicCitation	M. J. D. A. F.F. João, «PLEURAL TUBERCULOSIS DIAGNOSIS BASED ON ARTIFICIAL,» Sociedade Brasileira de Inteligência Computacional, 2011	spa
dc.source.bibliographicCitation	Kakau, «ROC curves,» 17 06 2010. [En línea]. Available: http://creativecommons.org/licenses/by-sa/3.0/	spa
dc.source.bibliographicCitation	SciKit Learn, «sklearn.neural_network.MLPClassifier,» 20 Enero 2020. [En línea]. Available: https://scikitlearn.org/stable/modules/generated/sklearn.neural_network.MLPClassifier.html#s klearn.neural_network.MLPClassifier.predict_proba	spa
dc.source.bibliographicCitation	C. L. D. I. Seixas JM, «Relevance criteria for variable selection in classifier design. In: International conference on engineering applications of neural networks,» de International conference on engineering applications of neural networks, London, 1996	spa
dc.source.bibliographicCitation	F. S. T. R. C. P. J. V. K. A. L. S. J. M. &. M. F. C. Aguiar, «Development of two artificial neural network models to support the diagnosis of pulmonary tuberculosis in hospitalized patients in Rio de Janeiro,» Medical & biological engineering & computing, vol. 54, nº 11, pp. 1751-1759, 2016	spa
dc.source.bibliographicCitation	M. L. F. W. N.A Obuchowski, «ROC curves in clinical chemistry: uses, misuses, and possible solutions.,» Clenecal Chemistry, vol. L, nº 7, pp. 18-25, 2004	spa
dc.source.bibliographicCitation	N. Tkachev, «Flexible data trimming improves performance of global machine learning methods in omics-based personalized oncology,» International Journal of Molecular Sciences, vol. XXI, nº 3, 2020	spa
dc.source.bibliographicCitation	L. Breiman, «Random forests,» Springer Netherlands, vol. XLV, nº 1, pp. 5-32, 2001	spa
dc.source.instname	instname:Universidad del Rosario	spa
dc.source.instname	instname:Universidad del Rosario	spa
dc.source.reponame	reponame:Repositorio Institucional EdocUR
dc.subject	Antígenos	spa
dc.subject	Aprendizaje automático	spa
dc.subject	Cáncer	spa
dc.subject.ddc	Incidencia & prevención de la enfermedad	spa
dc.subject.ddc	Sistemas	spa
dc.subject.keyword	Antigen	spa
dc.subject.keyword	Cancer	spa
dc.subject.keyword	Machine Learning	spa
dc.title	Comparación de modelos de aprendizaje automático para la predicción de células cancerígenas a partir del complejo MHC I	spa
dc.title.TranslatedTitle	Comparison of machine learning models for the prediction of cancer cells from the MHC I complex	eng
dc.title.alternative	Predicción de células cancerígenas	spa
dc.type	bachelorThesis	eng
dc.type.document	Análisis de caso	spa
dc.type.hasVersion	info:eu-repo/semantics/acceptedVersion
dc.type.spa	Trabajo de grado	spa
local.department.report	Escuela de Medicina y Ciencias de la Salud	spa

Archivos

Bloque original

Mostrando1 - 1 de 1

Nombre:: Comparacion-de-modelos-de-aprendizaje-automatico-para-la-prediccion-de-celulas-cancerigenas-a-partir-del-complejo-MHC-I.pdf
Tamaño:: 1.64 MB
Formato:: Adobe Portable Document Format
Descripción:

Descargar