Performance of multivariate mutual information and autocorrelation encoding methods for the prediction of protein-protein interactions

Penulis: Bustamam, Alhadi; Sunggawa, Mohamad Irlin; Siswantining, Titin
Informasi
JurnalIAES International Journal of Artificial Intelligence
PenerbitInstitute of Advanced Engineering and Science, IAES Institute of Advanced Engineering and Science
Volume & EdisiVol. 11,Edisi 2
Halaman773 - 786
Tahun Publikasi2022
ISSN20894872
Jenis SumberScopus
Sitasi
Scopus: 3
Google Scholar: 3
PubMed: 3
Abstrak
Protein interactions play an essential role in the study of how an organism can be infected with a disease and also its effects. One of the challenges in computational methods in the prediction of protein-protein interactions is how to represent a sequence of amino acids in a vector so that it can be used in machine learning to create a model that can predict whether or not an interaction occurs in a protein pair. This paper examined the qualitative feature encoding methods of amino acid sequence, namely, multivariate mutual information (MMI), and the quantitative feature encoding methods, namely, autocorrelation. We develop the new design for MMI and autocorrelation feature encoding methods which give better results than the previous research. There are four ways to build the MMI method and six ways to build the autocorrelation method that we tested. We also built four types of MMI-autocorrelation (mixed) method and look for the best form of each type of MMI, autocorrelation, and mixed-method. We combine these feature encoding methods with support vector machine (SVM) as machine learning methods. We also test the encoding methods we propose to several machine learning classifier methods, such as random forest (RF), k-nearest neighbor (KNN), and gradient boosting. © 2022, Institute of Advanced Engineering and Science. All rights reserved.
Dokumen & Tautan

© 2025 Universitas Indonesia. Seluruh hak cipta dilindungi.