Yayın:
LSTM-GRU Based Deep Learning Model with Word2Vec for Transcription Factors in Primates

dc.contributor.authorÖNCÜL, Ali Burak
dc.date.accessioned2026-01-04T18:21:31Z
dc.date.issued2023-01-30
dc.description.abstractThe study of the structures of proteins and the relationships of amino acids remains a challenging problem in biology. Although some bioinformatics-based studies provide partial solutions, some major problems remain. At the beginning of these problems are the logic of the sequence of amino acids and the diversity of proteins. Although these variations are biologically detectable, these experiments are costly and time-consuming. Considering that there are many unclassified sequences in the world, it is inevitable that a faster solution must be found. For this reason, we propose a deep learning model to classify transcription factor proteins of primates. Our model has a hybrid structure that uses Recurrent Neural Network (RNN) based Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks with Word2Vec preprocessing step. Our model has 97.96% test accuracy, 97.55% precision, 95.26% recall, 96.22% f1-score. Our model was also tested with 5-fold cross-validation and reached 97.42% result. In the prepared model, LSTM was used in layers with fewer units, and GRU was used in layers with more units, and it was aimed to make the model a model that can be trained and run as quickly as possible. With the added dropout layers, the overfitting problem of the model is prevented.
dc.description.urihttps://doi.org/10.17694/bajece.1191009
dc.description.urihttps://dergipark.org.tr/tr/pub/bajece/issue/75680/1191009
dc.identifier.doi10.17694/bajece.1191009
dc.identifier.endpage49
dc.identifier.issn2147-284X
dc.identifier.openairedoi_dedup___::0ad96da16edf2bfaa675f80e47fa17d1
dc.identifier.orcid0000-0001-9612-1787
dc.identifier.startpage42
dc.identifier.urihttps://hdl.handle.net/20.500.12597/40478
dc.identifier.volume11
dc.publisherBalkan Journal of Electrical & Computer Engineering (BAJECE)
dc.relation.ispartofBalkan Journal of Electrical and Computer Engineering
dc.rightsOPEN
dc.subjectYapay Zeka
dc.subjectArtificial Intelligence
dc.subjectProtein classification
dc.subjectProtein classification
dc.subjectHybrid deep learning
dc.subjectWord2Vec
dc.subjectLSTM
dc.subjectGRU
dc.titleLSTM-GRU Based Deep Learning Model with Word2Vec for Transcription Factors in Primates
dc.typeArticle
dspace.entity.typePublication
local.api.response{"authors":[{"fullName":"Ali Burak ÖNCÜL","name":"Ali Burak","surname":"ÖNCÜL","rank":1,"pid":{"id":{"scheme":"orcid","value":"0000-0001-9612-1787"},"provenance":null}}],"openAccessColor":"gold","publiclyFunded":false,"type":"publication","language":{"code":"und","label":"Undetermined"},"countries":null,"subjects":[{"subject":{"scheme":"keyword","value":"Yapay Zeka"},"provenance":null},{"subject":{"scheme":"keyword","value":"Artificial Intelligence"},"provenance":null},{"subject":{"scheme":"keyword","value":"Protein classification;Protein classification;Hybrid deep learning;Word2Vec;LSTM;GRU"},"provenance":null}],"mainTitle":"LSTM-GRU Based Deep Learning Model with Word2Vec for Transcription Factors in Primates","subTitle":null,"descriptions":["<jats:p xml:lang=\"en\">The study of the structures of proteins and the relationships of amino acids remains a challenging problem in biology. Although some bioinformatics-based studies provide partial solutions, some major problems remain. At the beginning of these problems are the logic of the sequence of amino acids and the diversity of proteins. Although these variations are biologically detectable, these experiments are costly and time-consuming. Considering that there are many unclassified sequences in the world, it is inevitable that a faster solution must be found. For this reason, we propose a deep learning model to classify transcription factor proteins of primates. Our model has a hybrid structure that uses Recurrent Neural Network (RNN) based Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks with Word2Vec preprocessing step. Our model has 97.96% test accuracy, 97.55% precision, 95.26% recall, 96.22% f1-score. Our model was also tested with 5-fold cross-validation and reached 97.42% result. In the prepared model, LSTM was used in layers with fewer units, and GRU was used in layers with more units, and it was aimed to make the model a model that can be trained and run as quickly as possible. With the added dropout layers, the overfitting problem of the model is prevented.</jats:p>"],"publicationDate":"2023-01-30","publisher":"Balkan Journal of Electrical & Computer Engineering (BAJECE)","embargoEndDate":null,"sources":["Crossref","Volume: 11, Issue: 1 42-49","2147-284X","Balkan Journal of Electrical and Computer Engineering"],"formats":["application/pdf"],"contributors":null,"coverages":null,"bestAccessRight":{"code":"c_abf2","label":"OPEN","scheme":"http://vocabularies.coar-repositories.org/documentation/access_rights/"},"container":{"name":"Balkan Journal of Electrical and Computer Engineering","issnPrinted":"2147-284X","issnOnline":null,"issnLinking":null,"ep":"49","iss":null,"sp":"42","vol":"11","edition":null,"conferencePlace":null,"conferenceDate":null},"documentationUrls":null,"codeRepositoryUrl":null,"programmingLanguage":null,"contactPeople":null,"contactGroups":null,"tools":null,"size":null,"version":null,"geoLocations":null,"id":"doi_dedup___::0ad96da16edf2bfaa675f80e47fa17d1","originalIds":["10.17694/bajece.1191009","50|doiboost____|0ad96da16edf2bfaa675f80e47fa17d1","oai:dergipark.org.tr:article/1191009","50|tubitakulakb::90f8bf8f94973e9b33ecc37c55ea0fb2"],"pids":[{"scheme":"doi","value":"10.17694/bajece.1191009"}],"dateOfCollection":null,"lastUpdateTimeStamp":null,"indicators":{"citationImpact":{"citationCount":4,"influence":2.7634859e-9,"popularity":5.360166e-9,"impulse":4,"citationClass":"C5","influenceClass":"C5","impulseClass":"C5","popularityClass":"C4"}},"instances":[{"pids":[{"scheme":"doi","value":"10.17694/bajece.1191009"}],"type":"Article","urls":["https://doi.org/10.17694/bajece.1191009"],"publicationDate":"2023-01-30","refereed":"peerReviewed"},{"alternateIdentifiers":[{"scheme":"doi","value":"10.17694/bajece.1191009"}],"type":"Article","urls":["https://dergipark.org.tr/tr/pub/bajece/issue/75680/1191009"],"publicationDate":"2022-10-18","refereed":"nonPeerReviewed"}],"isGreen":false,"isInDiamondJournal":false}
local.import.sourceOpenAire

Dosyalar

Koleksiyonlar