Yayın:
UTILIZING ROOTS AND PATTERNS TO IDENTIFY ARABIC NAMED ENTITIES

dc.contributor.authorTOSUN, ALİ RIZA
dc.contributor.authorHANÇRLİOĞULLARI, AYBABA
dc.contributor.authorAHMED, ABDULMONEM
dc.date.accessioned2026-01-04T17:26:06Z
dc.date.issued2022-11-07
dc.description.abstractNamed Entity Recognition NER is a subset of information extraction that seeks to recognize and categorize named things in text data into specified categories, such as people's names, organizations' names, geographic locations, and so on. This task has recently attracted a lot of attention due to the discovery it has the potential to boost the performance of a variety of NLP applications. In the domains of Question Answering and Summarization Systems, Information Retrieval and Extraction, Machine Translation, Video Annotation, Semantic Web Search, and Bioinformatics, the majority of difficulties require named entity recognition. Arabic is an inflectional language, which allows for non-concatenative morphological operations on the root. The purpose of this study is to extract and recognize entity names from Arabic articles. We proposed an algorithm for determining names from roots using patterns. We developed it in Python and leveraged the "pyqt5" visual package to see the results immediately, as well as modify and add patterns easily. To replicate the names, we used a random sample of 400 names and 45 different patterns. The algorithm correctly identified 370 names easily and quickly, yielding a success rate of 93%. All names with the same recognized names will be known in the same way by the method and do not need any manipulation in code or design. The names that are not recognized by our algorithm have no roots in the list of known Arabic roots. Our research shows that the approach can recognize names with roots with high speed and accuracy, but it is not possible to identify nouns that are not in the Arabic language using this method. As a result, we recommend using a hybrid method that incorporates multiple concepts.
dc.description.urihttps://doi.org/10.56557/ajomcor/2022/v29i27922
dc.identifier.doi10.56557/ajomcor/2022/v29i27922
dc.identifier.eissn2395-4213
dc.identifier.endpage42
dc.identifier.openairedoi_________::50fbb6c57a3708c3915afc2547c6d986
dc.identifier.orcid0000-0001-9816-9717
dc.identifier.startpage33
dc.identifier.urihttps://hdl.handle.net/20.500.12597/40121
dc.publisherIK Press
dc.relation.ispartofAsian Journal of Mathematics and Computer Research
dc.titleUTILIZING ROOTS AND PATTERNS TO IDENTIFY ARABIC NAMED ENTITIES
dc.typeArticle
dspace.entity.typePublication
local.api.response{"authors":[{"fullName":"ALİ RIZA TOSUN","name":"ALİ RIZA","surname":"TOSUN","rank":1,"pid":null},{"fullName":"AYBABA HANÇRLİOĞULLARI","name":"AYBABA","surname":"HANÇRLİOĞULLARI","rank":2,"pid":null},{"fullName":"ABDULMONEM AHMED","name":"ABDULMONEM","surname":"AHMED","rank":3,"pid":{"id":{"scheme":"orcid","value":"0000-0001-9816-9717"},"provenance":null}}],"openAccessColor":null,"publiclyFunded":false,"type":"publication","language":{"code":"und","label":"Undetermined"},"countries":null,"subjects":null,"mainTitle":"UTILIZING ROOTS AND PATTERNS TO IDENTIFY ARABIC NAMED ENTITIES","subTitle":null,"descriptions":["<jats:p>Named Entity Recognition NER is a subset of information extraction that seeks to recognize and categorize named things in text data into specified categories, such as people's names, organizations' names, geographic locations, and so on. This task has recently attracted a lot of attention due to the discovery it has the potential to boost the performance of a variety of NLP applications. In the domains of Question Answering and Summarization Systems, Information Retrieval and Extraction, Machine Translation, Video Annotation, Semantic Web Search, and Bioinformatics, the majority of difficulties require named entity recognition. Arabic is an inflectional language, which allows for non-concatenative morphological operations on the root. The purpose of this study is to extract and recognize entity names from Arabic articles. We proposed an algorithm for determining names from roots using patterns. We developed it in Python and leveraged the \"pyqt5\" visual package to see the results immediately, as well as modify and add patterns easily. To replicate the names, we used a random sample of 400 names and 45 different patterns. The algorithm correctly identified 370 names easily and quickly, yielding a success rate of 93%. All names with the same recognized names will be known in the same way by the method and do not need any manipulation in code or design. The names that are not recognized by our algorithm have no roots in the list of known Arabic roots. Our research shows that the approach can recognize names with roots with high speed and accuracy, but it is not possible to identify nouns that are not in the Arabic language using this method. As a result, we recommend using a hybrid method that incorporates multiple concepts.</jats:p>"],"publicationDate":"2022-11-07","publisher":"IK Press","embargoEndDate":null,"sources":["Crossref"],"formats":null,"contributors":null,"coverages":null,"bestAccessRight":null,"container":{"name":"Asian Journal of Mathematics and Computer Research","issnPrinted":null,"issnOnline":"2395-4213","issnLinking":null,"ep":"42","iss":null,"sp":"33","vol":null,"edition":null,"conferencePlace":null,"conferenceDate":null},"documentationUrls":null,"codeRepositoryUrl":null,"programmingLanguage":null,"contactPeople":null,"contactGroups":null,"tools":null,"size":null,"version":null,"geoLocations":null,"id":"doi_________::50fbb6c57a3708c3915afc2547c6d986","originalIds":["10.56557/ajomcor/2022/v29i27922","50|doiboost____|50fbb6c57a3708c3915afc2547c6d986"],"pids":[{"scheme":"doi","value":"10.56557/ajomcor/2022/v29i27922"}],"dateOfCollection":null,"lastUpdateTimeStamp":null,"indicators":{"citationImpact":{"citationCount":0,"influence":2.5349236e-9,"popularity":1.8548826e-9,"impulse":0,"citationClass":"C5","influenceClass":"C5","impulseClass":"C5","popularityClass":"C5"}},"instances":[{"pids":[{"scheme":"doi","value":"10.56557/ajomcor/2022/v29i27922"}],"type":"Article","urls":["https://doi.org/10.56557/ajomcor/2022/v29i27922"],"publicationDate":"2022-11-07","refereed":"peerReviewed"}],"isGreen":false,"isInDiamondJournal":false}
local.import.sourceOpenAire

Dosyalar

Koleksiyonlar