Publications des scientifiques de l'IRD

Happi Happi Bill gates, Pelap G. F., Symeonidou D., Larmande Pierre. (2025). GRU-SCANET: unleashing the power of GRU-based sinusoidal capture network for precision-driven named entity recognition. Bioinformatics Advances, 5 (1), p. vbaf096 [10 p.].

Titre du document
GRU-SCANET: unleashing the power of GRU-based sinusoidal capture network for precision-driven named entity recognition
Année de publication
2025
Type de document
Article référencé dans le Web of Science WOS:001516593600001
Auteurs
Happi Happi Bill gates, Pelap G. F., Symeonidou D., Larmande Pierre
Source
Bioinformatics Advances, 2025, 5 (1), p. vbaf096 [10 p.]
Motivation Pre-trained Language Models (PLMs) have achieved remarkable performance across various natural language processing tasks. However, they encounter challenges in biomedical named entity recognition (NER), such as high computational costs and the need for complex fine-tuning. These limitations hinder the efficient recognition of biological entities, especially within specialized corpora. To address these issues, we introduce GRU-SCANET (Gated Recurrent Unit-based Sinusoidal Capture Network), a novel architecture that directly models the relationship between input tokens and entity classes. Our approach offers a computationally efficient alternative for extracting biological entities by capturing contextual dependencies within biomedical texts.Results GRU-SCANET combines positional encoding, bidirectional GRUs (BiGRUs), an attention-based encoder, and a conditional random field (CRF) decoder to achieve high precision in entity labeling. This design effectively mitigates the challenges posed by unbalanced data across multiple corpora. Our model consistently outperforms leading benchmarks, achieving better performance than BioBERT (8/8 evaluations), PubMedBERT (5/5 evaluations), and the previous state-of-the-art (SOTA) models (8/8 evaluations), including Bern2 (5/5 evaluations). These results highlight the strength of our approach in capturing token-entity relationships more effectively than existing methods, advancing the state of biomedicalNER.Availability and implementation https://github.com/ANR-DIG-AI/GRU-SCANET.
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020]
Localisation
Fonds IRD [F B010094282]
Identifiant IRD
fdi:010094282
Contact