%0 Journal Article
%9 ACL: Articles in peer-reviewed journals listed by AERES
%A Raoufi, E.
%A Happi Happi, Bill Gates
%A Larmande, Pierre
%A Scharffe, F.
%A Todorov, K.
%T Analysis of the performance of representation learning methods for entity alignment: benchmark versus real-world data
%D 2026
%L fdi:010095821
%G ENG
%J Semantic Web
%@ 1570-0844
%K entity alignment ; knowledge graphs ; representation learning ; knowledge graph heterogeneity ; EA benchmarks
%M ISI:001629059600001
%N 1
%P 09217134251389825 [24]
%R 10.1177/09217134251389825
%U https://www.documentation.ird.fr/hor/fdi:010095821
%> https://horizon.documentation.ird.fr/exl-doc/pleins_textes/2026-01/010095821.pdf
%V 17
%W Horizon (IRD)
%X Representation learning for entity alignment (EA) aims to identify entities in different knowledge graphs (KGs) that refer to the same real-world object by comparing their embedding similarity. Although many EA models perform well on synthetic benchmark datasets, this performance does not always transfer to real-world, incomplete, and domain-specific data. A systematic comparison between synthetic benchmarks and original heterogeneous datasets is still limited. Many EA models also restrict the alignment search space to validation entities, limiting coverage of real KG content. Within this setting, our results show that embedding-based EA models continue to face generalization challenges in realistic large-scale KG search spaces. We evaluate several competitive EA models, commonly tested on benchmarks such as DBP15K, on multiple real-world heterogeneous datasets. The experiments reveal a performance decrease when moving beyond synthetic benchmarks, indicating that current models do not fully capture the characteristics of real data. We also analyze semantic similarity and profiling features of the datasets to help explain these differences. This study outlines practical limitations of embedding-based EA methods and provides insights for developing approaches that better handle the variability and complexity found in real-world KG alignment tasks.
%$ 122 ; 020