<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>Analysis of the performance of representation learning methods for entity alignment : benchmark versus real-world data</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Raoufi</namePart>
      <namePart type="given">E.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Happi Happi</namePart>
      <namePart type="given">Bill Gates</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Larmande</namePart>
      <namePart type="given">Pierre</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Scharffe</namePart>
      <namePart type="given">F.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Todorov</namePart>
      <namePart type="given">K.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">journalArticle</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>Representation learning for entity alignment (EA) aims to identify entities in different knowledge graphs (KGs) that refer to the same real-world object by comparing their embedding similarity. Although many EA models perform well on synthetic benchmark datasets, this performance does not always transfer to real-world, incomplete, and domain-specific data. A systematic comparison between synthetic benchmarks and original heterogeneous datasets is still limited. Many EA models also restrict the alignment search space to validation entities, limiting coverage of real KG content. Within this setting, our results show that embedding-based EA models continue to face generalization challenges in realistic large-scale KG search spaces. We evaluate several competitive EA models-commonly tested on benchmarks such as DBP15K-on multiple real-world heterogeneous datasets. The experiments reveal a performance decrease when moving beyond synthetic benchmarks, indicating that current models do not fully capture the characteristics of real data. We also analyze semantic similarity and profiling features of the datasets to help explain these differences. This study outlines practical limitations of embedding-based EA methods and provides insights for developing approaches that better handle the variability and complexity found in real-world KG alignment tasks.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <subject>
      <topic>entity alignment</topic>
      <topic>knowledge graphs</topic>
      <topic>representation learning</topic>
      <topic>knowledge graph heterogeneity</topic>
      <topic>EA benchmarks</topic>
    </subject>
    <classification authority="local">122</classification>
    <classification authority="local">020</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>Semantic Web</title>
      </titleInfo>
      <part>
        <detail type="volume">
          <number>17</number>
        </detail>
        <detail type="volume">
          <number>1</number>
        </detail>
        <extent unit="pages">
          <list>09217134251389825 [24 ]</list>
        </extent>
      </part>
      <originInfo>
        <dateIssued>2026</dateIssued>
      </originInfo>
      <identifier type="issn">1570-0844</identifier>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010095821</identifier>
    <identifier type="doi">10.1177/09217134251389825</identifier>
    <identifier type="issn">1570-0844</identifier>
    <location>
      <shelfLocator>[F B010095821]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010095821</url>
      <url access="row object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/2026-01/010095821.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2026-01-14</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2026-01-19</recordChangeDate>
      <recordIdentifier>fdi:010095821</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
