<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>Graph embeddings meet link keys discovery for entity matching</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Jradeh</namePart>
      <namePart type="given">C. K.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Raoufi</namePart>
      <namePart type="given">E.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">David</namePart>
      <namePart type="given">J.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Larmande</namePart>
      <namePart type="given">Pierre</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Scharffe</namePart>
      <namePart type="given">F.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Todorov</namePart>
      <namePart type="given">K.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Trojahn</namePart>
      <namePart type="given">C.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">journalArticle</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>Entity Matching (EM) automates the discovery of identity links between entities within different Knowledge Graphs (KGs). Link keys are crucial for EM, serving as rules allowing to identify identity links across different KGs, possibly described using different ontologies. However, the approach for extracting link keys struggles to scale on large KGs. While embedding-based EM methods efficiently handle large KGs they lack explainability. This paper proposes a novel hybrid EM approach to guarantee the scalability link key extraction approach and improve the explainability of embeddingbased EM methods. First, embedding-based EM approaches are used to sample the KGs based on the identity links they generate, thereby reducing the search space to relevant sub-graphs for link key extraction. Second, rules (in the form of link keys) are extracted to explain the generation of identity links by the embedding-based methods. Experimental results demonstrate that the proposed approach allows link key extraction to scale on large KGs, preserving the quality of the extracted link keys. Additionally, it shows that link keys can improve the explainability of the identity links generated by embedding-methods, allowing for the regeneration of 77% of the identity links produced for a specific EM task, thereby providing an approximation of the reasons behind their generation.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <subject>
      <topic>Entity matching</topic>
      <topic>Knowledge graphs</topic>
      <topic>Link keys</topic>
      <topic>Embedding-based EM</topic>
      <topic>Symbolic EM</topic>
      <topic>Graph embeddings</topic>
      <topic>Language models</topic>
      <topic>Hybrid AI</topic>
    </subject>
    <classification authority="local">020</classification>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the ACM Web Conference 2025, WWW 2025</title>
      </titleInfo>
      <part>
        <extent unit="pages">
          <list> 3344-3353</list>
        </extent>
      </part>
      <originInfo>
        <dateIssued>2025</dateIssued>
      </originInfo>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010094312</identifier>
    <identifier type="doi">10.1145/3696410.3714581</identifier>
    <location>
      <shelfLocator>[F B010094312]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010094312</url>
      <url access="row object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/2025-08/010094312.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2025-08-27</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2026-06-12</recordChangeDate>
      <recordIdentifier>fdi:010094312</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
