<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>M2-Mixer : a multimodal mixer with multi-head loss for classification from multimodal data</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Bezirganyan</namePart>
      <namePart type="given">G.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Sellami</namePart>
      <namePart type="given">S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Berti-Equille</namePart>
      <namePart type="given">Laure</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Fournier</namePart>
      <namePart type="given">S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">bookSection</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>born digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>In this paper, we propose M2-Mixer, an MLP-Mixer based architecture with multi-head loss for multimodal classification. It achieves better performances than the convolutional, recurrent, or neural architecture search based baseline models with the main advantage of conceptual and computational simplicity. The proposed multi-head loss function addresses the problem of modality predominance (i.e., when one of the modalities is favored over the others by the training algorithm). Our experiments demonstrate that our multimodal mixer architecture, combined with the multi-head loss function, outperforms the baseline models on two benchmark multimodal datasets: AVMNIST and MIMIC-III with respectively, on average, + 0.43% in accuracy and 6. 4 times reduction in training time and + 0.33% in accuracy and 13. 3 times reduction in training time, compared with previous best performing models.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>2023 IEEE International Conference on Big Data</title>
      </titleInfo>
      <name type="personnal">
        <namePart type="family">He</namePart>
        <namePart type="given">J.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Palpanas</namePart>
        <namePart type="given">T.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Cuzzocrea</namePart>
        <namePart type="given">A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Dou</namePart>
        <namePart type="given">D.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Slezak</namePart>
        <namePart type="given">D.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Wang</namePart>
        <namePart type="given">W.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Gruca</namePart>
        <namePart type="given">A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Chun-Wei</namePart>
        <namePart type="given">J.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Agrawal</namePart>
        <namePart type="given">R.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <part>
        <extent unit="pages">
          <list>1052-1058</list>
        </extent>
      </part>
      <originInfo>
        <place type="text">
          <placeTerm>Piscataway</placeTerm>
        </place>
        <publisher>IEEE</publisher>
        <dateIssued key="date">2023</dateIssued>
      </originInfo>
      <name type="conference">
        <namePart>IEEE International Conference on Big Data, Sorrento (ITA), 2023/12/15-2023/12/18</namePart>
      </name>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010090983</identifier>
    <identifier type="doi">10.1109/BigData59044.2023.10386252</identifier>
    <identifier type="isbn">979-8-3503-2446-4</identifier>
    <location>
      <shelfLocator>[F B010090983]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010090983</url>
      <url access="row object">https://www.documentation.ird.fr/intranet/publi/2024-08/010090983.pdf</url>
    </location>
    <accessCondition type="restriction access" displayLabel="Accès réservé">Accès réservé (Intranet de l'IRD)</accessCondition>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2024-06-28</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2025-02-24</recordChangeDate>
      <recordIdentifier>fdi:010090983</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
