<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>Using unlabeled data to discover bivariate causality with deep restricted Boltzmann machines</title>
    </titleInfo>
    <name type="personal">
      <namePart type="family">Sokolovska</namePart>
      <namePart type="given">N.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personal">
      <namePart type="family">Permiakova</namePart>
      <namePart type="given">O.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personal">
      <namePart type="family">Forslund</namePart>
      <namePart type="given">S. K.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personal">
      <namePart type="family">Zucker</namePart>
      <namePart type="given">Jean-Daniel</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">journalArticle</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>application/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>An important question in microbiology is whether treatment causes changes in gut flora, and whether it also affects metabolism. The reconstruction of causal relations purely from non-temporal observational data is challenging. We address the problem of causal inference in a bivariate case, where the joint distribution of two variables is observed. We consider, in particular, data on discrete domains. The state-of-the-art causal inference methods for continuous data suffer from high computational complexity. Some modern approaches are not suitable for categorical data, and others need to estimate and fix multiple hyper-parameters. In this contribution, we introduce a novel method of causal inference which is based on the widely used assumption that if X causes Y, then P(X) and P(Y|X) are independent. We propose to explore a semi-supervised approach where P(Y|X) and P(X) are estimated from labeled and unlabeled data respectively, whereas the marginal probability is estimated potentially from much more (cheap unlabeled) data than the conditional distribution. We validate the proposed method on the standard cause-effect pairs. We illustrate by experiments on several benchmarks of biological network reconstruction that the proposed approach is very competitive in terms of computational time and accuracy compared to the state-of-the-art methods. Finally, we apply the proposed method to an original medical task where we study whether drugs confound human metagenome.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <subject>
      <topic>Causal inference</topic>
      <topic>semi-supervised learning</topic>
      <topic>probabilistic models</topic>
      <topic>metagenomic data</topic>
    </subject>
    <classification authority="local">020</classification>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>IEEE/ACM Transactions on Computational Biology and Bioinformatics</title>
      </titleInfo>
      <part>
        <detail type="volume">
          <number>17</number>
        </detail>
        <detail type="issue">
          <number>1</number>
        </detail>
        <extent unit="pages">
          <list>358-364</list>
        </extent>
      </part>
      <originInfo>
        <dateIssued>2020</dateIssued>
      </originInfo>
      <identifier type="issn">1545-5963</identifier>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/PAR00020756</identifier>
    <identifier type="doi">10.1109/tcbb.2018.2879504</identifier>
    <identifier type="issn">1545-5963</identifier>
    <location>
      <shelfLocator>[F B010080283]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/PAR00020756</url>
      <url access="raw object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers21-04/010080283.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2020-06-09</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2025-02-24</recordChangeDate>
      <recordIdentifier>PAR00020756</recordIdentifier>
      <languageOfCataloging>
        <languageTerm type="code" authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
